I'm finding SIDs in Index.php


Post by Peter77 » Sat Jul 01, 2006 11:14 pm

I've been running into links on Yahoo search pages, for example, that show my forum's index.php with SIDs appended. I even recognize some of the cached content as being from not too long ago, which is how I know these are not old links. Is it normal for index.php to still show up with SIDs, or does it mean something could be wrong in my functions.php? (cyberalien MOD)
Last edited by Peter77 on Tue Dec 19, 2006 5:04 pm, edited 1 time in total.
Peter77
phpBB SEO Team
Posts: 532
Joined: Wed May 10, 2006 9:46 am

Post by dcz » Sat Jul 01, 2006 11:24 pm

If they only concern index.php, then they must have been spidered before you implemented the guest session mod to get rid of SIDs.

Make sure you have:

Code:
Disallow: /phpbb/index.php?

in your robots.txt file; that is the main purpose of this rule. I have observed that Yahoo keeps old URLs in its listings for quite a while, but they should not stay in its cache for long once they are disallowed in robots.txt.

Google and MSN should drop them faster, if they ever had them in cache.
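
As a rough illustration of why the trailing "?" matters, here is a minimal Python sketch of plain prefix matching, which is how the original robots.txt convention compares a Disallow value with a URL. The helper name is_disallowed and the sample URLs are made up for this example, and real crawlers may apply extra rules (wildcards, Allow lines) that are not modelled here.

```python
# Simplified sketch of robots.txt prefix matching (an assumption for
# illustration, not any search engine's actual implementation).

def is_disallowed(url_path, disallow_rules):
    """Return True if url_path starts with any non-empty Disallow prefix."""
    return any(bool(rule) and url_path.startswith(rule) for rule in disallow_rules)

# The rule from the post above.
rules = ["/phpbb/index.php?"]

# Hypothetical URLs: path plus query string, without the host name.
print(is_disallowed("/phpbb/index.php?sid=abc123", rules))  # index with a SID
print(is_disallowed("/phpbb/index.php", rules))             # clean index
print(is_disallowed("/phpbb/viewtopic.php?t=1", rules))     # another script
```

Under this simplified model only the first URL is blocked: the clean index.php has no "?" and so does not start with the rule, which is what keeps the forum index itself crawlable.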

++
Useful links :
SEO Forum || SEO Directory || SEO phpBB || Search
____________________

Liens Utiles :
Forum référencement || Annuaire référencement || Référencement phpBB || Recherche
dcz
Admin
Posts: 21407
Joined: Fri Apr 28, 2006 9:03 pm

Post by Peter77 » Sat Jul 01, 2006 11:33 pm

Okay, I will add that. Should I do the same for my portal? In old indexed pages of my site I see
?start=10?sid=c-ect- and I wonder where that "start" comes from. I know lmsg_start comes from the 'x' next topics in the recently posted list, but "start="? :?:
Last edited by Peter77 on Thu Apr 10, 2008 1:41 pm, edited 1 time in total.

Post by dcz » Sat Jul 01, 2006 11:36 pm

This is the last message pagination (the yellow up and down arrows), when the portal is accessed through www.example.com/ and not www.example.com/index.php.

Otherwise the link would have mentioned index.php.

You can thus add:

Code:
Disallow: /mxbb/?

in your robots.txt ;)

Post by Peter77 » Sat Jul 01, 2006 11:40 pm

That's it... I couldn't think of the name. :D

Okay, thank you very much! This should about do it. :)

Post by Peter77 » Mon Jul 03, 2006 4:03 am

Wait a minute... what do you mean by "/mxbb/?"? My mxBB is installed at my domain root, so would I use "Disallow: index.php?" instead? :?:

Hmm, that has to be it... but please correct me if I'm wrong.

Thanks.

Post by dcz » Mon Jul 03, 2006 9:54 am

Code:
Disallow: /mxbb/?
is not the same as
Code:
Disallow: /mxbb/
;)

For mxBB you don't want to disallow "index.php?", because that would block the mxBB pages themselves, which you don't want.

Here, we are only talking about links such as this one: www.example.com/mxbb/?var=bla&bli=blo

So far, those can only come from the last msg module ;)
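
To make the difference between these rules concrete, here is a small Python sketch that applies plain prefix matching to a few sample URLs. The helper blocked_by and the URLs (including the page=2 parameter) are hypothetical, and the sketch assumes a crawler that matches a Disallow value as a simple prefix of the path and query string.

```python
# Simplified prefix matching: a rule blocks a URL when the URL's path
# (including any query string) starts with the Disallow value.
def blocked_by(rule, url_paths):
    """Return the URLs from url_paths that the given Disallow prefix blocks."""
    return [path for path in url_paths if path.startswith(rule)]

# Hypothetical mxBB URLs, for illustration only.
urls = [
    "/mxbb/",                  # portal front page
    "/mxbb/index.php?page=2",  # a regular portal page
    "/mxbb/?var=bla&bli=blo",  # last msg pagination link
]

# Compare the three candidate rules discussed in the thread.
for rule in ("/mxbb/?", "/mxbb/", "/mxbb/index.php?"):
    print(rule, "->", blocked_by(rule, urls))
```

With these samples, only "/mxbb/?" singles out the pagination link; "/mxbb/" blocks all three URLs and "/mxbb/index.php?" blocks the regular portal page. Under the same assumption, a portal installed at the site root would presumably use "Disallow: /?" for the same effect.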

Post by Peter77 » Sun Jul 16, 2006 11:14 am

The only reason I think search engines are still finding SIDs is that on a search results page I will see text content that I recognize as recently written, and below it there is a URL with a SID on it. Could it be that the preview content has nothing to do with the link? You've mentioned that Google, for example, holds cached pages from the past 'x' months.

Post by dcz » Sun Jul 16, 2006 3:57 pm

Actually, I think that is the case: Google refreshed the content of links it had once spidered with SIDs. The Disallow in robots.txt should fix this eventually ;)

++

