Google Sitemap module update to test
dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Tue Nov 07, 2006 11:56 pm    Post subject: Google Sitemap module update to test

Hello,

So here it is: we will soon have a beta of the new Google Sitemaps solution for phpBB forums.
I say new because I think I'll rename it to something like Ultimate Google sitemaps & RSS, because yes, there are RSS feeds, and also, though it's less important, a nice Yahoo! urllist.txt.

Here it goes.

Cache :
A complete cache system, configurable from the ACP.
All the maps (sitemaps, RSS and urllist.txt) are saved in full in a folder.
When a cached file is up to date and available, the module just sends it as is to the browser without further processing, making the output very fast, comparable to direct access to a physical file.

Let's talk about timing a bit ;)

    First page load : the cache is being built.
    Quote:
    <!-- URL list generated in 5.41892 s - 25 sql - 11834 URLs listed -->
    <!-- Output started from cache after 5.42756 s - sql -->
    <!-- Output from cache ended up after 6.93087 s - sql -->


    This means that the module builds a list of 11834 URLs in 5.41892 s, and that the cache file (2,119,631 bytes) is saved in 0.00864 s, the file being sent to the browser right after saving.
    The output ended 6.93087 s after it was requested.

    Second load, and the following ones until the cache expires (configurable in the ACP) :
    And it's more interesting :
    Quote:
    <!-- URL list generated in 5.41892 s - 25 sql - 11834 URLs listed -->
    <!-- Output started from cache after 0.00256 s - sql -->
    <!-- Output from cache ended up after 1.57475 s - sql -->


    The first line is itself cached, as a reminder of how hard it was to build such a long list before sending it. Compare that with the 0.00256 s needed here to start the output :D
    The file transfer is then relatively long, but the file is 2 MB, and if we take into account the large number of URLs and all the work we're asking of the browser (we do, and I'll talk about this right after), it's pretty fast.

    Because of this, gzip compression is very powerful. The module is able both to save and to output gzipped data. Our 2 MB here becomes 48 KB.

    As the cached file is again sent as is to the browser, outputting a Google sitemap listing 11834 URLs becomes really fast, more or less as fast as sending a 48 KB gif file :D
    Unfortunately, there are no stats available for this output; the function used to read and send a gzipped file seems to make that impossible. But it is certainly a lot faster, even though the browser has even more work to do, since it is the one decompressing the file, which could make the page show up a bit after it was fully sent.

The cache can be updated automatically, once a configurable period of time has passed between two requests for the same file, or manually in the ACP.
Each type of output page (sitemaps, RSS 2.0 feeds and urllist.txt) has its own cache time limit, configurable in the ACP.
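
As a rough illustration of the cache behaviour described above (this is not the module's code, which is PHP; the function name `serve_cached` and its parameters are made up for the sketch), a time-limited file cache can look like this in Python:

```python
import os
import time
from typing import Callable


def serve_cached(path: str, max_age: float, build: Callable[[], bytes]) -> bytes:
    """Return the cached bytes at `path`, rebuilding them when missing or stale.

    `max_age` plays the role of the per-type cache time limit set in the ACP.
    """
    try:
        fresh = (time.time() - os.path.getmtime(path)) < max_age
    except OSError:
        fresh = False  # no cache file yet
    if fresh:
        with open(path, "rb") as fh:
            return fh.read()  # sent as is, no further processing
    data = build()  # the slow part: SQL queries plus URL list generation
    tmp = path + ".tmp"
    with open(tmp, "wb") as fh:
        fh.write(data)
    os.replace(tmp, path)  # swap the file in so readers never see a partial write
    return data
```

Writing to a temporary file and renaming it is a design choice of the sketch, not something the post states about the module.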

URL rewriting :

You can switch the mod rewrite type in the ACP. So far you can select between the three phpBB SEO mod rewrites, but the code is set up to allow many more URL standards. The module will auto-detect the phpBB SEO mod rewrite in use once those get updated.
Note that title injection does add some weight to the sitemaps. If we continue with our previous example and inject topic titles, we get :
First request : building the cache.
Quote:
<!-- URL list generated in 7.27516 s - 25 sql - 11834 URLs listed -->
<!-- Output started from cache after 7.28377 s - sql -->
<!-- Output from cache ended up after 8.92257 s - sql -->


Generation time is longer: injecting and censoring 11834 topic titles in as many URLs costs 1.85 s.
This can look long, but the list is huge and gets cached. Scaled back to a forum page, that is 0.00784 s for 50 injections, which is after all a very good result.
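
For anyone who wants to check the arithmetic, here it is spelled out in Python, using only the timings quoted in this post (the variable names are just for the example):

```python
urls = 11834
plain_s = 5.41892     # generation time without title injection
injected_s = 7.27516  # generation time with title injection

extra_s = injected_s - plain_s  # total cost of injecting and censoring titles
per_url_s = extra_s / urls      # cost of one injection
per_50_s = per_url_s * 50       # cost for 50 injections

print(f"extra: {extra_s:.2f} s, per 50 titles: {per_50_s:.5f} s")
```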

The second request follows the main idea : be fast :D
Code:
<!-- URL list generated in 7.52122 s - 25 sql - 11834 URLs listed -->
<!-- Output started from cache after 0.00248 s - sql -->
<!-- Output from cache ended up after 1.52267 s - sql -->


Very fast for a 2.4 MB file (the weight of this many titles). This goes down to 285 KB once gzipped.
By the way, we can see here that gzip compression has a harder time with topic titles: every one of them is unique, whereas without them "viewtopic.php?" or "topic" is simply repeated 11834 times.
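
The effect is easy to reproduce. The sketch below compares the gzip ratio of a repetitive URL list with that of URLs carrying made-up titles. Everything in it is an assumption for illustration: the example.com URLs, the list size and the random "titles", which are a worst case compared with real topic titles.

```python
import gzip
import random
import string

random.seed(0)  # keep the made-up titles the same on every run


def fake_title(n_words: int = 5) -> str:
    """Made-up topic title: random lowercase words joined by hyphens."""
    return "-".join(
        "".join(random.choices(string.ascii_lowercase, k=random.randint(3, 9)))
        for _ in range(n_words)
    )


def gzip_ratio(text: str) -> float:
    """Compressed size divided by original size."""
    raw = text.encode("utf-8")
    return len(gzip.compress(raw)) / len(raw)


n = 2000
plain = "\n".join(f"http://www.example.com/viewtopic.php?t={i}" for i in range(n))
titled = "\n".join(f"http://www.example.com/{fake_title()}-t{i}.html" for i in range(n))

print(f"plain URLs:  {gzip_ratio(plain):.1%} of original size")
print(f"titled URLs: {gzip_ratio(titled):.1%} of original size")
```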

RSS 2.0 feeds

This is what made this take a little longer than first expected. A lot of feeds, with a lot of options ;)

Let's proceed with examples.
This is our occasion to talk about what we ask of the browser. All the RSS feeds and the Google sitemaps have their own XSL transformation. This allows a compatible browser to build an HTML page out of the XML code sent. The server just sends an additional stylesheet, and it's the browser that does all the presentation.

And it goes like this: http://www.phpbb-seo.com/sitemaps.xml for the Google SitemapIndex.
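
To show what "sending an additional stylesheet" means in practice, here is a minimal Python sketch that builds a sitemap whose prolog points the browser at an XSL file. It is an illustration only: the function name, the stylesheet file name and the use of the sitemaps.org 0.9 namespace are assumptions, not details taken from the module.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import quoteattr

# Assumed namespace (sitemaps.org 0.9); the module may declare another one.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls: list[str], xsl_href: str) -> str:
    """Return sitemap XML with an xml-stylesheet processing instruction."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for u in urls:
        # ElementTree escapes special characters such as & in the URL text.
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = u
    body = ET.tostring(urlset, encoding="unicode")
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<?xml-stylesheet type="text/xsl" href={quoteattr(xsl_href)}?>\n'
        + body
    )


print(build_sitemap(["http://www.example.com/topic1.html"], "sitemap.xsl"))
```

Bots ignore the stylesheet line and read the XML; only browsers with XSLT support use it to render a page.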

For the RSS feeds: first, I still need to mod my module (lol) to properly handle both forums here, so for now you'll see French posts from the French-speaking forums, but you'll get the idea ;)
Then, as I implemented quite a few, I also implemented a special channel listing all available channels on the same page : http://www.phpbb-seo.com/rss-channels.xml
From there you can explore all the RSS feeds. More a cosmetic feature than a real SEO enhancement, but quite handy.

For the types of RSS feeds :

For all feeds, it's possible to add parameters; so far (I'll change these before release) they are -l, -s and -m.
The first two ask for a longer or a shorter list. The last one asks for feeds with the message content (which can be summarized, configurable in the ACP).
You can add one of the first two to any RSS feed URL, and/or -m, and play with the combinations. It's also possible to output only the last post of each topic and/or to keep the first one.

It goes like this :
http://www.phpbb-seo.com/rss-m.xml
http://www.phpbb-seo.com/rss-l-m.xml
And so on, with -l (long) or -s (short) placed before -m.
These are the rewritten parameters; without URL rewriting they would look like &l, &s and &m.
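
A small Python sketch of how such rewritten names could be decoded into options. It only covers the site-wide names shown above; `FeedOptions` and `parse_feed_name` are hypothetical names, and the real module may well parse its URLs differently.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FeedOptions:
    size: str       # "long", "short" or "default"
    messages: bool  # True when the feed should include message content


# Accepts rss.xml, rss-l.xml, rss-s.xml, rss-m.xml, rss-l-m.xml and rss-s-m.xml.
_FEED_RE = re.compile(r"^rss(?:-(?P<size>[ls]))?(?P<msg>-m)?\.xml$")


def parse_feed_name(name: str) -> Optional[FeedOptions]:
    """Decode a rewritten feed file name, or return None if it is not one."""
    m = _FEED_RE.match(name)
    if m is None:
        return None
    size = {"l": "long", "s": "short", None: "default"}[m.group("size")]
    return FeedOptions(size=size, messages=m.group("msg") is not None)


print(parse_feed_name("rss-l-m.xml"))
```

The pattern enforces the order described above: the size flag, if any, comes before -m.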
The message content output does not break BBCodes, and it parses smilies.
Since XML does not allow raw HTML special characters like < and >, which are needed to activate links in posts, there is a little bit of JavaScript to make it work with Firefox. For once, IE is the easier one to work with.

Yahoo! urllist.txt :

Before I forget : http://www.phpbb-seo.com/urllist.txt
This was not really required, as Yahoo! deals very well with RSS 2.0 (here you start to see the point of long RSS lists with links only), but it's still an extra option, and it's cached.

For now the list grabs the last x posts from each of the public forums, configurable in the ACP.

Small limitation :
If gzip is activated in phpBB, it will be in the module as well; it is however possible to use gzip compression in the module when it's turned off in phpBB.

With gzip, the rewritten URLs take an extra .gz extension. The module is able to check whether gzip is supported by the browser or bot; if it is not, it will either uncompress the cached file or re-cache an uncompressed one (configurable in the ACP) and send it to the browser after an HTTP "307 Temporary Redirect". I think it's a good way to tell that the real one is the other one, but it's very easy to switch to a 301 if required in such cases; we'll see.
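
The check-then-redirect logic can be sketched as follows in Python. This is a simplified illustration, not the module's code: `accepts_gzip` and `respond` are made-up names, the header parsing ignores wildcards, and the response headers other than Location are left out.

```python
def accepts_gzip(accept_encoding: str) -> bool:
    """True when an Accept-Encoding header value lists gzip with q > 0."""
    for part in accept_encoding.split(","):
        token, _, params = part.strip().partition(";")
        if token.strip().lower() not in ("gzip", "x-gzip"):
            continue
        q = 1.0  # default quality when no q parameter is given
        for p in params.split(";"):
            key, _, value = p.strip().partition("=")
            if key.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # unreadable q value: treat gzip as refused
        return q > 0
    return False


def respond(path: str, accept_encoding: str) -> tuple[int, dict[str, str]]:
    """Return (status, headers) for a request to a rewritten map URL."""
    if path.endswith(".gz") and not accepts_gzip(accept_encoding):
        # Client cannot read gzip: point it at the uncompressed URL.
        return 307, {"Location": path[:-3]}
    return 200, {}  # serve the cached file as is


print(respond("/sitemaps.xml.gz", ""))
print(respond("/sitemaps.xml.gz", "gzip, deflate"))
```

Switching to a permanent redirect would only mean returning 301 instead of 307 in the first branch.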

This is to tell you that you'll be redirected if you follow these links ;)

For now, duplicates are only destroyed when the cache is built; I'll work on a full solution.

Anyway, the sitemapindex was submitted and has been used by Google with great satisfaction since yesterday :D

So here is, after all, a very nice solution to build Google sitemaps with 10,000 URLs in each :D

It should be possible to go beyond this, but I was too lazy to post more topics to test further :lol: Maybe we'll have to set some record: the first to output 50,000 wins a cookie :D
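
On those numbers: the sitemap protocol caps a single sitemap file at 50,000 URLs, so a bigger board has to split its list over several files referenced from the sitemap index. A minimal Python sketch of such a split, assuming a chunk size of 10,000 and hypothetical file names (the post does not say how the module itself divides its sitemaps):

```python
from typing import Iterator, Sequence


def chunk(urls: Sequence[str], size: int = 10_000) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def sitemap_files(base: str, n_urls: int, size: int = 10_000) -> list[str]:
    """List the sitemap URLs a sitemap index would need for n_urls entries."""
    count = -(-n_urls // size)  # ceiling division
    return [f"{base}/sitemap-{i}.xml" for i in range(1, count + 1)]


print(sitemap_files("http://www.example.com", 11834))
```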

And this explains why it took some time to develop. Simply put, it's all rewritten from scratch, all OO, and as I benchmarked it with 10,000 URLs, I was able to optimize the script ... where small changes make a great difference ;)

As for going further, I was thinking about the new Yahoo! API tools, like the update notification; it could be nice to notify Yahoo! upon every RSS feed cache update :D

++

_________________
Useful links :
SEO Forum || SEO Directory || SEO phpBB || SEO phpBB3 || Search
____________________

Useful links (French) :
SEO forum || SEO directory || phpBB SEO || phpBB3 SEO || Search


Last edited by dcz on Thu Nov 16, 2006 12:34 am; edited 2 times in total
nims
phpBB SEO Team
Joined: 11 Oct 2006
Posts: 243
Location: New Delhi, India

Posted: Mon Nov 13, 2006 9:41 am    Post subject: Re: Google Sitemap module update to test

Hey, good to see this. I was really waiting for an RSS MOD. So will you be coming up with a separate RSS MOD, or will this only be integrated with Sitemaps?

_________________
Audio Ads - Earn from every visitor to your site
Buy Latest mobiles *** SEO India
dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Mon Nov 13, 2006 11:40 am    Post subject: Re: Google Sitemap module update to test

The module is designed to be highly modular (sic), kind of the way it was before: every type of output corresponds to an included file.

So it comes with RSS, Yahoo! urllist.txt and Google Sitemaps support, but you can choose to use only one type of output by uploading just the files you need rather than all of them.

Anyway, IMHO, we all need RSS and Google sitemaps, so I found it handy to provide both in a single module that is even easier to install than the previous mx Google sitemaps ;)

++

nims
phpBB SEO Team
Joined: 11 Oct 2006
Posts: 243
Location: New Delhi, India

Posted: Tue Nov 14, 2006 11:09 am    Post subject: Re: Google Sitemap module update to test

Great !!! So when do you suggest I should switch from the earlier Google Sitemap to this one? Is it safe to do this? I mean, will Google ranking etc. not be damaged by this?

dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Tue Nov 14, 2006 11:32 am    Post subject: Re: Google Sitemap module update to test

You'll be able to do so as soon as I publicly release it, i.e. soon. I am currently translating it into English (I wrote this one in French at first) and cleaning up the code (there is some).
Anyway, it can only be better than the previous one, and updating will not be an issue.

The sitemap URLs themselves will change if you're using the phpBB SEO mixed or advanced mod rewrite, as there will be title injection in them, but it's no big deal: you will just need to resubmit the sitemap.

By the way, I added the Yahoo! notification to it :D

Yahoo! will be notified upon each update of the RSS feeds ;)

++

Peter77
phpBB SEO Team
Joined: 10 May 2006
Posts: 515
Location: Michigan

Posted: Wed Nov 15, 2006 4:25 am    Post subject: Re: Google Sitemap module update to test

Brilliant... absolutely brilliant, dcz. :D I like the Yahoo! notification idea also. I look forward to further updates!
nims
phpBB SEO Team
Joined: 11 Oct 2006
Posts: 243
Location: New Delhi, India

Posted: Thu Nov 16, 2006 10:04 am    Post subject: Re: Google Sitemap module update to test

Great !!!
I am eagerly waiting for the official release.

kevins



Joined: 16 Nov 2006
Posts: 5

Posted: Thu Nov 16, 2006 2:18 pm    Post subject: Re: Google Sitemap module update to test

When could we expect the first beta of this one?

Thank you!
dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Fri Nov 17, 2006 10:05 am    Post subject: Re: Google Sitemap module update to test

And welcome :D

Well, as you can see, release has never been closer.

And good news: Yahoo!, MSN and Google just decided to start using the Google sitemap protocol : http://googleblog.blogspot.com/2006/11/search-engines-united.html

So this mod will be even more useful, as it will now do the job for all three of them at once.

I had actually been wondering for quite some time why this wasn't the case, since Yahoo! and MSN bots were already crawling those; now it's certain they'll use them :D

I guess the module is coming at the right time, the cache feature allowing for major linking and usage for these.

Anyway, I uploaded yesterday what is close to the last version before a public beta.
I am not sure the total rewrite of the urllist.txt I just did will be that useful any more, but this is one of the differences from the very first version: the module is now able to output thousands of links as well (which was not the case in the very first one) and to output paginated links for forums and topics.

Another thing I added is Yahoo! notification, another thing made less attractive now that the sitemap protocol is shared with Yahoo!, but still, we'll be able to notify Yahoo! with RSS feeds, which bring a bit more information than the Google sitemap does.

Anyway, I am happy you all like this module ;)

++

nims
phpBB SEO Team
Joined: 11 Oct 2006
Posts: 243
Location: New Delhi, India

Posted: Fri Nov 17, 2006 3:09 pm    Post subject: Re: Google Sitemap module update to test

Good news !!! Now we can just have one common sitemap that can be used by all 3 major search engines, namely Google, Yahoo and MSN. Wow !!!

So what about this MOD then? Would you be removing the Yahoo list etc. that are not required?

argenta



Joined: 18 Nov 2006
Posts: 16

Posted: Sat Nov 18, 2006 10:09 am    Post subject: Re: Google Sitemap module update to test

dcz wrote:

Anyway, I uploaded yesterday what is close to last version before a public beta.


dcz, is it already available for testing? Where did you upload the version, here in the forum? I can't find it...
Thanks!
aznole



Joined: 19 Nov 2006
Posts: 26

Posted: Sun Nov 19, 2006 11:36 am    Post subject: Re: Google Sitemap module update to test

whoops ...wrong forum
dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Sun Nov 19, 2006 1:14 pm    Post subject: Re: Google Sitemap module update to test

Welcome argenta & aznole :D

So the mod is not available yet, but it will be soon ;)

This thread is dedicated to presenting and testing the module; you're welcome to say what you think of how it's going so far.

++

kevins



Joined: 16 Nov 2006
Posts: 5

Posted: Tue Nov 28, 2006 10:33 pm    Post subject: Re: Google Sitemap module update to test

OK then, we will have to wait a bit longer...

Don't you have a release date or something like that?

Before December xx ;) ?
dcz
Administrateur - Site Admin
Joined: 28 Apr 2006
Posts: 15135

Posted: Tue Nov 28, 2006 10:45 pm    Post subject: Re: Google Sitemap module update to test

Well, this is not really the kind of post that makes it faster.

Maybe you could comment a bit on why you are so impatient; we could find some nice feature to add just before release.

++
