- 1.2.0RC1 is out : To update please read this thread.
What follows is kept for archive.
Support for 0.0.2 will continue for some time for 0.0.2 users, until they update.
Do not use this version if you are first installing this mod.
mx Google Sitemaps
- Module name : mx Google Sitemaps
Module author : dcz / www.phpBB-SEO.com
MOD Description : This mod creates Google xml sitemaps for phpBB and mxBB Portal (if used).
Module version : v1.0.1
Installation Level: Easy for the Basics, Advanced if setting up mod_Rewrite for Google Sitemaps.
Installation Time : 5 Minutes for the Basics, a bit more if setting up mod_Rewrite for Google Sitemaps.
Author's Notes :
- Modular Google Sitemaps Solution for phpBB and mxBB Portal.
This mod will create a SitemapIndex, listing all the created Sitemaps.
Please make sure mxGoogle Sitemaps is working before you submit the sitemap.php file (or sitemaps.xml with mod Rewrite) to the Google Sitemaps Service.
This mod was first inspired by jhaskins's Google sitemap generator (Joe Haskins ~ http://www.streetrod3.com) but now works in a totally different way.
The code is meant to be as fast and light as possible while giving the appropriate information (URL and last modified time). The template system is bypassed entirely in favor of echo, for a smoother and much less RAM-consuming output. This allows a large number of topics to be listed and makes the system ready for large boards.
So far, the code has done a very good job listing up to 5000 unique topics (meaning even more links, since the mod outputs topic pagination as well) in a forum sitemap.
It should cope with even more; the Google Sitemaps standard allows up to 50 000 URLs per sitemap.
And since listing 50 000 topic URLs would use quite some resources and dig out very old threads, you can also limit the absolute number of links output, so that in all cases you will be able to use and take advantage of this code.
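To make the idea above concrete, here is a minimal sketch (in Python, not the mod's actual PHP code) of streaming a sitemap straight to the output, one URL and last modified time per entry, with a hard cap on the number of links. The function name `write_sitemap`, its parameters and the sitemaps.org 0.9 namespace are assumptions made for illustration; the mod itself may emit a different namespace.

```python
import sys
from datetime import datetime, timezone
from xml.sax.saxutils import escape


def write_sitemap(topics, out=sys.stdout, url_limit=50000):
    """Stream a <urlset> to `out` without building it in memory.

    `topics` yields (url, last_post_unix_time) pairs (hypothetical shape).
    Returns the number of URLs actually written.
    """
    out.write('<?xml version="1.0" encoding="UTF-8"?>\n')
    out.write('<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">\n')
    count = 0
    for url, last_post in topics:
        if count >= url_limit:
            break  # never exceed the configured cap
        lastmod = datetime.fromtimestamp(last_post, tz=timezone.utc).isoformat(
            timespec="seconds"
        )
        # escape() turns & into &amp; so query strings stay valid XML
        out.write(f"<url><loc>{escape(url)}</loc><lastmod>{lastmod}</lastmod></url>\n")
        count += 1
    out.write("</urlset>\n")
    return count
```

Writing each entry as soon as it is known, rather than collecting everything first, is what keeps memory use flat however many topics are listed.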
Now why such a system :
- You may have noticed Google Bot, as well as other Search Engine Bots, are spidering your pages.
To do so they must first find a link to your page, which in most cases will be the home page at first. Then they will eventually find links to other pages of your site.
Usually, they will come back a couple of times before they start to follow the newly discovered links, and will then continue to 1) discover new links in pages, and 2) explore new pages.
As you can see, this takes some time. There are a lot of pages on the internet, and bots need quite a few visits to find all of your content, and even more if there are several levels to go through before finding links to the actual searchable content.
With phpBB, interesting searchable content is found in topics. One has to go to the index to find links to forums, then to forums to find links to topics, at least the good ones (with the best URL). And since forums are paginated, there are yet more links to go through to spider the site.
So it can take quite long for a bot to find all of your content.
This is why such a system exists: to provide a lot of links to the Google Bot at once. Everybody wins: fewer visits are needed just to find links to searchable content, and visits are a lot more efficient.
With this mod, the Google Bot will visit the SitemapIndex, the only file to submit, and will find one Sitemap per forum plus one listing the forums themselves, each with a last modified date based on the last post made in them.
So it will only load the Sitemaps with new content and find up to 50 000 URLs at a time per sitemap, again with a last modified date based on the last post of every thread, so it won't even have to revisit all threads before finding the new ones.
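As a rough illustration of the SitemapIndex described above (a Python sketch, not the mod's PHP code), each forum gets one entry whose last modified date comes from its last post. The `sitemap.php?forum=ID` URL pattern, the function name and the sitemaps.org 0.9 namespace are assumptions for the example only.

```python
from datetime import datetime, timezone
from xml.sax.saxutils import escape


def build_sitemap_index(base_url, forums):
    """Return a sitemap index as a string.

    `forums` is an iterable of (forum_id, last_post_unix_time) pairs
    (hypothetical shape); the per-forum URL pattern is illustrative only.
    """
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    for forum_id, last_post in forums:
        loc = f"{base_url}/sitemap.php?forum={forum_id}"
        lastmod = datetime.fromtimestamp(last_post, tz=timezone.utc).isoformat(
            timespec="seconds"
        )
        lines.append(
            f"<sitemap><loc>{escape(loc)}</loc><lastmod>{lastmod}</lastmod></sitemap>"
        )
    lines.append("</sitemapindex>")
    return "\n".join(lines)
```

A bot that remembers its last visit only needs to fetch the entries whose last modified date is newer than that visit.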
By default, new content appears at the top of every list, so one could, say, limit the number of links output (configurable in ACP) to the last 1000 active threads of each forum, in order to save some server resources while still providing a reasonable amount of links.
Ordering them the other way, older first (configurable in ACP), could also be useful for someone with a big board who wants to list all of the site's content after deciding to get rid of SIDs, in order to have the old content indexed as well. At least for a while, before coming back to the default sorting and putting the priority on new content again, as it should be.
A new feature was added in 1.0.1: you can now exclude public forums from the listings. Likewise, when mxBB or KB is installed, you can now exclude public mxBB pages and/or KB categories. This can be handy when you think some of those should not be that optimized.
With KB, it's the same: one Sitemap per category, listing approved and public articles. Last modified time is based on the last post time in the forum and comment thread.
When running mxBB Portal, a Sitemap listing the portal's pages will show up in the SitemapIndex, but without a last modified date, since it does not apply here (there is no info about this in the mx tables for now).
In conclusion, this mod will not really help to increase Page Rank (PR), since links in sitemaps are not considered back-links, but it will help a lot to get indexed. The vast majority of the Google Bot's visits will now be used to cache pages instead of just trying to find new content: more pages cached in fewer visits, and new content found faster.
And if you just post a link to it somewhere (not too many though, you don't want too many visits on those if you list a lot of links) you will see many other bots exploring it.
This system is 100% standalone (it does not change a single phpBB line) and is totally self-installing. Updates are as easy as uploading files and possibly (but I doubt it) running an SQL script.
There should be no more issues, if there ever were any, using Oracle and PostgreSQL.
If your specific rewritten URL standard is not currently supported by the phpBB SEO mod Rewrites, please post a request for it in the phpBB mod rewrite Forum and the Team will provide premodded files.
Supported Languages :
- English, French.
- phpBB "Vanilla" Installation
The Forum n°2 Map
Preview mx Google SiteMaps ACP
Running mxBB :
Download File :
- phpBB :
To update from RC1 or RC2, you will have to do the same as for a new install.
You will have to re-enter your settings in ACP this time.
- mxBB :
To update from RC1 or RC2, you will have to uninstall the module in ACP before you do the same as for a new install.
You will have to re-enter your settings in ACP this time.
Current Features :
- Sitemaps Created :
- Adds all necessary entries in the SitemapIndex
- One general category sitemap, listing all public categories
- One sitemap per category, listing approved articles.
- Adds an entry in the sitemap index
- Adds the proper option for KB to run with mx in ACP (defines the mx KB page's ID)
- One sitemap listing all public mx pages.
One SitemapIndex listing all sitemaps
One general forum sitemap, listing public forum URLs
One sitemap per public forum, listing topics.
- When KB is installed : (both on phpBB and mxBB Portal)
All major queries are separated into several cycles, configurable in ACP.
Possibility to limit the number of URLs output, configurable in ACP.
Possibility to output (and limit) paginated topic URLs, configurable in ACP. Pagination will be limited like this :
| Down|     |   Up   |
|     |     |        |
v     v     v        v
1, 2, 3 ... 10, 11, 12
Only public and valid content is listed. 100% valid output.
- Features :
Please suggest: a cache feature maybe, support for the Yahoo standard.
Other Mods support :
Will see if other mods would benefit from such a listing.
Other mod_Rewrite support :
Will add premodded files to the package for any unsupported URL standard upon request.
- Premodded files for several mod rewrites are included.
If applicable, just overwrite the standard files with the right contrib files.
Supported URL standards : phpBB SEO mod Rewrites, Webmedic's rewrite mod, able2know rewrite mod, GoogleBB Links ...
- English Support On this Thread. If you need some specific adaptation, please start a new thread in this Forum.
Here is what you will see on the ACP side. Note that in this example mx Sitemaps is not installed, so the mx Google Sitemaps module will only install and use the Google SiteMaps menu. The mx SiteMaps menu will show up in the same menu category if installed.
Note : An extra option shows up when running mxBB Portal, allowing you to set the KB mxBB page ID and to set up proper exclusions.
ACP Menu :
The Google sitemap system makes it easier for GoogleBot to find pages far away from the home page. This system generates a sitemap index pointing to the different sitemaps available.
You must register your SitemapIndex with Google if you want to access some interesting stats.
You can proceed anonymously though.
|Google Sitemaps Settings|
|SQL cycle |
Major queries are separated into several cycles in order not to overload the SQL server. This is the maximum number of topics to fetch within a single query.
|Url Limit |
Maximum number of URLs output in each sitemap.
Since this limit is checked in every SQL cycle, the actual number of URLs output is this limit +- 1 SQL cycle +- the number of paginated topics (limited or not) in the last cycle.
Limited by default to 40 000, knowing Google will go up to 50 000 per sitemap file.
|Sort Order |
All output links are sorted the same way topics are sorted by default in phpBB (last activity DESC).
You can set this to ASC, for example, if you wish to make it easier for Google to find links to archeological or locked threads (e.g. inactive for a long time).
|Mod Rewrite Sitemaps |
If activated, the sitemap URLs will be rewritten.
CAUTION : you MUST run an Apache server with mod_rewrite activated and properly set up the .htaccess located in this release's contrib/ folder.
NOTE : This only affects the sitemap URLs provided in the sitemap index. Google has no problem visiting sitemaps whose URLs are not rewritten.
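The interaction between the SQL cycle and the URL limit described above can be sketched as follows (Python, for illustration only; the mod itself is PHP). `fetch_batch` is a hypothetical stand-in for one "LIMIT offset, size" query. Because the limit is only checked once per cycle, the total can overshoot it by up to one cycle.

```python
def collect_topics(fetch_batch, sql_cycle, url_limit):
    """Fetch topics in batches of `sql_cycle` rows until `url_limit` is reached.

    `fetch_batch(offset, size)` is a hypothetical callable returning a list
    of rows, standing in for a single "LIMIT offset, size" query.
    """
    rows = []
    offset = 0
    while len(rows) < url_limit:  # the limit is checked once per cycle
        batch = fetch_batch(offset, sql_cycle)
        if not batch:
            break  # no more topics in this forum
        rows.extend(batch)
        offset += sql_cycle
    return rows


# Example: 1000 topics, cycles of 300, limit of 500.
topics = list(range(1000))
result = collect_topics(lambda off, size: topics[off:off + size], 300, 500)
print(len(result))  # 600: the limit was passed during the second cycle
```

A smaller SQL cycle keeps each query light and makes the final count track the limit more closely, at the cost of more queries.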
|Forums Sitemaps Settings|
|Forum Exclusions |
You can exclude some public forums from the mx Google Sitemaps Listing.
Enter the list of forum IDs to exclude, comma separated : e.g. 1,5,8.
Note : If not filled, all public forums will be listed.
|Announcement Priority |
Announcement Priority (must be a number between 0.0 & 1.0 inclusive)
|Sticky Priority |
Sticky Priority (must be a number between 0.0 & 1.0 inclusive)
|Default Priority |
Priority for regular topics (must be a number between 0.0 & 1.0 inclusive)
|Topic Pagination: Low Limit |
Paginated topic link output is handled. Enter here how many paginated topic pages, from the beginning, are to be output.
If set to 0, it won't output paginated links after the first topic page.
|Topic Pagination: Upper Limit |
Enter here how many paginated topic pages, starting from the last one, are to be output.
If set to 0, it won't output paginated links before the last topic page.
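One possible reading of the two pagination limits, sketched in Python (this is an interpretation, not the mod's actual code): the first topic page is always listed, the low limit counts extra pages after it, and the upper limit counts pages from the end. The function name and parameters are made up for the example.

```python
def paginated_pages(total_pages, low_limit, upper_limit):
    """Return the page numbers to list for a topic of `total_pages` pages.

    Assumed reading of the settings: page 1 is always listed, `low_limit`
    adds that many pages after page 1, and `upper_limit` adds that many
    pages counted back from the last one.
    """
    first = set(range(1, min(total_pages, 1 + low_limit) + 1))
    if upper_limit > 0:
        last = set(range(max(1, total_pages - upper_limit + 1), total_pages + 1))
    else:
        last = set()
    # A set union avoids listing a page twice on short topics
    return sorted(first | last)


print(paginated_pages(12, 2, 3))  # [1, 2, 3, 10, 11, 12]
print(paginated_pages(12, 0, 0))  # [1]
```

On short topics the two ranges overlap, so each page still appears only once.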
(C) 2006 dcz - http://www.phpbb-seo.com/