Sitemaps - the good, the bad and the crawled Print E-mail
SEO/SEM
Written by Jeff Jones   
Wednesday, 03 September 2008
XML SitemapsSitemaps are essential to creating a successful site from both a usability perspective and (of course) a search engine perspective. Implementing an HTML sitemap is a relatively simple task for any Web site, but the implementation of an XML sitemap (the standard sitemap’s cooler, more important big brother) --

-- is a trickier proposition if it’s set up incorrectly, it could actually be detrimental to your rankings.


The Good: Why XML sitemaps?


“Sitemaps are an easy way for webmasters to inform search engines about pages on their sites that are available for crawling. In its simplest form, a Sitemap is an XML file that lists URLs for a site along with additional metadata about each URL...” (Source: http://www.sitemaps.org).

Unlike a standard HTML sitemap, which is essentially just an unordered list of links to the important pages on your site, an XML sitemap conveys other important information targeted directly at the search engine spiders:

  • Assigning a priority to your pages: XML sitemaps can indicate (on a 1-5 scale with 1 being most important) the priority of certain pages in relation to other pages on your site. This is a great way to tell the spiders how to prioritize each page of content, resulting in your most important pages being ranked higher than your less important pages.*

  • Passing content update information: An XML sitemap can also include two tags:

a) changefreq - tells the spiders how often a page is updated and indicates how often it is likely to change

b) lastmod - tells the spiders when a page was last changed and/or updated

*Google is clear to say that the information you pass is considered to be a “hint” as to how content should be indexed by search engine spiders crawling your site.  As such, there is no guarantee that they will index or rank your pages exactly how you indicate. In addition to this, Google Webmaster Central provides some helpful information when you have a sitemap that can’t be overlooked. All of the above being said, Yahoo! and MSN also both look for XML sitemaps.

The Bad: Potential problems with XML sitemaps

 

  • Keeping your sitemap up-to-date: Unless your sitemap is directly connected to a Web content management system (Web CMS) or database that generates the sitemap every time a change is made to a page and submits it to Google, it’s easy for your XML sitemap to quickly become out-of-date.

  • Competitive intelligence: If you’re telling the search engines the relative priority of all of your pages, you can bet this information will also be of interest to your competitors. Unfortunately, there’s no way to protect your sitemap such that only the search engines can access it.

  • Reduction in natural crawling of your site: Some camps are recommending that XML sitemaps be implemented and verified (through Google, Yahoo!, etc.) but not actually submitted to the engines. The gist of these arguments (and they are valid) is that:

a) you probably aren’t as smart as the spiders at determining the relative “importance” of one page over another on your site (business blinders, not knowing your visitors, etc.), and

b) by adding an XML sitemap to the search engines you are telling them to ignore pages that you may have missed on the sitemap, which may in turn cause problems with crawling. In fact, natural spider crawls will often find issues with your architecture that you weren’t aware of.

Given the information above, if your site is new and doesn’t have hundreds (or thousands) of pages, it’s usually a good idea to create an XML sitemap but NOT submit it to the search engines - at least not until the site has been crawled and you’ve been able to determine if there are any problems with your architecture.

Submitting your sitemap


Google Webmaster Tools [http://www.google.com/webmasters/sitemaps/]


First, set up a Google account. Next, verify that you are the Web site owner through either Meta Tag or file creation. Google provides very clear instructions about how to do this - essentially, you need to have access to your site to perform these actions. Once you have been verified as the site owner, you can submit the URL of your sitemap. Google Webmaster Tools also provides a host of other information including crawl errors, pages with the highest PageRank, top search queries for your site, and much more.

Yahoo! Site Explorer [http://siteexplorer.search.yahoo.com/]


Much like Google Webmaster Tools, first you require a Yahoo! account and then you’ll need to verify that you are the Web site owner. Yahoo! will then provide you with a file to upload to the root of your site. Once you have been verified as the site owner, you can submit the URL of your sitemap to Yahoo! Site Explorer.

MSN [http://search.msn.com/docs/submit.aspx]


Currently, you can simply submit your XML sitemap through MSN’s regular URL submission page.

### 

About The Author

Jeff Jones is a Web-optimization specialist. He helps companies improve their Web sites, SEO results and PPC-campaign performance. Connect and collaborate with him on Twitter

Related Resources:
Comments (1)Add Comment
very nice!
written by chris m, September 4, 2008
thank you for educating me on the need for sitemaps
report abuse
vote down
vote up
Votes: +2

Write comment
quote
bold
italicize
underline
strike
url
image
quote
quote
smaller | bigger

busy
 
< Prev   Next >

Lyris HQ Client Login

Flash Player Required

Lyris HQ requires the most recent version of the Adobe Flash Player, a free browser plug-in.

Get Adobe Flash Player.

Get Adobe Flash Player

Advertisement
 
Email Marketing & Internet Marketing Tools - Lyris HQ
Maximize your marketing spend. Lyris HQ brings together email marketing, deliverability tools, content creation, Web analytics, search marketing and mobile marketing. Execute campaigns and review ROI performance from one integrated solution. That's the unbeatable power of Lyris HQ.
Join conversations and make connections at: