Search Optimization
  Home arrow Search Optimization arrow Advanced Use of Robots.txt
SEO Chat Forums  
Choosing Keywords  
Google Optimization  
Link Trading  
MSN Optimization  
Search Engine News  
Search Engine Spiders  
Search Optimization  
Web Directories  
Website Marketing  
Website Promotion  
Website Submission  
Yahoo Optimization  
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Dedicated Servers  
Actuate Whitepapers 
VeriSign Whitepapers 
IBM® developerWorks 
Sun Developer Network 
SEO Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH OPTIMIZATION

Advanced Use of Robots.txt
By: Jennifer Sullivan Cassidy
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 4 stars4 stars4 stars4 stars4 stars / 17
    2005-11-28

    Table of Contents:
  • Advanced Use of Robots.txt
  • MSNbot, Slurp, Googlebot, IA
  • Advanced Robots.txt Commands and Features
  • Meta Tag Instructions and Bandwidth
  • Using Robots.txt for Corporate Security

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT

    Stay one step ahead of the competition. Evaluate and give feedback on some of the hottest web development tools on the market today. Make your opinion heard! Click Here

    Advanced Use of Robots.txt


    (Page 1 of 5 )

    With all of the SEO tips, tricks, and tutorials available to you, probably the easiest to achieve is the use of the robots.txt file. This is a simple file that gives instructions to search engine robots, or spiders, on how to crawl your website, and which files and directories to stay out of, and to not index in their databases.

    In an earlier tutorial by Clint Dixon, he showed you how to write a robots.txt file, and what information to include, such as User-Agent and the Disallow directive, instructing the search engine spiders on how to crawl your site.  In this article, I want to build upon what he showed you, and give you more information on the importance of the robots.txt in your SEO efforts, and some of the consequences of not having one, or having one written incorrectly.

    Behavior of Search Engines When Encountering Robots.txt

    Search engines behave differently upon encountering, or not, the robots.txt file during a crawl.  You only have to follow your web stats to know that the robots.txt is one of the most requested files by search engine spiders.  Many spiders check the robots.txt file first before ever performing a crawl, and some even pre-empt crawls by checking for the presence of, and commands in, the file; only to leave and come back another day.  In the absence of this file, many search engine spiders will still crawl your website, but you will find that even more will simply leave without indexing.  While this is seen as one of the most extreme consequences of excluding the robots.txt file, I will also show you consequences that I consider to be far worse.

    Some of the major search engine spiders and robots have distinct behavior patterns upon reading the robots.txt that you can track in your stats.  Sometimes, however, it is nice to have an outsider’s perspective on robot behaviors in order to compare it to what you may have noticed.  I view a lot of robots.txt files, and sites with and without them, so I’ve been able to come up with a few behavior patterns I would like to share with you.

    More Search Optimization Articles
    More By Jennifer Sullivan Cassidy


       · I wanted to show you another good example of utilizing your robots.txt to keep...
       · Thanks for the article. I was wondering your comment on that msnbot doesn't spider...
       · At the time the article was written (in early October, I think) that was the case...
       · Thanks for the reply. I'm really new at this whole SEO stuff. I just started doing...
       · MSN Bot spidered my sites just fine before the summer of '05 and I've never had a...
     

    SEARCH OPTIMIZATION ARTICLES

    - What to Consider When Hiring an SEO
    - Raising Your Visibility with LinkedIn
    - Optimize Your LinkedIn Profile
    - Basic SEO: Tricks and Tips
    - Navigating Your Way into Your Visitors` Hear...
    - Creating SEO-Friendly Content
    - Network Solutions: Unethical SEO
    - Basic SEO: What Search Engines Hate
    - Basic SEO: What Search Engines Love
    - I, Robots.txt
    - Same Game, Different Name
    - Use SEO to Get Your New Site Off to a Good S...
    - Basic SEO Troubleshooting
    - Getting an SEO Education
    - Site Target Marketing






    © 2003-2008 by Developer Shed. All rights reserved. DS Cluster 2 hosted by Hostway