Search Optimization
  Home arrow Search Optimization arrow Page 5 - Blocking Complicated URLs with Robots....
SEO Chat Forums  
Choosing Keywords  
Google Optimization  
Link Trading  
MSN Optimization  
Search Engine News  
Search Engine Spiders  
Search Optimization  
Web Directories  
Website Marketing  
Website Promotion  
Website Submission  
Yahoo Optimization  
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Mobile Linux 
APP Generation ROI 
IBM® developerWorks 
SEO Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH OPTIMIZATION

Blocking Complicated URLs with Robots.txt
By: Codex-M
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 4 stars4 stars4 stars4 stars4 stars / 4
    2008-10-28

    Table of Contents:
  • Blocking Complicated URLs with Robots.txt
  • Benefits of Proper Robots.txt Usage
  • Google Webmaster tools Robots.txt Analysis Tool
  • More Rules
  • Folder Name

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Blocking Complicated URLs with Robots.txt - Folder Name


    (Page 5 of 5 )


    4.Blocking a particular part of the overall folder name

    Examples of this include the following:

    http://www.thisisasampledomain.com/(X(zjaksjjwsdjwjehrhejjdjhfhrhe))/folder/productinfo.aspx?id=201

    http://www.thisisasampledomain.com/(X(tyntnrnendnfngnrnennwnswme))/folder/productinfo.aspx?id=205

    http://www.thisisasampledomain.com/(X(yturnjfhdjwhdgdbvfvgcbdbsbae))/folder/productinfo.aspx?id=306


    And depending on the site's purpose, there may be thousands of them. That would make it impossible to list them one by one in the robots.txt file. The correct approach is to identify a unique pattern.

    Based on the above URLs, there is a particular part of the URL that is repetitive. This is/(X

    However, since/(X is associated with different URLs and different query strings, it cannot be blocked using the ordinary robots.txt syntax. This means we must once again make use of regular expressions.

    Since we are only interested in blocking all those URLs containing/(X , we can use this an exact match like:

    User-agent: *

    Disallow: /(X(*/

    The above syntax will block all dynamic URLs beginning with /(Xsomewhere in the folder name. This is a very useful approach for big dynamic websites infected with massive duplicate content.

    Important: Always test your robots.txt file using Google Webmaster tools before uploading it to your root directory to see if it blocks the URLs you intend to block and does not affect other URLs.


    DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware.

       · Like one excample:I have one website url...
     

    SEARCH OPTIMIZATION ARTICLES

    - Mobile SEO: Create, Post, and Track Content ...
    - Has Your Website Been Hacked?
    - WordPress 301 Redirect: Tips and Techniques
    - Five Ways to Optimize Pages
    - Updating WordPress Tips and Techniques
    - WordPress Database Tutorial: Security, Backu...
    - Submit and Update a WordPress Plug-in
    - Are You Optimized? Use SEO Analysis
    - WordPress SEO Tips: Benchmarking Matt Cutts ...
    - How to Increase the Conversion Rate of Your ...
    - SEO Strategies: A Guide to Which Ideas Work ...
    - Setting Up Feedburner for SEO
    - How to Use Feedburner for SEO
    - Statistical Process Control Implementation i...
    - Create Focused SEO with Subtitles



     



    © 2003-2010 by Developer Shed. All rights reserved. DS Cluster 6 Hosted by Hostway
    For more Enterprise Application Development news, visit eWeek