Search Optimization
  Home arrow Search Optimization arrow Page 4 - Preventing Duplicate Content on an E-C...
SEO Chat Forums  
Choosing Keywords  
Google Optimization  
Link Trading  
MSN Optimization  
Search Engine News  
Search Engine Spiders  
Search Optimization  
Web Directories  
Website Marketing  
Website Promotion  
Website Submission  
Yahoo Optimization  
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Mobile Linux 
APP Generation ROI 
IBM® developerWorks 
SEO Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH OPTIMIZATION

Preventing Duplicate Content on an E-Commerce Site from Session IDs
By: Codex-M
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 5 stars5 stars5 stars5 stars5 stars / 4
    2009-04-28

    Table of Contents:
  • Preventing Duplicate Content on an E-Commerce Site from Session IDs
  • Robots.txt and Sitemap (XML, dynamic and static versions)
  • Oscommerce Admin Configuration
  • The Link Rel Canonical Solution

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Preventing Duplicate Content on an E-Commerce Site from Session IDs - The Link Rel Canonical Solution


    (Page 4 of 4 )

     

    After many years of duplicate content desperation, Google finally came up with a solution that allows webmasters to specify their preferred (canonical) URLs.

    <link rel="canonical" href="http://www.yourwebsite.com/yourpreferredurl.php" />

    How does this work? It's very simple. By placing this link rel canonical tag in any of your affected website template files, Google will know the URL you prefer to have indexed without either your or them being forced to deal with an in-depth technical solution.

    For example:

    http://www.yoursite.com/osc/products_new.php?osCsid=cf66b6d1ecc142348775790bef595556 , is indexed by Google. When Googlebot visits this URL again, and you have done the reconfiguration described in the previous section, it is now confused. 

    Since http://www.yoursite.com/osc/products_new.php is the canonical URL, we will specify link rel=”canonical” tag in the products_new.php template.

    Copy and paste this code to the affected template file:

    <link rel="canonical" href="http://www.yoursite.com/osc/products_new.php" />

    To do this, download the template file in your desktop to edit it (do not forget to backup!), and then upload it back to your server via FTP.

    The link rel=”canonical” tag will be placed in the header section of the template file.

    If this is done correctly, it should look like the screen shot below:

     

    Link rel=”canonical” is a highly important solution for the following reasons:

    • It will transfer page rank and link juice.
    • It will also transfer other URL signals for establishing relevance in the search engines. This ensures that you will not lose any of your earned relevance and you will continue to rank well in search engines.

    Conclusion

    Duplicate content issues due to session IDs are serious, because they affect rankings in the search engines. All of the corrective actions outlined here are feasible but the most recommended actions include the following:

    • Create and use a sitemap containing the canonical list of your preferred URLs that does not use session IDs. An XML sitemap should be uploaded to the root directory and submitted to the Google Webmaster Tools Sitemap section.
    • Use “Prevent Spider Sessions” in the Oscommerce admin setup.
    • Specify your canonical URLs using the link rel=”canonical” tag in all of your affected website templates.

    Using these recommendations, it is assured that when the Googlebot indexes a URL containing a session ID, your server will return the canonical URL. And furthermore, when the Googlebot re-indexes a URL containing a session ID (from the previous crawl in the past), it will know the official URL because of the rel=”canonical” tag, and then update the indexed URL to show your preferred version.


    DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware.

       · As eCommerce consultants, we've also recognized a few other sources of duplicate...
       · Nice article thanks. Those query sort (SORT=) variables in ecommerce sites is really...
       · Although slanted towards OsCommerce, the article was informative, however did not go...
       · The important thing is to block those in robots.txt if you can see clearly see a...
     

    SEARCH OPTIMIZATION ARTICLES

    - Has Your Website Been Hacked?
    - WordPress 301 Redirect: Tips and Techniques
    - Five Ways to Optimize Pages
    - Updating WordPress Tips and Techniques
    - WordPress Database Tutorial: Security, Backu...
    - Submit and Update a WordPress Plug-in
    - Are You Optimized? Use SEO Analysis
    - WordPress SEO Tips: Benchmarking Matt Cutts ...
    - How to Increase the Conversion Rate of Your ...
    - SEO Strategies: A Guide to Which Ideas Work ...
    - Setting Up Feedburner for SEO
    - How to Use Feedburner for SEO
    - Statistical Process Control Implementation i...
    - Create Focused SEO with Subtitles
    - Ivan`s SEO Tips





    © 2003-2010 by Developer Shed. All rights reserved. DS Cluster 10 Hosted by Hostway
    For more Enterprise Application Development news, visit eWeek