SEARCH OPTIMIZATION

Preventing Duplicate Content on an E-Commerce Site from Session IDs
By: Codex-M
    2009-04-28

    Table of Contents:
  • Preventing Duplicate Content on an E-Commerce Site from Session IDs
  • Robots.txt and Sitemap (XML, dynamic and static versions)
  • Oscommerce Admin Configuration
  • The Link Rel Canonical Solution



    Preventing Duplicate Content on an E-Commerce Site from Session IDs - Oscommerce Admin Configuration


    (Page 3 of 4)

    The Oscommerce admin configuration includes a useful feature called “Prevent Spider Sessions.” It works like this:

    • Googlebot visits a website URL containing a session ID.
    • The server uses Apache mod_rewrite to automatically 301 redirect any URL containing a session ID to its canonical URL. So if Googlebot finds this one:

      http://www.yoursite.com/osc/specials.php?osCsid=cd5627128b63b13553aea5b6c2b3d65c

      the server will do a server-side 301 redirect to http://www.yoursite.com/osc/specials.php. Therefore, instead of indexing URLs containing session IDs, Googlebot will crawl and index the canonical version (without a session ID).
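
    To make the mapping from a session URL to its canonical URL concrete, here is a minimal Python sketch. It is not Oscommerce's own code; it only illustrates what the redirect target looks like. The function name canonical_url is hypothetical, and the parameter name osCsid is taken from the example URL above.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Session parameter name as seen in the example URL (an assumption for other setups).
SESSION_PARAM = "osCsid"


def canonical_url(url, session_param=SESSION_PARAM):
    """Return the URL with the session-ID query parameter removed.

    Illustrative helper only: it shows the redirect target, it does not
    perform the redirect itself.
    """
    parts = urlsplit(url)
    # Keep every query parameter except the session ID.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != session_param
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )


if __name__ == "__main__":
    session_url = (
        "http://www.yoursite.com/osc/specials.php"
        "?osCsid=cd5627128b63b13553aea5b6c2b3d65c"
    )
    print(canonical_url(session_url))
```

    Other query parameters are left in place, so only the session ID is dropped from the URL.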

    This should be set up at the earliest stage of the website's development, ideally before Googlebot is allowed to crawl the website's pages.

    To implement this solution, do the following:

    • Log into your website's Oscommerce admin panel.
    • Under Administration, find “Configuration.”
    • Under “Configuration,” find “Sessions.”
    • In “Sessions,” find the entry named “Prevent Spider Sessions” and click “Edit.”
    • In the Edit options, select “True” and click “Update.”

    After editing, it should look like the screen shot below:

     

    To see the list of allowed spiders, navigate to /osc/includes and find the spiders.txt file. Be careful when editing this file, and always make a backup first.
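
    As a rough illustration of how a list like spiders.txt can be used, the Python sketch below checks a visitor's user-agent string against a list of spider name fragments. It assumes the file holds one fragment per line and that matching is a case-insensitive substring test; both are assumptions for illustration, and the function names and sample entries are hypothetical rather than taken from Oscommerce.

```python
from typing import List


def load_spider_signatures(text: str) -> List[str]:
    """Parse spiders.txt-style content, assuming one fragment per line."""
    return [line.strip().lower() for line in text.splitlines() if line.strip()]


def is_spider(user_agent: str, signatures: List[str]) -> bool:
    """Return True if any signature appears in the user-agent string."""
    agent = user_agent.lower()
    return any(signature in agent for signature in signatures)


if __name__ == "__main__":
    # Hypothetical sample content, not the real file shipped with the store.
    sample = "googlebot\nslurp\nmsnbot\n"
    signatures = load_spider_signatures(sample)
    googlebot = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
    print(is_spider(googlebot, signatures))
```

    A check like this also shows why careless edits are risky: a fragment that is too short or too generic could match ordinary browsers as well.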

    This is an excellent corrective action in the early stage of your site, when Googlebot has not yet indexed it. Indeed, it is better to take this action than the one discussed in the previous section.

    However, if Googlebot has already started indexing your site, along with the ugly session IDs, this solution can create duplicate content issues, because Googlebot will now index the canonical URLs, too. These pages will duplicate what Google already found at the indexed URLs with session IDs.
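
    One way to see how far this problem reaches is to group a list of already-indexed URLs by their canonical form. The Python sketch below is a hedged illustration: the function names are hypothetical, the parameter name osCsid comes from the example URL earlier on this page, and the URL list is made-up sample input, not real index data.

```python
from collections import defaultdict
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def strip_session_id(url, session_param="osCsid"):
    """Return the URL without its session-ID query parameter."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != session_param
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )


def find_duplicate_groups(indexed_urls):
    """Map each canonical URL to its indexed variants, keeping only duplicates."""
    groups = defaultdict(list)
    for url in indexed_urls:
        groups[strip_session_id(url)].append(url)
    # A canonical URL with more than one indexed variant is a duplicate group.
    return {canon: urls for canon, urls in groups.items() if len(urls) > 1}


if __name__ == "__main__":
    # Made-up sample of indexed URLs for illustration.
    indexed = [
        "http://www.yoursite.com/osc/specials.php?osCsid=cd5627128b63b13553aea5b6c2b3d65c",
        "http://www.yoursite.com/osc/specials.php",
        "http://www.yoursite.com/osc/index.php",
    ]
    for canon, variants in find_duplicate_groups(indexed).items():
        print(canon, len(variants))
```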

    Fixing this issue permanently requires another corrective action, which I'll discuss in the next section.

    © 2003-2009 by Developer Shed. All rights reserved.