Handling Duplicate Content - Dynamic Pages
(Page 4 of 4 )
Another big duplicate content issue is the huge number of content variations based on user preference. For example, some users may filter content by topic, color, date, reviews, price, etc. Essentially the content stays the same while display results get shuffled, producing an infinite number of variations that look identical to search engine spiders.
The solution is to block variations from search engine spiders and only show them one version of content. For example, if your site features products, you can allow search spiders to see a default layout of content (which may be by alphabet, price, etc), but block them from following links that filter results by other criteria. This way there's no duplicates, while users can still sort the information.
This also applies to blogs. Try putting content only in one section of the website, instead of multiple sections, and block search engine spiders from accessing archives by date.
Session IDs
Session IDs are evil.
Session ID websites give spiders a unique ID. As spiders follow URLs they all think they're following their own URLs, while in fact they're seeing exactly the same content. Search engine spiders usually ignore session IDs due to the duplication it produces and because there are no static URLs to feature in search results.
Changing or disabling session IDs is the solution.
Quotes, Similar Pages
Quotes or several copied paragraphs are okay. Internal pages with unique content, but several duplicate paragraphs are okay as well.
Google is much more forgiving of duplicate content issues to authoritative domains. In fact, old sites can get away with much more of this than new ones.
Summary
Duplicate content hurts an entire site's performance in the search results. Though only a single page may be filtered out from the results, the site's overall trust score may go down with it.
Duplicate content issues are especially important if you're a merchant, selling items that are similar to those sold by other merchants. Keep the template to content ratio at least on the same level, get rid of session IDs and block access to sorting features to feed search engines spiders better. The better you feed the spiders, the more they like you.
| DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware. |