Internet Stuffing, Yummy - Overcrowding the Internet
(Page 3 of 4 )
Another byproduct of the "Internet stuffing" that is occurring is simply overcrowding of data. Envision an article about global warming being published tomorrow. Now envision a week from tomorrow there being 10,000 web sites who cover the same article by adding a sentence or two, and then regurgitating the same information. Now for that one single topic which was posted once by the original creator, it has been replicated 10,000 times over.
At the rate at which all kinds of information is being added to the Internet on a daily basis, it stands to reason that search engines will have to create much more stringent rules pertaining to which sites to index, and which not to index. At first glace this kind of thought process would be considered blasphemy; it brings up questions such as who decides what to index and what not to index, what guidelines or rules would be followed to make such determinations, would the small guy/site get screwed in the process, and so on.
All of these questions are valid ones and I don't pretend to have all the answers. But that should not preclude us from taking a closer look at this issue. Google has publicly stated that they are experiencing disk space issues, which is one of the reasons why they created a "supplemental index," so any content/sites not deemed to be Google worthy would get tossed in the supplemental index, making your site much harder to find. Logic dictates that very barbaric solution is only a stop gap, and invariably Google will have a much more widespread plan implemented which will weed out authority sites from non-authority sites to the point where a non-authority site won't show up on page 9 -- it simply won't show up at all, and will not be indexed.
One idea might be for Google to make some severe modifications as to how their Google sitemaps currently works. For instance, if it grabs your data on a nightly basis, maybe it should time stamp all the data it collects and keep track of these time stamps. Then if an article or web site attempts to copy it or feed off of it like many leach sites out there do, it will recognize the original authority of the article and give it preferential treatment when deciding rank and position for queries. Obviously I would also recommend all of the other search engines offer a similar type of service as well. I am sure there are some contingencies this simplistic solution does not account for, but at least it would be a move in the right direction when attempting to decide who the rightful originator of content is.
Next: So What Can be Done? >>
More Website Marketing Articles
More By Jonathan Caputo