Home arrow Search Engine News arrow Taking a DeepDyve into the Deep Web
SEARCH DEVARTICLES

TOOLS YOU CAN USE

advertisement

Taking a DeepDyve into the Deep Web


(Page 1 of 4 )

If the latest figures from Internet research companies can be believed, Google serves the needs of most web searchers most of the time. But serious researchers know that the web boasts nooks and crannies that even Google’s spiders can’t reach. DeepDyve thinks it can help solve this problem.

These secret caverns, nicknamed the “deep web,” hide their content behind subscription-based firewalls. Mainly of interest to academics and researchers in specialized fields (such as law or medicine), these databases of information are expensive to build and expensive to maintain. Fortunately, their main clientele includes those with the money to pay for access: lawyers, large libraries, medical schools, major universities and the like.

That’s not quite so helpful for the individual, independent researcher. There are parts of the deep web that don’t require a subscription, but a lone researcher may face other obstacles. Many important scientific papers don’t receive a lot of links, regardless of their scholarly citations. This stymies the search engines’ usual approach to finding documents. So what can you do when you know the information is out there, but you just can’t get to it by the usual means?

This is where DeepDyve comes in. Two bio-informatics scientists who worked on the Human Genome Project founded the company as Infovell in 2005. Their genetics background shows in the algorithm DeepDyve uses. Called “KeyPhrases,” it indexes passages up to 20 keywords in length rather than single keywords. Indeed, rather than focusing on key words, KeyPhrases matches patterns and symbols. DeepDyve CEO William Park told Wired that the algorithm “is really doing pattern matching; it’s not at all language dependent. In fact it’s actually language agnostic.”

This reflects a genetics background in at least two ways. First, think of the length of the human genome; it’s huge, and made up entirely of three-letter “words,” the amino acid codes that together form the chains of proteins that keep our bodies functioning. DeepDyve’s KeyPhrases algorithm uses indexing techniques from the field of genomics; if they work on the human genome, what can they do for the Internet?

More Search Engine News Articles
More By Terri Wells

blog comments powered by Disqus

SEARCH ENGINE NEWS ARTICLES

- Zurker: Social Network for the 99 Percent?
- SOPA and PIPA: Bad Ideas
- Siri`s Search Strangeness Not Apple`s Fault
- Google Plus One Rivals Facebook Like Button
- Google Launches Media Ads for AdWords
- Targeting Keyword Domains Next on Google Age...
- Google Cracking Down on Fake Goods
- Google Panda Update Slams Content Farms
- What the JC Penney Link Buying Scandal Mean...
- New Panguso Search Engine Launches in China
- Google Changes Algorithm for Low Quality Sit...
- Google`s New Chrome Extension
- Update Your SEO Vocabulary
- Bing Searches Increase Strongly in January
- Facebook Unveils New Sponsored Stories Featu...
 
SEO Chat Forums  
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Contact Us 
Site Map 
Request Media Kit
Write For Us Get Paid 
SEO Weekly Newsletter
 
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Privacy Policy 
Support 


© 2003-2012 by Developer Shed. All rights reserved. DS Cluster 9 - Follow our Sitemap
Popular SEO Chat Topics
All Tutorials & Tools
 
SEO Chat is sponsored by:
Close this Sponsor Message