Search Optimization
  Home arrow Search Optimization arrow Multilingual Sites and Search Engines:...
SEO Chat Forums  
Choosing Keywords  
Google Optimization  
Link Trading  
MSN Optimization  
Search Engine News  
Search Engine Spiders  
Search Optimization  
Web Directories  
Website Marketing  
Website Promotion  
Website Submission  
Yahoo Optimization  
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Dedicated Servers  
Moblin 
JMSL Numerical Library 
IBM® developerWorks 
Sun Developer Network 
SEO Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH OPTIMIZATION

Multilingual Sites and Search Engines: Part II
By: Tsvetanka Stoyanova
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 3 stars3 stars3 stars3 stars3 stars / 10
    2005-05-31

    Table of Contents:
  • Multilingual Sites and Search Engines: Part II
  • Non-Character Languages and Cross-Language Searching
  • NPL Applications and Practices in SEO

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Multilingual Sites and Search Engines: Part II


    (Page 1 of 3 )

    As already discussed in the first part, multilingual sites face special challenges in regard to search engine optimization and placement in addition to the general challenges and requirements faced by pages in English.

    NPL (Natural Language Processing) Is the Key

    It seems that content on the page is the most important factor in identifying the language and the way it is done is with the help of NPL (Natural Language Processing) algorithms for matching text on the page to grammar and structural rules for the specific language.

    The concept of grammar and structural rules is especially useful when similar languages that use the same alphabet are concerned. For instance, when performing a search for "лаптоп" (the Bulgarian for "laptop") on Google, the results varied according to my search preferences. When I limited the search to pages only in Bulgarian (without restricting the country), Google returned about 3,500 pages and when I set no restriction about language and country, I got over 16,000 results from Bulgarian, Russian, Macedonian, Ukrainian, etc. sites, basically all places where they use a Cyrillic alphabet and the word "laptop" is spelled in the same way.

    Also, there were major variations in the order of the retrieved search results. The top 10 results from the first search (pages in Bulgarian only) now were in the top 30 and new entries from foreign sites were present in the top 10. It is important to note that most of the pages in the first 10 in both searches had no explicitly defined the language in the <html lang="xx"> tag, so their language must have been identified by the search engine in some other way.

    Similar to Google, AltaVista performed well with the same search string "лаптоп" in Bulgarian, although there were occasional pages in other languages that also use the Cyrillic alphabet. Also, the number of search results (1,470) was half of what Google found, although with Google there were more dublicates (duplicates being results that belonged to the same site or domain). It deserves noting that AltaVista supports wildcards and truncation for Cyrillic as well.

    I guess that in addition to distinguishing similar languages that use a similar alphabet, Natural Language Processing could be very useful in distinguishing the varieties and dialects of a particular language, provided that there are explicit differences in grammar, structure, and vocabulary.

    There are good reasons to believe that when performing a general search (in this case general means without restricting the language), search engines perform simple text processing of the pages they find. They retrieve everything where the particular string of letters, numbers, and other symbols, encoded as series of bytes occur, just matching the characters and not bothering with details about the text that is surrounding them. On the contrary, when the language for search is explicitly defined, search engines use NPL to deliver a refined set of results. And probably it is the search engine algorithms for Natural Language Processing, together with the grammar and morphological intricacies of particular languages that produce so many results in English and (very often) so few in other languages.

    More Search Optimization Articles
    More By Tsvetanka Stoyanova


       · Thank you all for your feedback! I hope I did not miss an e-mail and answered to all...
     

    SEARCH OPTIMIZATION ARTICLES

    - You Need More Than One Site Map
    - The Whys and Hows of Video Search Optimizati...
    - An SEO`s Experience: 21 Rules for Performing...
    - An SEO Eyeful: Interview with Ronald Herskow...
    - Research Your Competition for SEO
    - How I Became Number 1
    - Popular SEO Myths
    - Meta Elements: A Field Guide
    - Optimize This Page Title
    - What to Consider When Hiring an SEO
    - Raising Your Visibility with LinkedIn
    - Optimize Your LinkedIn Profile
    - Basic SEO: Tricks and Tips
    - Navigating Your Way into Your Visitors` Hear...
    - Creating SEO-Friendly Content






    © 2003-2008 by Developer Shed. All rights reserved. DS Cluster 2 hosted by Hostway