Search Optimization
  Home arrow Search Optimization arrow Page 4 - Multilingual Sites and Search Engines:...
SEO Chat Forums  
Choosing Keywords  
Google Optimization  
Link Trading  
MSN Optimization  
Search Engine News  
Search Engine Spiders  
Search Optimization  
Web Directories  
Website Marketing  
Website Promotion  
Website Submission  
Yahoo Optimization  
SEO Tools
Adsense Calculator
AdSense Preview
Advanced Meta-Tags
Alexa Rank Tool
Check Server Headers
Class C Checker
Code to Text Ratio
CPM Calculator
Domain Age Check
Domain Typos
Future PageRank
Google Dance
Google Keywords
Google Search
Google Suggest
Google vs Yahoo
Indexed Pages
Keyword Cloud
Keyword Density
Keyword Difficulty
Keyword Optimizer
Keyword Position
Keyword Typos
Link Popularity
Link Price Calculator
Meta Analyzer
Meta Tag Generator
Multiple Link Popularity
Page Comparison
Page Size
PageRank Lookup
PageRank Search
Robots.txt Generator
ROI Calculator 
S.E. Comparison 
S.E. Keyword Position 
Site Link Analyzer 
Spider Simulator 
URL Redirect Check 
URL Rewriting 
Mobile Linux 
APP Generation ROI 
IBM® developerWorks 
Sun Developer Network 
SEO Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH OPTIMIZATION

Multilingual Sites and Search Engines: Part 1
By: Tsvetanka Stoyanova
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 4 stars4 stars4 stars4 stars4 stars / 18
    2005-05-24

    Table of Contents:
  • Multilingual Sites and Search Engines: Part 1
  • How Do Search Engines Know a Site or Page is Not in English?
  • IPs, TLDs, and Searching by Country
  • Character Set Issues

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Multilingual Sites and Search Engines: Part 1 - Character Set Issues


    (Page 4 of 4 )

    Although character set issues are not directly related to languages but more to alphabets, it is worth mentioning them in this article. It is not enough to specify the language of the page only; its encoding must be specified as well. The general rule is that one encoding can be used for more than one language (i.e. Windows 1251 is for Cyrillic, and it can be used for Russian, Bulgarian, and other pages). The opposite is also true: there can be more than one encoding (ISO, for Windows, for Mac, and so forth) for a language. Of course, there is Unicode, but it often causes more problems (in the proper displaying of pages) than it solves. Because of this, Web developers are reluctant to use it as an universal approach.

    Since encoding is more about display than search, is there a relationship between encoding and search results? Yes, there is. First, it affects indexing. Although most major search engines index pages in any encoding, there are still search engines (starting with national ones) that index only a limited number of charsets. So if your site gets excluded from the search results of a particular search engine, the reason for this could be that pages on the site are in an unsupported charset.

    Second, there are search engines which perform indexing and results retrieval of pages with not-so-popular encoding by recoding the character set (i.e. converting it to a different set). This operation (performed back and forth) can also influence search results. This is especially true for languages that have special symbols, for instance accented characters.

    Third, for those search engines that allow wild card symbols and truncation, very often these functions are not fully supported for non-Latin charsets.

    Content Reveals the Language

    It is hardly surprising that when servicing requests for pages in a particular language only, Google determines the language based on the content on the page and on the context in which the search string occurs. How do search engines know so many languages? Well, the answer is simple: they use NPL (Natural Language Processing), i.e. they have some type of database that contains words in different languages, together with some grammar and structural rules specific to that language, which allows them to analyze the text and determine the dominant language of a page. More details about the mechanism of NPL and about other factors that influence sites in foreign languages are included in the second part of the article.


    DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware.

       · I have been waiting for articles on multilanguage sites! Just a couple of notes and...
       · 1. I am not sure if time is the answer here. If your Spanish page has arrived...
       · I am sorry, when posting the first comment in the morning, I did not notice that I...
       · Thanks for your responce. I'll maintain the numbers:1. My English site version...
       · 1. Well, the Spanish site has not been submitted so long ago, so let's hope that in...
       · Thanks - we'll see what happens. Don't feel for reporting abuse, they are not a...
       · Yes, sometimes just waiting is the perfect solution. So let's see when your site...
       · I've a English and Spanish site 2 different domain names, but both the same apart...
       · Hi Ian,1. For how long have the two sites been online? It's hardly probable that...
       · Hi tsveti,Thanks for your reply.www.costaandsierra.com has been online...
       · Well, February is not so long ago so it could be a reason. I do not believe that...
       · Thanks for all your help I'm working on links and hope to see some in-roads with...
       · Hi,I have like 18 different languages on my site ... its an immigration site...
       · As far as Google and some of the other search engines are concerned, the use of the...
       · Hi there, we are using Zope/Plone with Linguaplone which adopts the language to...
       · Make a link from you English homepage to the homepage in the other language and put...
     

    SEARCH OPTIMIZATION ARTICLES

    - SEO Tricks That Will Lower Your Rankings
    - Building Search Engine Tag Trails
    - Blocking Complicated URLs with Robots.txt
    - Is Your Web Content Accessible?
    - Links and More SEO Tips for Beginners
    - Ten SEO Guidelines
    - SEO Overview and Tips for Beginners
    - Stumbling Blocks to Web Site Success
    - Web Pages to Include in Your Site
    - Big Sites Don`t Automatically Rule Search En...
    - You Need More Than One Site Map
    - The Whys and Hows of Video Search Optimizati...
    - An SEO`s Experience: 21 Rules for Performing...
    - An SEO Eyeful: Interview with Ronald Herskow...
    - Research Your Competition for SEO





    © 2003-2008 by Developer Shed. All rights reserved. DS Cluster 4 hosted by Hostway
    Stay green...Green IT