In Some Places, Local Search Beating Google

Gotta Love It by pembo13 · 2007-10-28 19:06 · Score: 4, Insightful

How some people treat everything "Google" as if it were special. It would be newsworthy *if* Google were beating local search engines in foreign areas.

--
"Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft

Re:Gotta Love It by MoonFog · 2007-10-28 19:27 · Score: 5, Insightful

I agree, this is a non-story really. In Norway we have a search engine called Kvasir (kvasir.no) which is very good for Norwegian stuff. Big surprise, the big American company cannot compete on accuracy versus a search engine specialized on finding Norwegian results? This is surprising how exactly?
Re:Gotta Love It by Daengbo · 2007-10-28 19:51 · Score: 2, Informative

Yeah, but here in S. Korea, I don't even think they know who Google is. That's pretty impressive. Want to do an internet search? Naver.com. Want a map? Naver. Want a friend's e-mail address? Naver. Shopping? Naver. Jeez. It's everyone's home page. It searches everything in Korea. No one uses anything else.

--
Put identity in the browser.
Re:Gotta Love It by duggi · 2007-10-28 20:08 · Score: 5, Interesting

It is surprising thus: People (from the English speaking world) have assumed that Google is number 1. Going by its search results, it is definitely a top contender for the post. So much so that it is the common homepage for millions of internet users all over the world. The non English speaking market is generally assumed to be underdeveloped (Africa, Indian subcontinent) or Google already has something for them (language packs). The relationship between Google and China is well known, so it is expected to dominate the Chinese and, along with it, other SE Asian markets, as it did in the English speaking world. The story comes as a surprise for those who have been seeing the world in a hazy, interpolated and homogeneous manner. (I belong here too.) But after the story is published, the haziness has been removed and the story seems pretty obvious. Hence my reaction: "WTF? Is this even newsworthy?"

--
http://monkeynesianeconomics.blogspot.com/
Re:Gotta Love It by Yetihehe · 2007-10-28 20:13 · Score: 5, Insightful

> The story comes as a surprise for those who have been seeing the world in a hazy, interpolated and homogeneous manner. (I belong here too.)

So it IS newsworthy, as it helps you understand the world better.

--
Extreme Programming - Redundant Array of Inexpensive Developers
Re:Gotta Love It by Threni · 2007-10-28 21:29 · Score: 2, Interesting

> How some people treat everything "Google" as if it were special.

I think Google is special. They were the first decent webmail service (ie they offered more than 10 megs or whatever, no annoying ads, POP3 access etc). They offer free mobile phone apps to read Gmail, or use Google maps. The language translation works. Google groups is great - ok, it's a bit buggy and you can't employ killfiles, but there's no other way that I know of to search Usenet archives, and it's pretty quick at that.

That's what I use - I'm sure other people use other features that I've not noticed/used. For all Microsoft's braying about innovation, they just do podgy, uncool stuff, or buy up other people's stuff and then fuck it up. Yahoo are playing catch-up in the search/email area (are they still attaching World Cup 2006 sigfiles to outgoing emails? How amusing!).
Re:Gotta Love It by Anonymous Coward · 2007-10-28 21:40 · Score: 2, Informative

> I agree, this is a non-story really. In Norway we have a search engine called Kvasir (kvasir.no) which is very good for Norwegian stuff. Big surprise, the big American company cannot compete on accuracy versus a search engine specialized on finding Norwegian results? This is surprising how exactly?

Kvasir uses Google for web search, and adds its own directory listings and stuff on top of it. It has no web search engine of its own (go to their page on how to get your site indexed, and they link you directly to Google). They did run a very successful marketing campaign hammering in the message that they were better at local stuff. And if you want the YP business listings and other extras they add, maybe.. but it is not a search engine competitor to Google, it is Google..
Re:Gotta Love It by batje · 2007-10-28 21:57 · Score: 2, Informative

"The non English speaking market is generally assumed to be underdeveloped (Africa, Indian subcontinent) "

I think you forgot to mention the European Continent where people speak underdeveloped languages like French and German, and Asia of course, which is just slightly bigger than China alone (Indonesia alone has about 240 million inhabitants)

Besides that, English is rather well spoken in India as well as large parts of Africa, underdeveloped as they might be.

American primary education, it's tough.
Re:Gotta Love It by Tim+C · 2007-10-28 22:20 · Score: 3, Insightful

Actually, I rather think that was his point - that when the average American thinks "non-English speaking country" they tend to think of places like Africa and the Indian subcontinent, forgetting that there are a great many high-tech countries with first languages other than English.

--
It's official. Most of you are morons.
Re:Gotta Love It by PRC+Banker · 2007-10-28 22:33 · Score: 2, Informative

> How some people treat everything "Google" as if it were special. It would be newsworthy *if* Google were beating local search engines in foreign areas.

Yes. In China Baidu is the leader, though search is a general term covering searching many things for many people. Apparently, though, Google.cn is very effective in serving and marketing to the higher revenue, more educated, higher earning customer sectors.

My main purpose for commenting was to point out the article linked solely to Newsweek pages: a Newsweek story and a couple of limp stories about searching in South Korea and Russia ALSO from Newsweek. No bad rap on Newsweek though, all the better for them linking to three of their own stories in one article.

--
Oh.
Re:Gotta Love It by tfreport · 2007-10-28 23:00 · Score: 4, Insightful

Come on versus how the French look down at me at my poor attempts while they visit MY country? I call bullshit.

While it's fun/popular to make fun of the US and English speakers, few other language groups will praise someone for their broken sentences as they make their first attempts. Most people are pretty touchy when their tongue is mispronounced. Perhaps that is fair, but I wouldn't say it's English speakers looking down on others due to their language (perhaps other things, but not language).

And no, most Americans do not have a second language. But why would they? It's not like a small European nation where you can travel or see people from other countries on a semi-often basis. There are many parts of the US where you will go years without a foreign visitor. You could argue that people should travel to see the world, but when you have a nation that is as large and varied as a majority of Europe, what's the need? You have enough to do just to know your own country. Wait a few years and most Americans will at least be bilingual; the schools have really picked up the amount of Spanish taught.
Re:Gotta Love It by dintech · 2007-10-28 23:13 · Score: 4, Insightful

> few other language groups will praise someone for their broken sentences as they make their first attempts.

Umm no. Japanese will often compliment you on your attempts to communicate in their language. However they are just being polite, and actually you really suck at it.

I think this is a general rule for most languages. Paradoxically, people will stop commenting on how 'good' your language skills are only when you are fluent and they don't notice your shortcomings. If someone politely comments that you speak very well in a particular language, most likely you still have some way to go.
Re:Gotta Love It by mgblst · 2007-10-28 23:44 · Score: 2, Funny

Just went there, couldn't understand a thing. How can people really expect to use this at all?
Re:Gotta Love It by MMC+Monster · 2007-10-29 00:22 · Score: 2, Funny

You can also have some fun by praising people of their command of their native language.

I end up with 50% confused, 50% insulted.

--
Help! I'm a slashdot refugee.
Re:Gotta Love It by Zebedeu · 2007-10-29 00:44 · Score: 2, Funny

> I think this is a general rule for most languages. Paradoxically, people will stop commenting on how 'good' your language skills are only when you are fluent and they don't notice your shortcomings. If someone politely comments that you speak very well in a particular language, most likely you still have some way to go.

As someone who has been learning German for the past year, and getting those same compliments, I have to say to you: thanks dude, that'll really help me feel good next time I get one of those :-P

(though I agree with your post 100%)
Re:Gotta Love It by Eivind · 2007-10-29 02:26 · Score: 2, Insightful

Google sucks -BIGTIME- if you attempt to use it in languages other than English, at least the two where I've regularly attempted it, Norwegian and German.

Indeed, my main *complaint* about Google is that it likes to let its search-results be influenced by the language of the searcher, even when that is explicitly not wished, and it doesn't seem to be possible to turn that off.

You can "Search the web" (default), "Search pages in German" and "Search pages from Germany", which is fine and dandy. What's less fine is that the results you get if you "search the web" are *VERY* different if you happen to be logged in to Google (say because you use Gmail) compared to what you get if you ain't. And the results you get are *MUCH* worse.

My guess is, they're trying to bias the results so that pages of presumed interest for Germans are ranked higher, which is freaking ANNOYING if you are like me, and search for terms that really are not local.

Example: Change your interface language in Google to German, then "search the web" for "ubuntu". The top 4 links are to ubuntuusers.de and de.wikipedia.org/wiki/Ubuntu; the Ubuntu homepage is down at 5.

Now, I'd *expect* that if I had said "search German pages" or "search pages from Germany", but I explicitly did NOT. I wanted the most relevant pages for the word "ubuntu" regardless of language and domain; if I wanted something different I'd have said so, thankyouverymuch.

It's exquisitely braindead to FORCE the user to prioritize pages from the same country, or in the same language, as the user's chosen interface language, without mentioning that by a word. The option does NOT say "prefer German pages", it says: "Show the Google user interface in German". The two *aren't* the same and shouldn't be treated as such.

As far as I've been able to discover it is IMPOSSIBLE to convince Google that yes, I'd like the user-interface to be Norwegian (or german), but NO, I do -NOT- want those domains or languages given extra emphasis when I search, unless I say so (for which there are options!)

It's bad enough to make google localisation useless for me. I have it set to english. It's the only way to make it deliver sensible results.

OTOH by ceeam · 2007-10-28 19:15 · Score: 5, Informative

Still, Yandex is unbelievable crap - results-quality wise. I'd say Top3 go in reverse in this parameter. But the problem I think - apart from advertising (Y had a rather big ad campaign some time ago) - is that Google seriously dropped the ball and showed huge negligence and ignorance when entering local market unprepared - for example, their engine did not even search for different wordforms and Russian of course has an ultra-developed word endings system. So - at first - Google was 99% useless. Plus - Y had been around the longest and most people simply don't care about switching.

Re:OTOH by efence · 2007-10-28 20:25 · Score: 2, Insightful

Also Google's contextual ads showing up in Gmail for mail in Russian are absolutely irrelevant to the subject most of the time as compared to mail in English. That really tells about the attention to the markets other than English-speaking.

Too western? by bushboy · 2007-10-28 19:21 · Score: 3, Interesting

Perhaps in the West, we often assume that Google is the only player in town worth using.
It would be interesting to get the view of someone in South Korea, for instance, as to how useful Google is to them when compared with local/regional alternatives?

It's more than likely that Google is far too orientated around the West, both culturally and in terms of results.

--
A slashdotting - you get the stick first and then the carrot !

Re:Too western? by fender_rules · 2007-10-28 20:01 · Score: 3, Insightful

Naver's greatest advantage lies in its 'KIN' service, which is pretty similar to what www.answers.com provides. But most people don't go to their site for web searching. Rather, they go there for fun: reading all the news articles (and all those trolling comments... yeah, they're actually fun sometimes), blogs, cartoons, video clips and whatever.

It's not really comparable to Google. They're apples and oranges IMHO.
Re:Too western? by mgblst · 2007-10-29 01:14 · Score: 2, Interesting

The big question is, when they dub over movies, do they change references from Google to something else? I imagine this is how a lot of people know about Google.

In Soviet Russia... by Anonymous Coward · 2007-10-28 19:25 · Score: 3, Funny

Google searches you! Oh wait...

In Soviet Russia the currency transfer trounce you by arivanov · 2007-10-28 19:31 · Score: 2, Insightful

Not surprising. Till recently Russian currency was not freely convertible.

As a result, dealing with an external broker for services was too painful to contemplate. This restriction formed a protectionist barrier on any service dealing with relatively small financial transactions. As a result, companies like Google were locked out of the market in favour of the local brokers.

AFAIK they have a freely convertible currency now which changes the rules of the game back in favour of Google and from there on ... Oh well... size matters...

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/

Re:Newsflash! by pipatron · 2007-10-28 19:38 · Score: 2, Insightful

And these other search engines don't serve the interests of their stock holders?

--
c++; /* this makes c bigger but returns the old value */

Character sets? by ThirdPrize · 2007-10-28 19:42 · Score: 3, Interesting

How does Google handle all the various extended character sets out there? Can you search in Cyrillic, Chinese or even French?

--
I have excellent Karma and I am not afraid to Troll it.

Re:Character sets? by Anonymous Coward · 2007-10-28 19:58 · Score: 3, Informative

You can search in Cyrillic (and in other alphabets too), but it only looks for the exact words in the query, i.e. no morphological search. This is often good enough if you know exactly what you're looking for, like lyrics of a song, but if the query is more abstract, local search engines always win.
Re:Character sets? by rxmd · 2007-10-28 20:45 · Score: 4, Informative

> You can search in Cyrillic (and in other alphabets too), but it only looks for the exact words in the query, i.e. no morphological search.

This is actually not true anymore. For example, you can do a Google search for "Putin", and it will highlight results in other grammatical cases than the nominative as well. It has been like this for a year or so. It's still not very far advanced yet, but Google apparently realized that they've got catching up to do.

--
As a state gets corrupt, its laws multiply; the most corrupt states have the most numerous laws. (Tacitus, Annales 3:27)
Re:Character sets? by Cyberax · 2007-10-28 23:30 · Score: 2, Informative

It still doesn't work very well. Yandex can inflect whole phrases and can work with compound words (words containing more than one stem). Google still uses simple word normalization.

Obligatory by Jello+B. · 2007-10-28 19:42 · Score: 2, Funny

In Korea, only old people use Google.

Re:Newsflash! by Andster · 2007-10-28 19:56 · Score: 5, Funny

Same reason I walk to work every day... because all damn car companies are controlled by the damn greedy stockholders.

It's 20 miles but I make it work because I'm so self-righteous.

As a Korean by ihavnoid · 2007-10-28 20:09 · Score: 5, Interesting

The fact that would probably be most shocking is that more than half of non-technical people don't even know what Google is (for example, my mom). In contrast, I find most of my non-technical friends have naver.com as their first page in IE. In Korea, it's quite common to see TV commercials say "search XYZ in Naver", instead of displaying a URL.

The biggest reason is that Naver actually hosts content, rather than just indexing it. Not only is Naver a strong search engine company, it also hosts a vast amount of blogs, forums, an online game site (Hangame), a user-provided knowledge base, plus third-party licensed content (such as dictionaries, public transportation routes, news provided by other media, etc.). All these contents are prohibited to robots (via robots.txt), which means Google can't even index them. Thus, no matter how great Google's search algorithm is, it will be almost impossible to match Naver's quality.
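
For readers who have not seen how a robots.txt exclusion plays out, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text, the `ExampleBot` user agent, the `example.com` URLs and the `can_crawl` helper are all made up for illustration; this is not Naver's actual file, only the general mechanism a well-behaved crawler follows.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents, invented for this example.
ROBOTS_TXT = """\
User-agent: *
Disallow: /blog/
Disallow: /kin/
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a polite crawler with this user agent may fetch the URL."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("https://example.com/blog/post1", "https://example.com/about"):
        print(url, can_crawl(ROBOTS_TXT, "ExampleBot", url))
```

Under these assumed rules the blog URL is refused and the other page is allowed, which is the sense in which hosted content stays out of an outside index. Nothing technically enforces this; it only works as long as crawlers choose to honour the file.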

Plus, running a homepage *that looks cool* is a very complicated job for a non tech-savvy person. Thus, they don't get webhosting - they upload contents to big portals. I've even seen many small businesses forget about homepages, and instead have a blog/user-created forum/whatsoever on every major player. It would be much easier for normal users to reach them (since memorizing a URL written in a non-native language would be painful), and cheaper (near zero) to maintain.

Another downside of Google is that it DISPLAYS English search results, which would be useless to them. Yes, people are too lazy to select 'Search for Korean contents only'.

In terms of actual users, I believe Google would fall even further behind (far behind 10th place), since there is another big portal cyworld (http://cyworld.com/), which provides personal blogging services and web-based communities.

I use many different searching methods
- Naver or Yahoo for local information (public transport route, looking for a place for a nice dinner, etc.)
- Wikipedia for something that's expected to exist on an encyclopedia
- danawa.com and enuri.com for searching best deals (equivalent to PriceGrabber or whatsoever)
- Naver for anything else in Korean
- Google for everything else, or if all the methods above don't give a good enough result.

As a result, I get to use google less and wikipedia more, while naver and everything else remains somewhat constant.

Re:As a Korean by Anonymous Coward · 2007-10-28 20:25 · Score: 2, Insightful

their dictionary, imho, is the best. and generally speaking, searching korean words on the net is such a pita. search engines do not make sense of particles and cannot separate words when they are just written with no spacing. I mean, there's a lot of ambiguity in word separation rules in korean, so it just makes harder for google. I wonder how it works in japanese....
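
The word separation ambiguity can be illustrated with a small sketch. It uses English as a stand-in, since the point is only that unspaced text can split into dictionary words in more than one way; the vocabulary and the `segmentations` function are hypothetical and say nothing about how any real engine handles Korean or Japanese.

```python
def segmentations(text, vocabulary):
    """Return every way to split text into words from vocabulary."""
    if not text:
        # One way to segment the empty string: no words at all.
        return [[]]
    results = []
    for end in range(1, len(text) + 1):
        prefix = text[:end]
        if prefix in vocabulary:
            # Keep this prefix and segment whatever is left.
            for rest in segmentations(text[end:], vocabulary):
                results.append([prefix] + rest)
    return results


if __name__ == "__main__":
    # Hypothetical toy vocabulary.
    vocab = {"no", "now", "where", "here", "nowhere"}
    for split in segmentations("nowhere", vocab):
        print(" ".join(split))
```

With this toy vocabulary the same seven letters yield three valid readings, so an indexer has to pick one (or index all of them) before it can match a query.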

The reason why NAVER in Korea tops google by holywarrior21c · 2007-10-28 20:28 · Score: 5, Interesting

I am a student from Korea, so I know Korean websites very well. Naver gained popularity by providing a human-generated search engine and user-generated content, such as its imitation of Yahoo's answers page. But there is no good search engine that supports Korean on the face of this planet. At least European languages share a common alphabet, which is the reason why Google holds a significant share in Europe. But Korean is just different from English. When I search the internet in Korean, neither Google nor Naver returns reliable results. There is no search engine that supports basic functions like spelling correction either. (Say you type Koreea in Google, and it will ask whether you meant to type Korea.) Web portals and search engines in Korea are more like very well organized catalogs with useful advertisements. There is a long way to go in developing a web search engine for Korean. In fact, some progress has been made. Until the new technology is finally embedded into their websites, it is just going to be a good yellow book with lots of ads. The funny thing is that when I use Google I do my best to ignore all the ads, but when I use Naver, I only look at their ads. The funnier thing, though, is that most scholars in Korea use Google when searching in Korean, because it has a simpler interface.
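
The "did you mean" behaviour described with the Koreea example can be sketched with Python's standard `difflib`. This is only an illustration of closest-match suggestion against a known word list, not how Google or Naver implement it; the `VOCABULARY` list, the `suggest` function and the 0.8 cutoff are assumptions.

```python
import difflib

# Hypothetical list of known terms; a real engine would derive this from its index.
VOCABULARY = ["korea", "career", "google", "naver", "search"]


def suggest(word, vocabulary=VOCABULARY, cutoff=0.8):
    """Return the closest known word, or None if the word is known or nothing is close."""
    word = word.lower()
    if word in vocabulary:
        return None
    matches = difflib.get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    print(suggest("Koreea"))
```

A similarity cutoff like this leans on words being spelled out letter by letter, which is one reason the same trick is harder to carry over to a script that composes syllable blocks.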

Please NO! by Chrisq · 2007-10-28 21:50 · Score: 2, Insightful

> All these contents are prohibited to robots (via robots.txt), which means Google can't even index them. Thus, no matter how great Google's search algorithm is, it will be almost impossible to match Naver's quality.

This could be the beginning of a slippery slope. Suppose Google responded by ignoring robots.txt files in Korea and protecting orkut, blogger and its own sites with robots.txt files that it does not obey itself. Up until now there has been an unwritten rule - something protected by robots.txt won't be indexed by any public search engine. The possible side-effect of breaking this rule is that robots.txt files are ignored, which can be a real pain for small scale interactive sites.

Re:Please NO! by tony1343 · 2007-10-29 01:31 · Score: 2, Interesting

If Google were to ignore the robots.txt file, it would be possible to bring suit against Google in the United States. There have been some successful suits of this kind based upon the old English common law cause of action "trespass to chattels", which was a relic of common law history until the internet came along. I believe one such successful action was by eBay against an auction crawler (eBay v. Bidder's Edge or something like that). Some courts, I believe, are unwilling to hear such a claim unless there is actual monetary damage caused by the indexing, so I'm not sure how this would turn out (or if I am remembering the caselaw very well). I am not a lawyer.

The reason is mostly ignorance by temcat · 2007-10-28 22:03 · Score: 2, Interesting

Google beats the hell out of Yandex and Rambler where results relevance is concerned. It's just that people got used to these and don't bother to switch.

That's not the complicated part by Moraelin · 2007-10-28 22:41 · Score: 5, Insightful

Transferring links around isn't the hard part. The hard part is to actually get something that's relevant for that search string.

Just simple lists of keywords associated with that link won't do. We already had that kind of search engines long before Google, and there's a reason why Google handed their arse to them.

And then there are the people gaming the system for a quick profit... even if it means ruining a valuable resource for everyone else. There was an almost epidemic of link spam on all possible forums and blogs, for example, just to raise the Google rank of a couple of pages.

Most of Google's uphill battle so far has been tweaking the algorithm to defend against such "attacks".

(And now that I mention it, it dawns upon me that maybe that's why smaller national engines can do better locally. With everyone trying to game Google and generally the larger English-reading world, it could be that no one bothered polluting the smaller national searches.)

So just being able to swap links around won't do much.

The second and third problems I see with your idea are, well:

1. timing. When I search for something, I'd rather not depend on the right people being online at that exact time. I also want the answer in half a second. Google does that with in-RAM indexes. I wouldn't bet a fortune on someone doing that equally fast via several hops over the net, P2P style.

2. reliability. P2P traffic has been poisoned repeatedly by interested parties, like, say, the RIAA and MPAA. And it's entirely trivial to do so. So what's to keep other interested parties from poisoning P2P search with falsely tagged links?

Even on Google, it's not entirely rare that someone buys ad-word keywords on their competitors' trademarks or such. E.g., if you have a company called, say, "Houndwire", I could buy that keyword for an ad for my company. Now everyone who searches for your company, will have my ad served to them. Then keep my fingers crossed that if I'm in roughly the same market, some people will just go ahead and buy from me. There have been even laws proposed against that kind of impersonation.

Now for adwords it's one thing, but the same could just as well be applied to poisoning a P2P search. Which could ruin its usefulness pretty fast.

--
A polar bear is a cartesian bear after a coordinate transform.

Ignoring robots.txt by eniac42 · 2007-10-28 22:56 · Score: 2, Interesting

This is happening already..

http://www.ewhisper.net/blog/msn-ignoring-robotstxt-files/

There are ways to block search engines that do this..

http://www.ars.net/bots/

--
"A nation that forgets its past is doomed to repeat it." - Churchill

Google is also blocked by some filters by aendeuryu · 2007-10-28 23:24 · Score: 2, Interesting

I've had interesting problems at some Internet rooms (PC Bangs) here in Korea. Every now and then you'll see odd websites blocked by some strange sort of filtering system. The one I used to go to had Fark.com blocked, Youtube blocked, ESPN was blocked, and even Google.com was blocked. Now, google.co.kr was not blocked, and when I wanted to check my analytics page, google.com/analytics was blocked, but another google analytics page accessed by https:// (not http://) was available. I'm not very bright when it comes to networks (or Korean, for that matter), so I'm not sure whose fault it was, but the webpage that came up instead had a graphic that made it clear this was to protect children.

This is NOT a widespread epidemic, but it has occurred occasionally at various internet rooms around the country under different ownership (ie: not a chain). As someone else mentioned, Naver has brand strength (company commercials approach it very similarly to the way AOL used keywords), but these sorts of filtering anomalies don't hurt.

Re:In Soviet Russia the currency transfer trounce by Cyberax · 2007-10-28 23:26 · Score: 5, Insightful

Nope. It has been fairly easy to work with foreign currency in Russia since the early 90s. Yandex was simply MUCH better than Google because Google did not support Russian morphology until very recently.

For example, if I'm searching information about, say, the name of Putin's dog I can use the following search query:
"Imja sobaki Putina" - (the name of Putin's dog) and Yandex can find documents with the words
"Imena sobak Putina" - (the names of Putin's dogs - note the plural) or documents with the words
"Imen sobak Putina" - ([about] the names of Putin's dogs)
"Imena sobakam Putina" - another grammar case. ...

Russian morphology is MUCH MUCH more complex than in English. Yandex started working on morphological search in 1996, so it's not surprising that it's still much better than Google.
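
The difference between exact-word matching and morphological matching in the example above can be sketched in a few lines. The lemma table below is hand-built from the transliterated forms in this comment and is purely hypothetical; it is not Yandex's method, and a real engine would need a full morphological dictionary or analyser rather than a lookup of seven words.

```python
# Toy lemma table covering only the transliterated word forms used above (hypothetical).
LEMMAS = {
    "imja": "imja", "imena": "imja", "imen": "imja",
    "sobaki": "sobaka", "sobak": "sobaka", "sobakam": "sobaka",
    "putina": "putin",
}


def normalize(text):
    """Map each word to its base form; unknown words are kept unchanged."""
    return {LEMMAS.get(word, word) for word in text.lower().split()}


def exact_match(query, document):
    """True only if every query word appears verbatim in the document."""
    return set(query.lower().split()) <= set(document.lower().split())


def morphological_match(query, document):
    """True if every query word matches some document word after normalization."""
    return normalize(query) <= normalize(document)


if __name__ == "__main__":
    query = "Imja sobaki Putina"
    for doc in ("Imena sobak Putina", "Imen sobak Putina", "Imena sobakam Putina"):
        print(doc, exact_match(query, doc), morphological_match(query, doc))
```

All three documents fail the exact match and pass the normalized one, which is the gap between the two engines being described.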

Re:In Soviet Russia the currency transfer trounce by arivanov · 2007-10-29 01:02 · Score: 3, Insightful

Interesting point... Never thought about that but it makes a lot of sense.

It is a matter of approach to morphology actually.

IIRC Google's approach to morphology as a whole is to throw brute-force statistical analysis at it. They use statistical models and loads of data for translation. This works wonders with languages like English, which have more exceptions than grammar rules while having fairly rigid sentence ordering and a relatively limited common vocabulary.

Russian is very difficult to subject to this approach. Because it underwent a forced language reform at the turn of the 20th century, Russian grammar can be expressed in less than 10 pages of strict rules with around 30-40 exceptions. This grammar used to be drilled with a vengeance in Russian schools, so it has not changed a bit since it was formulated 100 years ago.

While the rules are strict (and relatively easy), the meaning of many key grammar elements is position-dependent. To add insult to injury, it has one of the largest working day-to-day vocabularies, and there are probably more ways to say the same thing than in any other language (I mean proper Russian, not "Na huja zhe tebe eto nado blad'").

So no wonder an analytical model is more successful than statistical. Thanks for pointing it out.

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/

Re:In Soviet Russia the currency transfer trounce by arivanov · 2007-10-29 02:31 · Score: 2, Interesting

The "useless" google is your friend:

http://www.ipmce.su/~lib/osn_prav.html

I used to have a "legit" version at my old house (no access to it at the mo) which was printed by Moscow State. It was 35-40 pages in total with the preface and the contents.

By the way, when I taught Russian in the USA nearly 20 years ago I had that trimmed to 10 pages for the beginners.

The problem I found with it is that most English-speaking students of foreign languages are humanities students who are heavily into memorising rather than using rules and logic. They can memorise any number of phrases, the most obscure vocabulary, etc., but they cannot memorise and use formal grammar. At all. As a result they have no problem with French, Spanish, etc., but with Russian they hit a wall and run away screaming that it is too hard.

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/
