What's Wacky with Google?
There are always going to be oddities with any big online service, but this one seems to be persisting. Join the discussion in trying to figure out a pattern. For maybe a week, Google has been returning zero results or "1-1 of about xxx,000" for common searches. One-word searches seem unaffected, but there are certain two-word combinations of common words like
candle truck
or
speaker bracelet.
Reversing the order can affect searches too:
motorcycle candles
vs.
candles motorcycle.
The strange thing is that usually the 1 or 2 results found are to commerce sites. Read the
Search Basics,
compare your notes to
GoogleWhack's,
have fun looking for patterns, but remember that Google always returns slightly different results for different IP numbers.
(Update: 13:56 GMT by J : When I first posted this story it said the problems have been occurring "for several weeks at least" -- but it seems to be more like one week.)
I am so glad someone else noticed this!!! I've been so pissed I haven't been able to get any speaker bracelets recently. God google... forcing me to use other search engines to get my fix.
SkyNet is becoming self-aware.
It's just a glitch in The Matrix, of course.
Check out this - all 25 hits on the quoted words "candle truck" should be showing up in the non-quoted search ...
Has anyone else noticed that the "spam" sort of sites that are nothing but link farms and Gator popups are getting much better at finding their way into Google's rankings? I switched to Google back in the day after search engines like altavista became overrun with such sites. Now I've noticed that they occasionally creep into their rankings...I guess entropy is the way of the universe after all.
quiet fool! .. you've uncovered the true plot behind this slashdot posting.
/. sites to take them down .. we will effect data mining for common searches on the internet.
No longer will we
Long live the Speaker Bracelet
Who makes you Sig?
At the risk of making you look bad, for phrase searches you have to put the phrase in quotes.
For example, I searched for "to be or not to be" phrase origin , and got what I consider to be useful results.
YMMV, of course.
Xentax
You shouldn't verb words.
"q=site:www.google.com google" - (third result)
This is what I'm seeing...
http://www.sminkybang.com/google.png
For any who are interested, Google.ca is behaving correctly. All search results listed (that I've tried so far) from googlewack.com are working properly and returning 1-1 of 1, or displaying as they should.
I wish I could compare to google.com, but for the past year or so, google.com automatically forwards all canadian IP's to google.ca
0110100100100000011000010110110100100000011000100
No, stories don't have to move through the cluster, and there's no concurrency bug. We have a front-end cluster of webheads but they all read from the same DBs. The only "moving through" is from our main DB to our replicated slave reader DBs, but they are typically only 0 to 1 seconds behind reality, so that's not an issue.
In this case, the problem was that Hemos and I were both editing the story at the same time. He added an icon and posted it at 9:36 EDT live, then I tweaked the text and posted it at 9:38 which was about 40 seconds in the future, then around 9:39 I went back and edited its time back to 9:36... so there were a few seconds there where the story went from front-page to subscriber-only and back.
The Slash backend is obviously too powerful for idiots like us :)
Um, yeah. Actually, I don't know what you're talking about. Entering the phrase "to be or not to be" -- with quotes, so as to indicate you want the phrase, not just the collection of words -- yielded the first two pages of results all having that phrase. Not all of them were for pages on Shakespeare, but then again, that phrase is now deeply buried in the common memespace. If you make the search phrase
you do indeed get results with the phrase and exclusively referring to Shakespeare. Oh, I get it. You don't like the idea you need to actually construct a reasonable search phrase. You're mad that Google isn't, I don't know, telepathic. Your best bet is the SFWIWNFWIS search engine -- search for what I want, not for what I say.
The Mongrel Dogs Who Teach
I've read that there's a real time search monitor in the lobby of Google's HQ. The nastiest words are removed, but other than that you can se exactly what people are searching for.
They have to be pretty confused right now, when thousands of searches for speaker bracelets, motorcycle candles and candle trucks show up on the display!
Martin
Google are aware of this problem and are working on it. I know cause I wrote to them with some example URIs and they replied they are working on some known issues with their servers.
I.O.U One Sig.
2*b || !(2*b) is actually a tautology :P
:P
ducks
I spoke with a friend who helps maintain the google engine. She said that they were running into some problems with a "cleaning agent." Because of all the sites taking advantage of the word revelancy, there are useless sites that simply have a list of words or phrases. It's been posted before that there are many pages designed for GATOR/GAIN spreading or other spyware/adware. She quoted the percentage of junk pages being at 35% to 40%. The cleaning agent was supposed to run through its own searches and check for junk and keep a log.
She didn't say if the problem was that the cleaning agent was clogging searches or if any logged junk pages had been blocked. If so maybe the agent is flawed. In any case, they've stopped using it for the time being.
The counts have been broken for the last five weeks. A count for the word "the" produced fairly consistent results until then of about 3.4 billion. Then it shifted five weeks ago to 5.2 billion. Lately it has been under 2 billion. Now it's just over 2 billion.
Webmasters who have various directories and know exactly how many pages are in each directory, began noticing five weeks ago that Google was reporting approximately twice the number of pages in each directory than have ever existed in that directory. Prior to five weeks ago, Google used to be fairly close to the actual number (assuming that you get a full crawl).
GoogleWatch speculates on the reason why Google has been behaving strangely ever since it stopped doing the traditional deep crawl once per month. The last standard deep crawl was in April but it wasn't used -- Google threw out this data (by their own admission) and reverted to earlier data. The speculative piece was written last June.
Since it was written, Google has started showing "supplemental results" on many searches. It looks like they are running a parallel index. Why would they do this? All the problems Google has been having, along with the supplemental index, seem to support GoogleWatch's theory.
gm candle truck: Results 1 - 10 of about 12,100
fiat candle truck: Results 1 - 10 of about 5,200
audi candle truck: Results 1 - 10 of about 7,090
chrysler candle truck: Results 1 - 10 of about 18,400
ferrari candle truck: Results 1 - 10 of about 9,810
ford candle truck: Your search - ford candle truck - did not match any documents.
Looks like it's about time ford got on the candle truck bandwagon.