Slashdot Mirror


Yahoo! Vs. Google: Algorithm Standoff

An anonymous reader writes "There's a new report out from the guys who brought us the Google keyword density analysis. As they put it, "the goal of this analysis is to compare the keyword density elements of Yahoo's new algorithm with Google's algorithm." They compared 2000 low traffic, non-competitive keywords in the hopes of seeing the algorithms more clearly, without any possible search engine tweakings related to high-traffic keywords. Their findings are interesting. Should you go and rebuild your site based on these findings? Maybe not. It's worth a look though."

270 comments

  1. Search Engine Optimization Professional by Anonymous Coward · · Score: 5, Interesting

    Gee, aren't these the guys responsible for continually diluting the quality of search engine results? I'm getting really tired of sites that present one thing to search engines and something totally different to me.

    1. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 5, Informative

      As always, there are is a grayscale of good and bad search engine optimization. A good webauthor designs a site for the users, but keeps the workings of search engines in mind, too.

      Search engines need help with frames (if anyone can still find a good reason to use them). If you use Flash based navigation, you better make sure that you have a prominent document which links to all pages as well or search engines won't index them. It's also a good idea to use descriptive titles and put what's important at the top of the page. In other words, most good search engine optimization is exactly what you would do to make a site screen-reader or text-browser friendly.

      Then there's link-bombing, show-something-different-to-Google, white-on-white text, redirections, etc.
      It's quickly becoming so that you can't tell someone to optimize a site for inclusion in search indexes or they'll fall into the hands of this kind of scum. It's a little like the word "Hackers". Can't use that anymore without having to explain that you're not illegally breaking into other people's computers.

    2. Re:Search Engine Optimization Professional by Bushcat · · Score: 5, Funny
      If you use Flash based navigation

      That's another set of people that need a whack with a clue stick.

    3. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Depends, for some types of sites it makes sense. I wouldn't oppose Flash navigation on a media-rich site, for example (but there absolutely has to be an alternative way of reaching every page).

    4. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 5, Informative

      I'm getting really tired of sites that present one thing to search engines and something totally different to me.

      Then complain about it. That practice is known as cloaking, and you can get sites blacklisted for it.

    5. Re:Search Engine Optimization Professional by Araneas · · Score: 5, Interesting
      It's an escalating battle. Someone hijacks a keyword that is highly relevant to your site so you have to figure how to overcome that and give users something that isn't porn or a crappy search portal.

      I think it's fair to say there are white hat SEOs as well as black hat hijack^H^H^H^H^H^H SEOs.

    6. Re:Search Engine Optimization Professional by dargaud · · Score: 5, Interesting
      That's what I wanted to submit to the Google programming contest, but it wasn't admittable:
      • Make a 2nd robot that retrieves a few full web pages (with graphics) per site claiming to be IE6 (or a normal Mozila), thus lying about it being from google.
      • Display the page in IE6 (or Mozilla), save the entire display as a bitmap image.
      • Run the bitmap image through an OCR program to extract the real text seen by the user
      • Compare this text with what the ordinary google robot sees.
      • If the text is completely different, lover the ranking
      This gets rid of all the blue on blue keywords, display:none keywords and others. I think it will come to that.
      --
      Non-Linux Penguins ?
    7. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 5, Insightful

      Or even better, just use an intelligent html parser that can work out if text would be hidden and ignore it if it is.

    8. Re:Search Engine Optimization Professional by AbbyNormal · · Score: 1, Funny

      lover the ranking...
      Wouldn't that get a bit messy? Personally, I would lower the ranking, but that's just my opinion. To each their own.

      --
      Sig it.
    9. Re:Search Engine Optimization Professional by SiggyRadiation · · Score: 3, Interesting

      Run the bitmap image through an OCR program to extract the real text seen by the user

      Wouldn't it be smarter to just render both versions and compare bitmaps? No need to OCR then...

      --
      This unique sig is intended to make this user more recognisable.
    10. Re:Search Engine Optimization Professional by DrSkwid · · Score: 1

      > Depends, for some types of sites it makes sense.

      to keep away the non Windows users

      go for it

      use java menus while you're at it

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    11. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 2, Informative

      Pages tend to be quite dynamic these days, so if your comparison depends too much on the surroundings of the actual content, it may produce too many false positives. Take the Slashdot homepage for example: The ads vary in size and position. If a single line wraps differently due to that, an image comparison is bound to fail. On the other hand, OCR on rendered text should be extremely simple, considering you have full control over the fonts used and don't have to take any fuzziness into account.

    12. Re:Search Engine Optimization Professional by Bushcat · · Score: 3, Funny
      use java menus while you're at it

      Wait, I've got another clue stick here somewhere.

    13. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 1

      If a page is going to use Windows Media streaming, platform independence can probably be ruled out as a design goal. Not everyone has to cater to the whole world to find an audience. Besides, in which way does Flash exclude other operating systems?

      Java Menus: Been there, done that. Wouldn't do it again, but it was better (more stable and easier to maintain) than DHTML at the time and there was a plain HTML fallback.

    14. Re:Search Engine Optimization Professional by Hanji · · Score: 1

      No.t ;

      --
      A Minesweeper clone that doesn't suck
    15. Re:Search Engine Optimization Professional by samhalliday · · Score: 4, Interesting
      thats ridiculous... OCR is not needed in this scenario, it is easy enough to write a program to find out what colour the background and foreground of text is, its probably just takes too much time to factor this in to the equation. your method would take _at least_ 10 seconds to even check a simple page (assuming all the code worked, which it wouldn't, cuz its OCR).

      and, this way you are giving a lower ranking to pages which use text in images. it is not good practice to have all the text embedded in images, but it is often necessary for sytle purposes; an example being the logo of a site (ok, alt= should handle this). hell, i even do it! its cleaner than hoping the person on the other side can render the same fonts as me (which would be impossible cuz i filtered then thorugh GIMP to add some effects).

      a lot of sites auto detect robots based on what you are saying, and either block them or launch a seek-and-destroy attack against you. to get around this, the file /robots.txt (which every large site should have) WILL be read by the google/yahoo prowler no matter what, and abided by. it plays the prominent role in what the search engines read... not the server reading the browser tag.

      thats without even going into the algorithms of matching the read OCR text up against the text from the source.

    16. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 1, Informative

      I find most "search engine optimization professionals" to be fulla shit. Let's face it; they have a vested interest in convincing you that they need to continuously (expensively) update your pages to make sure that you rank top 10.

      Aw, bullshit! I have optimized pages now for 3 companies that I have worked for. They all consistently showed up in the first 10 hits (usually top 5) for specific search phrases that related to the business and I did it without using any of their damned "tricks" to do it! Not only that, once gained, the ranking stayed high without updating pages continuously.

      Most of it is common sense and make clearly readable and focused web-pages anyway. This whole idea that there are magical "secrets" to getting high ranking on the search engines is misleading marketing manure.

    17. Re:Search Engine Optimization Professional by a24061 · · Score: 5, Insightful
      Besides, in which way does Flash exclude other operating systems?


      It excludes blind users with screen readers and people who don't or can't install superfluous plug-ins. Flash is great for entertainment but it should never be required for getting information.

    18. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Understood. I did write "media-rich site" though and DrSkwid had something to say about "non Windows users", so please stay focused. If you're a blind Flash hating epileptic on BeOS, you're not in the target demographic of "Webgames&Videos'R'Us".

    19. Re:Search Engine Optimization Professional by scrytch · · Score: 1

      > Search Engine Optimization Professional

      Also known as "Spammers". Their entire purpose is gaming the system to promote crass commerce (and boy I do mean crass), and just to top it off, I haven't seen one that hasn't engaged in spam to promote their service (to be fair, I probably wouldn't see the ones that don't spam, since I'm not in the business of pushing sites up in some silly ratings criterion).

      I never noticed the problem with google til just recently, just a few sites that seem to have my keyword no matter *how* obscure but are filled with nothing but links to ... other search engines, mostly paidclick ones of course. Those are fairly easy to strip out since the abstract is always nonsensical.

      I'd love to see an option on the google toolbar of "report link as googlespam", but I can imagine the manpower that'd be required to sift through submissions (you can't just do it automatically).

      --
      I've finally had it: until slashdot gets article moderation, I am not coming back.
    20. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 1, Insightful

      That's a very close relative to the halting problem if you're not going to render the site to interpret the output, and a real CPU-eater if you're going to render the site.

    21. Re:Search Engine Optimization Professional by anti-trojan · · Score: 1

      Then a 1-pixel difference would spoil the whole idea. What about randomly changing quotes, pictures, and other dynamic data (like the date and time etc).

      OCR'ing and checking for "totally" different content is a better approach (as in the parent post).

    22. Re:Search Engine Optimization Professional by DavidStewartZink · · Score: 1

      Nah, these guys are just scam artists. Google ranks pages by what pages link to them; these people seem unaware of this, andf aren't testing for it.

    23. Re:Search Engine Optimization Professional by SiggyRadiation · · Score: 1

      What's the "difference" between a 1-pixel difference and a 1-character difference?

      You don't need to drop a page's rank for just a 1-pixel (of a square) diff just as you don't need to drop a page's rank for a 1-char/sentence diff. Random quotes and time-related content are going to be a problem in both sollutions.

      Whell, it was just a question. Got answered. Satisfied. :-)

      --
      This unique sig is intended to make this user more recognisable.
    24. Re:Search Engine Optimization Professional by Woogiemonger · · Score: 3, Interesting

      A lot of times, text would be masked by making it a color that blends into the background graphic. A plain background color is intelligible by an HTML parser, but you would need to do at least some form of color histogram/pattern recognition on the background graphic to determine whether or not it is likely to mask keywords. Honestly, I think it's a nice idea, and it's not like every page has to be scanned. It's a way of filtering out a few relatively obvious bad apples, or at least some rather irritatingly hard to read web sites.

    25. Re:Search Engine Optimization Professional by 0x0d0a · · Score: 2, Interesting

      I wouldn't oppose Flash navigation on a media-rich site

      Umm. By definition, if a site is loaded with Flash, it's media-rich. :-)

      I just can't really see Flash being a benefit. Folks thought that it was useful back when it was novel -- ("Look, the web page makes sounds when I click!"). We've gone through this same "novel" phones so many times on the web that it's depressing. When music came out, everyone had to put music on their personal pages, and at first it was kind of cute. Then it got really annoying. Even before that, there was GIF animation. I remember the first time that I saw GIF89a animation. I was enthralled. Here's a copy of this newish Netscape Navigator program and *stuff moves on the screen*. Surprise, a year later, with way too many sites using animated GIFs, I never wanted to see them again (and fortunately, my browser lets me disable their animation).

      Flash is the same thing. It only interests anyone because it's novel. There just aren't any good justifications for using it.

      Actually, no. I believe I've used effectively once before. There was a new MP3 player of some sort out, and you could use an embedded Flash file to try out the interface and see what you liked. That was actually a useful thing.

      Aside from that, Flash on webpages is useless.

      Flash still has some merit for standalone video, as there are no other good vector animation formats (SVG is just plain not designed for animation).

    26. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Search Engine Optimization Professional?

      OMG we are all fscked, playing video games for a phd has more merit than these guys. This is as disturbing as "communication experts" (if you speak you are a communications expert). How do these guys justify getting a paycheck for being lazy assed slackers? At least comm professionals have the fact that most people cant communicate.

    27. Re:Search Engine Optimization Professional by ichimunki · · Score: 3, Interesting

      Yes, but it seems to me that like another escalating battle there will be a simple agent-based, learning algorithm solution.

      Bayesian filters learn to recognize spam and are personalized to the user. They are at least as effective as rules-based mail filters, but very effectively halt the rules race (where the filter writer writes a rule to filter by, and the spammer figures out a way around the rule, rinse, repeat).

      We need something like that for web pages and web searching. It's not just about keywords with sites like Google. It's also about the other parts of their page rank scheme. But imagine that your spider software (unlike Google) was grading its results. For any bad result it could go back and score against every page involved in getting to that result. Same for good results. Next time you search it gives more emphasis to pages in good result trees. Etc.

      I mean, that's not an actual technical idea there (I can think of lots of problems with that sort of spider/agent idea that would keep it from being practical). But that's the kind of thinking we need to be doing. Is there a way to solve this problem of finding information that won't involve a central repository of keyword scores and rankings?

      --
      I do not have a signature
    28. Re:Search Engine Optimization Professional by AndroidonPPC · · Score: 1
      True. A good place to start is just to check bgcolor against text and at least make sure that is ok. once you have a value for text, convert any bgimage into greyscale and find the average grey value in 4x4, 8x8, and/or XxY neighborhoods or do a histogram. basically, just make sure the text contrasts enough with the image.

      Doing a full screen cap would work, but screen cap and OCR would majorly bog down on a processor especially when working with huge arrays (the bitmaps would have to be huge in some cases), and might no be efficient enough to be worthwhile.

      of course, contrast checking might be as well.
      So, what exactly do all the numbers mean? Does anybody else think that this study might have been the result of dead monkey logic?
      Andy

    29. Re:Search Engine Optimization Professional by spentrent · · Score: 1

      FOOLS! Flash is a GREAT MARKETING TOOL. OF COURSE you won't use Flash on a site about liver diseases in 16th century Europe...

    30. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Flash sucks. You can't search for text in it, it takes a long time to load all those silly graphics, clicking on things usually results in waiting through a whiz bang animation (just take me to the information, dammit!), it can't be translated by Babelfish, webpages are meant to be seen not heard, and well the list goes on..

    31. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Marketing tool it is, and exactly how many end users care about advertising? None.

    32. Re:Search Engine Optimization Professional by amplt1337 · · Score: 1
      and, this way you are giving a lower ranking to pages which use text in images.
      If all your text is in images, then you're already not going to rank very highly in Google -- they're not going to OCR your text, they're going to see you as content-free. (Which you most likely are, if you can't be read comfortably in Lynx -- the message is still more important than the medium -- but that's a side issue/flamebait). It's the same problem for folks who are using database-driven sites to serve up lots of content manageably (a most excellent idea); if the dumb spider can't figure out how to read it, they're not getting ranked.

      I'm even going to go out on a limb here and say that the company that can handle database-driven sites best will wind up having a serious advantage in the Search Engine Wars... that company will be best able to index content from well-managed periodical sites, as well as handling commercial sites better (and maybe even putting them in a separate category).
      --
      Freedom isn't free; its price is the well-being of others.
    33. Re:Search Engine Optimization Professional by Hentai · · Score: 2, Interesting

      Now THERE'S an interesting idea - a Google subscription service. I know I'd pay Google $20/mo to dedicate a few megs to customized Bayesian filters that learn MY particular search needs, and remember them for next time. It'd depend on their privacy policy, though.

      --
      -Hentai [in vita non pacem est]
    34. Re:Search Engine Optimization Professional by thbb · · Score: 5, Informative

      Here is where to file a complaint at google. Fast and easy to do, don't hesitate...

    35. Re:Search Engine Optimization Professional by Lozzer · · Score: 3, Informative

      This is the only good use of Flash I've seen. My nephew likes it, anyway.

      --
      Special Relativity: The person in the other queue thinks yours is moving faster.
    36. Re:Search Engine Optimization Professional by tanguyr · · Score: 2, Insightful

      Ah c'mon you're all being just a tad harsh. Sure, gratituous flash usage sucks but flash can do some pretty nifty things: you can build simple web based video conferencing or shared whiteboard apps in flash quickly and easily, and there are some nice games out there as well. Don't think of it as the heavier and even more annoying replacement for animated gifs, think of it as an alternative gui technology - "not-so-thin client".

      As pcs get faster and faster and more and more people get broadband, you will see more and more flash being used - but maybe not used well. One of the problems today is that web designers still think in html, so they do something that looks like moving html at 10 - 20 times the weight and 1/100th the reaction time. /t

      --
      #!/usr/bin/english
    37. Re:Search Engine Optimization Professional by Chris_Jefferson · · Score: 1

      The latest version of flash has support for designing sites suitable for both the blind people.

      True, it blocks people who don't install the plugin, but by that argument perhaps all sites should be compatable with me just telneting in case I don't want to install a web browser at all?

      --
      Combination - fun iPhone puzzling
    38. Re:Search Engine Optimization Professional by JimDabell · · Score: 2, Interesting

      Or even better, just use an intelligent html parser that can work out if text would be hidden and ignore it if it is.

      There are legitimate reasons for hiding text. For example, putting help text into a page, and only showing it when the user clicks a help button (far more friendly than popups).

    39. Re:Search Engine Optimization Professional by samhalliday · · Score: 1
      If all your text is in images, then you're already not going to rank very highly in Google

      yeah, you're correct. but there is the alt= flag which displays the alternative object should the main one not be displayable; thats what i use for all my text which is in an image... i duplicate it in alt= for lynx users to read [and help w3c validator happy :-)]

    40. Re:Search Engine Optimization Professional by Talinom · · Score: 1

      The absolute best search results I ever got were from a program called "Bullseye". I believe Infoseek had a link to it for a while.

      This program would go to about 30 different search engines, throw your query at them, return the results, go to each page listed, and search the contents of that actual page to see if what you were looking for was really there.

      There was the option to search just the web, newsgroups, forums, onlines stores, and so on. Most of the methods for submitting the query to the search engines, as well as the returned results, have changed for most search engines so it doesn't work anymore. Plus many of the search engines are long gone so the results were getting more limited as time went on.

      Has anyone developed another tool that does this? Were I a coder I would find the time to make this myself and distribute it under the GPL. Sadly my coding consists of making my computer say "Hello, World".

      --
      "Giving money and power to governments is like giving whiskey and car keys to teenage boys." - P.J. O'Rourke
    41. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Web sites ARE compatible with you just telneting in.

      ---
      telnet slashdot.org 80
      GET / HTTP/1.0
      HOST: slashdot.org

      ---

      At this point you will get the web page at http://slashdot.org/

      Remember: 2 newlines at the end.

      Any other stupid comparisons you wish to make?

    42. Re:Search Engine Optimization Professional by DrSkwid · · Score: 5, Interesting

      Besides, in which way does Flash exclude other operating systems?

      Let's see

      Mozilla on FreeBSD (that's me) :

      We are unable to locate a single Web player that best matches your platform and operating system

      Mothra on plan9 (also me)

      We are unable to locate a single Web player that best matches your platform and operating system

      The acceptable list is :

      Windows 98/ME/2000/XP - Internet Explorer/AOL/Netscape/Mozilla/Opera/CompuServe - Flash 7

      Mac OSX / OS9 - Internet Explorer/Safari/Netscape/Mozilla/Opera - Flash 7

      Other Operating Systems
      Linux x86 Flash Player 6 for Mozilla 1.1 - (Not officially supported by Macromedia.)
      Pocket PC Flash Player 6 for Pocket PC 2003 (color devices supported only)
      OS/2 Flash Player 4 for Netscape
      Sun Solaris (Sparc/Intel) Flash Player 6 for Netscape
      HP-UX Flash Player 6 for Netscape
      SGI IRIX Flash Player 4 for Netscape

      On my 500,000 page impression web site, using Flash would have excluded the otherwise successful visitors running the following OS

      CPM
      Windows 3.xx
      WebTV
      OSF Unix
      Aix
      NetBSD

      I will admit that the actual numbers are low but being excluded/ignored is how us non Windows users are treated day in day out. Seems you can't fight the pigopolists.

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    43. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0
      True dat. God, that reminds me of my last boss. She was the worst of the dot-bomb snake-oil salespeople (and I do mean salespeople, as she knew not one technical lick of what she spoke!) I knew she was trouble when she came on board and started using-- with a straight face-- terms like "repurposing," "thinking outside the box," "value-added experience," etc. As the Simpsons put it, "Paradigm? Proactive? Aren't those just words that stupid people use to sound important?" But, I guess that's why she and her husband had to declare bankruptcy on their dotcom scum business. Ha ha.

      Everything in her life was apparently a scam, though, as she constantly bragged about writing off every room in her house on her taxes... hope she enjoys her audit!

    44. Re:Search Engine Optimization Professional by mlilback · · Score: 2, Informative

      Flash excludes using that site from my Treo. Pretty lame when I go to someones website for an address/phone number and am unable to get it on my mobile browser (pda, phone, etc). Same goes for restaurant menus, hours of operation, etc.

      Too many sites today just contain a big flash file.

    45. Re:Search Engine Optimization Professional by drpentode · · Score: 1

      Just because a site is media rich doesn't mean it's using Flash. Media rich means that there's audio and video as well as text, and you don't need Flash for audio or video. These can be done elegantly with HTML and embedded MPEG, AVI or WM* objects.

      Also, Flash greatly enhances the user experience, especially in online learning situations where animations are required or on sites where navigation goes beyond the standard HTML hierarchy. Sure, some people may be excluded, but those are the extreme minority.

      Flash content is great for the disabled because it can screen read for them, or the interface can be tweaked. Color contrast can be adjusted, font size can be adjusted, etc., without having to rely on flaky CSS support from browsers and bad programmers.

      It's all a matter of using the right tool for the right job. Overusing Flash sucks, but so does not using it when it's needed.

    46. Re:Search Engine Optimization Professional by cpeterso · · Score: 1


      man, that is some trippy shit!! :)

    47. Re:Search Engine Optimization Professional by dcam · · Score: 2, Insightful

      Add Javascript into the mix and this gets pretty difficult. I write some pages that only display stuff as the result of an onload event for the page. For example it is often more efficient to load up a javascript array with values and use that to populate a drop down list, as it can avoid roud trips to the server to regenerate the ddlist.

      There are also pages that only display stuff when you click on something. The MSDN pages commonly do this, often for things like code samples. They display a basic page with the option to show more information under relevant sections.

      I've also written a page that is generated almost entirely by javascript, basically because you can regenerate the page in response to certain actions on the client without having to make a round trip to the server each time.

      How intelligent are you going to make this parser again?

      --
      meh
    48. Re:Search Engine Optimization Professional by dcam · · Score: 1

      What about a background image? You'd need to somehow work out where the text is going render to relative to the background.

      OCR isn't such a bad idea, except for the reasons onlined in my earlier comment about Javascript.

      --
      meh
    49. Re:Search Engine Optimization Professional by iantri · · Score: 1

      They ARE. One can easily read the HTML source code and make a fair bit of sense of the page.

    50. Re:Search Engine Optimization Professional by Boltronics · · Score: 2, Informative

      It's no wonder they don't support Flash on GNU/Linux systems - it's buggy as hell! I was a beta tester for Macromedia's GNU/Linux player (offered the opportunity after reporting so many bugs in FlashMX). Needless to say, many bugs never got fixed. My browser (FireFox) often crashes - and it's almost always due to a Flash-'enhanced' or Java-'enhanced' site. They should just enhance gplflash instead of making their own version, but they don't want to release the Flash specifications of new versions for quite some time after it's release (to give them an advantage over the competition).

      Of course by that stage, most of the motive to develop a free software player is lost by the community anyway.

      --
      It's GNU/Linux dammit!
    51. Re:Search Engine Optimization Professional by mapinguari · · Score: 1

      Sure, but the search engine shouldn't be taking into account help text that most users won't actually see when viewing the page.

    52. Re:Search Engine Optimization Professional by JimDabell · · Score: 1

      The help text was just an example. What about the technique that presents a single page as a series of tabs, or the related technique of presenting a long form as a series of pages? Search engines should certainly take that text into consideration.

    53. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      > ... for both the blind people.

      If there is only 2 of them, why bother?

    54. Re:Search Engine Optimization Professional by yaar · · Score: 1

      And for the small lot of us running Linux or BSD on an "exotic" architecture, Flash isn't an option. (To my knowledge, Flash is only available as a binary, and I don't know of any free alternatives.)

      --
      "Nothing in education is so astonishing as the amount of ignorance it accumulates in the form of inert facts." - Henry A
    55. Re:Search Engine Optimization Professional by 0x0d0a · · Score: 1

      Just because a site is media rich doesn't mean it's using Flash.

      That's not what I claimed -- it's the converse of what I said.

      The original poster wrote that using Flash on media-rich sites is justified. I followed up that using Flash *makes* a site media-rich. You said that not all media-rich sites use Flash, which while true, is only a counterargument to the converse of what I said.

      Also, Flash greatly enhances the user experience, especially in online learning situations where animations are required or on sites where navigation goes beyond the standard HTML hierarchy.

      I can't think of a situation where online learning would require Flash because it's online learning.

      Flash content is great for the disabled because it can screen read for them,

      The same can be read for non-Flash, with the added benefit that a disabled person can set their browser up to do so for *all* sites, as opposed to the one where the webmaster implemented speaking functionality.

      Color contrast can be adjusted, font size can be adjusted, etc

      Both can be done without Flash -- I enlarge my own browser font so that I can sit well back from the monitor. Heck, the lightweight browser I use is dillo, which *definitely* doesn't have any CSS support, and certainly supports increasing font size.

    56. Re:Search Engine Optimization Professional by RzUpAnmsCwrds · · Score: 2, Informative

      Not the latest Flash. Flash is now compatible with Microsoft Active Accessibility, which means that screen readers can read and activate buttons and other objects in Flash applications. Tab navigation also now works.

      Try it. Start up Microsoft Narrator (Windows Key + U on Windows 2000/XP) and head to macromedia.com.

    57. Re:Search Engine Optimization Professional by forlornhope · · Score: 1

      I have to agree that Flash used poorly is a horrible thing. But Flash does have its uses.

      Who can argue with Strong Bad.

      Seriously though, Flash, like many other tools, can always be used poorly. Its up to the web designer to use the tools properly and if you encounter a site that uses any tool in a way that you dont like, dont just degrade a tool for someone else's mistake, degrade that person or orginization for being braindead.

      Note: For those people not using one of macromedia's supported operating system, there is an open source plugin that Im currently using to view all flash and I have seen only a few instances where it does not work.

      --
      "We Don't Need No Truthless Heros!" - Project 86
    58. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Please note the quoted term "totally"...

    59. Re:Search Engine Optimization Professional by mabinogi · · Score: 1

      hah, do you really believe the CPM user is real?
      They've got to be masking their user agent.

      --
      Advanced users are users too!
    60. Re:Search Engine Optimization Professional by SlashSim · · Score: 1

      I'd love to see an option on the google toolbar of "report link as googlespam", but I can imagine the manpower that'd be required to sift through submissions (you can't just do it automatically).

      That's a great idea. I recall hearing about a NASA project using volunteers on the internet to classify craters on Mars from images. To combat innacurate submissions they had many different people classify the same image and filtered out the spurious results.

      For Google, a statisticaly meaningful number of "googlespam" hits for a page could reduce its pagerank. Use the parallel processing power of millions of human brains to defeat spammers. Some form of abuse prevention would be nessecary, of course, but Google already has a cookie on my machine and many talented software engineers.

      I would cheerfully offer a half second of my lifetime to demote scumbag spammers in google's ranking. In fact, I expect it would elicit a gleeful cackle from me as I hit the button.

      Take THAT evil spammer scumbag!

      --
      If the only tool you have is a hammer, you'd better start looking for a carpentry job.
    61. Re:Search Engine Optimization Professional by Anonymous Coward · · Score: 0

      Tell that to my client, he just won't drop the flash idea. I tell him, "Look Bob, Flash doesn't work for a Diseases From The Middle Ages to Modern Day portal site" but he doesn't listen.

  2. Uhhh... by Anonymous Coward · · Score: 1, Insightful

    I don't get what the results of this test were. Did Google have better handling of the density, or did Yahoo? Is bigger better or does smaller win out?

    1. Re:Uhhh... by ran-o-matic · · Score: 1

      The test wasn't meant to determine which search produced better results. The results of the test help you determine what to put on a web page to get a higher rank on Google or Yahoo. Its really just trying to scam the search engines.

  3. Re:Yahoo? by Bigman · · Score: 3, Informative
    From the article:
    Following months of testing their search engine results, Yahoo has now launched their search algorithm and replaced the Google results they had been using.
    So you are way out of touch, I'm afraid :o)
    --
    *--BigMan--- Time flies like an arrow.. but personally I prefer a nice glass of wine!
  4. Re:Yahoo? by Pharmboy · · Score: 3, Informative

    Nope, the big changover was a few days ago. Even had a story here on it. Inktomi now provides the smarts for the yahoo search, and MSN and Lycos as well.

    --
    Tequila: It's not just for breakfast anymore!
  5. If Yahoo wants my vote... by PoprocksCk · · Score: 4, Interesting

    ...they'll have to get rid of all that junk on their home page. Much of the reason for my using Google is that its home page is simple, it loads quickly, and it is just so easy to _search_, which is what a search engine should be. Yahoo failed when it became a "portal" and tried to do too much by itself. If they could somehow reduce the size of Yahoo's page down to that of Google (that would mean getting rid of those ads, guys) then maybe I'd consider trying it.

    1. Re:If Yahoo wants my vote... by penultimatepost · · Score: 5, Informative

      That's why They have: http://Search.yahoo.com

    2. Re:If Yahoo wants my vote... by ravishjunk · · Score: 2, Informative

      Actually you can directly go here, http://search.yahoo.com which is more or less clean.

    3. Re:If Yahoo wants my vote... by PoprocksCk · · Score: 5, Interesting

      Heh...

      Well that's all well and good, but how many people would know to type that in?

      Has anyone looked at altavista lately? They've certainly taken the Google route, and their home page looks a lot like Google now, as does search.yahoo.com. However, in search.yahoo.com _and_ altavista, I noticed that "sponsored results" show up before the real ones, but they appear in the list just the same. That could confuse newbies, and I prefer the approach Google has taken to advertising (shoving the ads to a separate entity on the right, and keeping them text-based).

    4. Re:If Yahoo wants my vote... by ak3ldama · · Score: 5, Informative

      I know many people who use Yahoo! as a home page and they like the many services that are offered by Yahoo! besides just the search facilities. If all they wanted was search I doubt they would use yahoo.com for their homepage.

      --
      "but money is the God of Algiers & Mahomet their prophet." - Rich. O'Bryen June 8th 1786
    5. Re:If Yahoo wants my vote... by Pedrito · · Score: 5, Funny

      "Yahoo failed when it became a "portal"..."

      It failed? If a market cap of 28 BILLION dollars is failure, what do I have to do wrong to get there?

    6. Re:If Yahoo wants my vote... by PoprocksCk · · Score: 2, Informative

      Well, I meant that it failed as a search engine... if you go back to the year 1995 or so, Yahoo was _the_ search engine, and then around 1997 it became Altavista, and now it's Google. Yahoo has not failed as a portal -- they've got one of the best portals out there. But when they became a portal, they became less and less of a search engine in my opinion.

    7. Re:If Yahoo wants my vote... by Sique · · Score: 5, Informative

      Yahoo never was a search engine in the pure sense of word. Yahoo started out as a browsable catalogue of the Web, where every entry was put into categories by hand. The automated search came later and was bought as service from external providers up until now.

      --
      .sig: Sique *sigh*
    8. Re:If Yahoo wants my vote... by whatever74 · · Score: 1

      They got too get rid off that "SPONSOR RESULTS" which make me scrolled a min before I can see my search results. Those sponsor result are way to intrusive! So far.... Google 1 Yahoo 0

    9. Re:If Yahoo wants my vote... by Anonymous Coward · · Score: 0

      Tried raging.altavista.com yet? That's been there since before I knew Google (and that's been quite a while ofcourse).

      Used to be on ragingbull.com, dunno if it still is.

    10. Re:If Yahoo wants my vote... by anti-trojan · · Score: 1

      Correct. First they used Altavista, then Google, and now Inktomi.

      Maybe they should dump the catalogue (is it still profitable?) and switch to Open Directory as well...

    11. Re:If Yahoo wants my vote... by anti-trojan · · Score: 1

      However, in search.yahoo.com _and_ altavista, I noticed that "sponsored results" show up before the real ones, but they appear in the list just the same. That could confuse newbies, and I prefer the approach Google has taken to advertising (shoving the ads to a separate entity on the right, and keeping them text-based).

      You can buy similar adspace from Google too. Check out this for example.

      But at least they change the background color of the ads and put a "Sponsored Link" notice.

    12. Re:If Yahoo wants my vote... by polymorpheus · · Score: 1

      how is search.yahoo.com more cluttered than google.com?

    13. Re:If Yahoo wants my vote... by demonbug · · Score: 2, Interesting

      I've been using Yahoo for years as my homepage, it was quick and easy to set up nice news summaries and stock market summaries on things I was interested in. Their search feature always sucked, though, so I have used Google for that purpose. Unless Yahoo Search comes up with much better results than Google, I see no reason to change this. It isn't all that hard to type in "google.com" when I want to search for something.

    14. Re:If Yahoo wants my vote... by Shadarr · · Score: 1

      Me too. I use Yahoo for pretty much everything except searching.

  6. And a User Friendly game to go along! by Indras · · Score: 5, Interesting

    Just grab a friend and a deck of cards, and you can play Yahoo vs. Google at home.

    --
    The speed of time is one second per second.
    1. Re:And a User Friendly game to go along! by Anonymous Coward · · Score: 5, Interesting

      Chris Langreiter has a cute toy to compare Yahoo vs. Google results.
      Touch the dots !

      It's written in REBOL

    2. Re:And a User Friendly game to go along! by Psychic+Burrito · · Score: 1

      Very cool toy, thanks!

      What I already found out with this tool: For movie queries, I always feel that the imdb is a very good resource. Google always neglected the imdb a bit, but now Yahoo is much better.

    3. Re:And a User Friendly game to go along! by Destoo · · Score: 2, Informative

      and of course, a quick peek at imdb.com/robots.txt could explain why...

      # robots.txt for http://imdb.com/

      User-agent: Mediapartners-Google*
      Disallow: /ActorSearch
      Disallow: /ActressSearch
      Disallow: /AddRecommendation ...

      It also includes "User-agent: *" about halfway through, but the list was different at some time..

      You can always check the previous versions of the robots.txt on the wayback machine

      --
      Nouvelles de jeux et technologies en français. TC
    4. Re:And a User Friendly game to go along! by Psychic+Burrito · · Score: 1
      Well...
      • the important URLs /title and /name are not on the exclude list for Google in the robots.txt
      • To inhibit google from going through your page, you would need to address it by its name, which is "googlebot". This "Mediapartners-Google" is something different
      • The important point is that imdb links still appear in Google, but often not among the first 5 hits. So this entirely up to Google's and its ranking algorithm.
      So, I don't think this will explain why Google values imdb links as low as it does.
    5. Re:And a User Friendly game to go along! by Anonymous Coward · · Score: 0
  7. I think by Bishop,+Martin · · Score: 5, Insightful

    Google is way too embedded in everyones everyday life, it will just naturally be more widely used. When was the last time you heard someone say "Yahoo it"?

    --
    Setec Astronomy
    1. Re:I think by rosie_bhjp · · Score: 1

      This is true but when you make a Xerox of something you are more likely to be using a copier made by Konica, Ricoh, Sharp, etc. Brand name awareness doesn't necessarily mean dominance. That is exactly the attitude that brought Xerox down in the 70's/80's.

      --
      A radio maverick jumps to internet only. The Future of Rock n Roll
  8. Re:Yahoo? by sam1am · · Score: 4, Informative

    Yahoo! Switches Search Engines (Wednesday February 18, @09:51AM) has the info on when this happened.

  9. Google Super Computer? by YanceyAI · · Score: 3, Interesting

    Wasn't there a Slashdot article claiming that the Google servers may be the fastest super computer in the world, but they are so busy they couldn't run the benchmark? I can't find it now. If that's the case, how does Yahoo compete? By dividing the traffic? Can anyone link me?

    --
    Can I bum a sig?
    1. Re:Google Super Computer? by PoprocksCk · · Score: 2, Informative

      Well, I know that Google runs on what is, I believe, the world's largest Linux cluster.

      For those of you who don't know, a cluster is (as far as my understanding takes me) when you take several ordinary computers and link them together, providing a cheaper way to get a "fake" supercomputer.

    2. Re:Google Super Computer? by /ASCII · · Score: 5, Informative

      Your statement is not completely correct. There is nothing "fake" about a cluster based supercomputer. In fact, all sufficiently large supercomputers are cluster based. Many of them use special purpose, low latency NICS and switches, and proprietary communication protocols, but the underlying principle of a Beowulf cluster is the same as that of the Earth simulator.

      --
      Try out fish, the friendly interactive shell.
    3. Re:Google Super Computer? by Anonymous Coward · · Score: 0

      Linking ordinary computers does not produce a supercomputer. This was discussed in response to a recent Slashdot article about an attempt to get an ad-hoc Linux cluster into the Top-500 supercomputer list. The most you can realistically hope for is a system which can handle many independent tasks, somewhere between what grid computing can provide and what you need real supercomputers for. The scale from Grid computing through cheap clusters to supercomputers is mostly a scale of network latency and throughput. It could therefore be argued that the Google cluster may have the aggregated power of a supercomputer, but is not capable of efficiently working on supercomputer-typical problems.

  10. Re:Yahoo? by MoriarGryphon · · Score: 5, Interesting

    RTFM, Yahoo is switching to their own engine.

    Personally, I find the differences in how the two engines handle bold text to be most interesting. If only for that, I'd stick to Google.

    Most pages that have 17 occurences of your search text in bold are only going to be Porn sites ((unrelated to your search)) or Spam sites ((unrelated to your search)).

  11. A layman's view by EulerX07 · · Score: 3, Insightful

    Yesyer I was hearing a colleague curse at his computer yesterday because he was looking for something specific.

    "Man, Goggle SUCKS now!, I'll try yahoo."

    "DAMN! Yahoo sucks even more!"

    I have to admit that I used to think google was incredible just after it came out, but nowadays I'm used to wading through 10-15 pages of results before finding something relevant to what I need.

    1. Re:A layman's view by Quaryon · · Score: 5, Interesting

      Is anyone else getting so annoyed by pages which grab your keyword and then direct you to Amazon, no matter what the topic? Seems that every time I do a search on Google and find a site which looks interesting they're either just ripping Amazon's content or redirecting me there.

      Guys, if I wanted to go to Amazon I would just type "www.amazon.co.uk" into my browser.. If I'm searching on Google it's because I've either already looked at Amazon and didn't find what I want, or because Amazon is really not relevant..

      I've started adding "-amazon -kelkoo -dooyoo -pricewatch" and others to my Google searches recently which helps cut down the chaff a little, but doesn't seem to cut out all the Amazon ripoffs.

      Q.

    2. Re:A layman's view by Anonymous Coward · · Score: 2, Interesting

      >>
      I have to admit that I used to think google was incredible just after it came out, but nowadays I'm used to wading through 10-15 pages of results before finding something relevant to what I need.

      Yep. I agree. I search for something as simple as "Philips DVD driver" for a Philips DVDRom drive and I get at least five adds selling Philips CD/DVDRom drives before I find a "SINGLE" reference to Philips themselves. Is this what Google has become? Maybe I should have put an 's' on driver.

      Codifex Maximus

    3. Re:A layman's view by dubiousmike · · Score: 2, Informative

      Do you know how often I hear someone say they cant find something on Google?

      Then I walk over and find it within 2 minutes.

      People still don't really know how to use search engines. They don't use enough keywords or the right ones.

      I wont use Yahoo for Search. I think they are hella shady with their privacy policies (they switched my preferences when "aquiring" new services from 3rd parties which I was a member of).

      Their games and fantasy sports stuff is fun though. Its all about the value they give me when it comes to my privacy.

      I trust Google, so they get my search querries.

    4. Re:A layman's view by pledibus · · Score: 5, Interesting

      I think google's ranking system needs a major overhaul; various sleazy companies have become *much* too effective at fooling it. For example, below are the first three hits that I got by typing "prozac suicide" into google (I've deleted the URLs to protect the guilty :-). Most of the top 20 hits are similar to these.

      prozac suicide
      Prozac prozac suicide. prozac nation nude Viagra prozac hair loss Paxil
      prozac dogs Yasmin ssri prozac Propecia prozac ocd. ... prozac suicide. ...

      Prozac Suicide - Shopping and Discounts - PROZAC SUICIDE
      Prozac Suicide Prozac Suicide. Are you looking for Prozac Suicide? We've searched
      the internet for the best Prozac Suicide and we hope you enjoy what you find! ...

      Prozac Suicide
      Real Pharm - Lowest Prices & Fantastic Service - Prozac Suicide, ... Prozac
      Suicide Prozac Suicide. Prozac(R) is a selective serotonin ...

    5. Re:A layman's view by a24061 · · Score: 2, Interesting
      I wont use Yahoo for Search. I think they are hella shady with their privacy policies

      What about Google's perpetual data retention and refusal to say what they may or may not do with the info?

    6. Re:A layman's view by Dalroth · · Score: 1

      Try -search as well. You're already at Google (which is a search engine), so you don't need to see links to other search engine sites. That should help cut out even more of the cruft.

      Bryan

    7. Re:A layman's view by evilmatt · · Score: 0

      And yet, weirdly enough, if you click on the second result to that query, you wind up on a page of low fat Mexican food recipes.

      Wonder what they're trying to tell us here?

    8. Re:A layman's view by jafuser · · Score: 3, Informative

      Every time I hit one of these para-sites I reach up to my trusty old Google toolbar and click the blue sad face. I encourage others to do the same.

      --
      Please consider making an automatic monthly recurring donation to the EFF
    9. Re:A layman's view by an_mo · · Score: 1

      Even more annoying: typing the exact title of a book on Amazon's search box, and getting a list of 5 or 6 different books, then a sponsored link to the exact book sold by a different bookstore. Then typing the exact title on the advanced search option of amazon, wading through the results to find that amazon sells it at the same price.
      Try to search for "my art my life" using "all products"

    10. Re:A layman's view by Anonymous Coward · · Score: 0

      I've started adding "-amazon -kelkoo -dooyoo -pricewatch" and others to my Google searches recently which helps cut down the chaff a little, but doesn't seem to cut out all the Amazon ripoffs.

      That's a nice idea. An even nicer idea would be if someone would make a frontend for google that automatically inserts a bunch of those to your searches to reduce irrelevant content. I'm thinking something like a few checkboxes
      Remove search results from
      [ ] Shops (Amazon...)
      [ ] Blogs
      etc.

    11. Re:A layman's view by Anonymous Coward · · Score: 0

      For those who don't have the google toolbar, what does hitting the blue sad face do?

    12. Re:A layman's view by rofthorax · · Score: 1

      Not many people search for this stuff, that's why you get these pages.. If you did the searches and spent some time on the pages (make sure you have a google toolbar) then the favorable results would appear at the top of the pages, but you would have to have more people doing this.. Doing these unlikely searches just points out that you have very uncommon tastes in search terms and there isn't very many people out there looking for what you are interested in.. Also you could use this, as if you were selling something, to determine what people were interested in.. I wonder if Google sells their search statistics to marketing comitees..

      --
      Just say no to license servers!!
    13. Re:A layman's view by barthrh2 · · Score: 1

      That's why I've moved to Vivisimo: - meta search that hits all the best engines (customizable) - Really basic front page (Google-esque) and a good advanced page - Clusters results in to categories so you can easily filter the relevant stuff out. - Lots of other nifty tools, like page preview right from the search results page. Dogpile also uses Vivisimo's technology, but overall I find Vivisimo more usable (better tree, fewer graphics). Because I get my Google results through Vivisimo (it searches Netscape, which uses Google), I never use traditional engines except for comparison purposes, category browsin and, of course, Google Groups.

    14. Re:A layman's view by ran-o-matic · · Score: 1

      Unless things have changed, the blue sad face does nothing. The voting buttons were being tested as a way to provide feedback to Google. The sad face would tell Google the result was bad. It should be cool when they get it working.

    15. Re:A layman's view by Anonymous Coward · · Score: 0
      That's why I've moved to Vivisimo: - meta search that hits all the best engines (customizable) - Really basic front page (Google-esque) and a good advanced page - Clusters results in to categories so you can easily filter the relevant stuff out. - Lots of other nifty tools, like page preview right from the search results page.


      So, remember Vivisimo, folks, for all your searching needs. That's Vivisimo.

      The preceeding message was paid for by Vivisimo.

    16. Re:A layman's view by base_chakra · · Score: 1

      Every time I hit one of these para-sites I reach up to my trusty old Google toolbar and click the blue sad face. I encourage others to do the same.

      Of course you realize you're also simultaneously encouraging people to use IE on Windows, because (sadly) that's all the Toolbar runs on. :p

    17. Re:A layman's view by slightly_kooky · · Score: 0

      I was onsite at a customers today, and they had some virus or something.

      Whatever they searched for on google, they got loads of popups (including amazon). Left right and centre.

      Pain in the arse, but that is how the non techies see the web - until I cleaned their machine out.

      Tim

    18. Re:A layman's view by W2k · · Score: 1

      That's not true. Next time, you ought to check your facts before posting. I suggest using Google . :)

      Moz/Fb/Ffox users, the download link is here.

      --
      Quality, performance, value; you get only two, and you don't always get to pick.
    19. Re:A layman's view by base_chakra · · Score: 1

      Kettle, check your blackness. The unofficial GoogleBar doesn't emulate the features (e.g., 'sad face') the poster described.

  12. Pattern Recognition by Space+cowboy · · Score: 5, Interesting

    This is essentially a problem in pattern recognition, and it's a damn hard problem to solve because of the disparity between the high-volume and low-volume words.

    Information is essentially the inverse of entropy. Entropy can be calculated, and you can use Bayes probability theory to get a hold on the information content of a given word within a set of words.

    What is difficult to do, and what search engines are trying to do, is measure the mutual information inherent between the set of pages that the word appears in, and the word itself, then apply that to all the words in the searched-for phrase; this is commonly called 'context'. This is plainly impossible to do for every given phrase, for every word combination, for every page indexed. The best you can do is use a statistical approach (and Bayes is your friend again) to come up with "good" matches.

    The problem with the statistical approach is the class unbiasing, since once you have wildly different statistical populations, your choice of context gets harder and harder - the "easy" standard models don't cope very well. You don't have the computational resources to do a good analysis, so you're essentially stuck between a rock and a hard place.

    This is why the google idea of strengthening the importance of a word depending on linked pages was such a good one - it "did" the hard work by relying on the entire planet to do it for them, by creating links. Of course, what one man can do, another can undo, and Google has got progressively worse over time. It's still by-far the best though, and my search engine of choice. When you look at the queries from search-sites, I get 100x as many from Google as Yahoo (next nearest)....

    People think searching is easy, and it is. What's really really hard is searching *well*.

    Simon

    --
    Physicists get Hadrons!
    1. Re:Pattern Recognition by Eivind · · Score: 5, Interesting
      And what is even harder, as you sorta hint at, is searching well in a world where thousands of people do their damnedest best to game the system.

      Google doesn't only have to make sense of a great big mess.

      It has to make sense of a great big mess where a significant part of the pages are made *spesifically* to confuse Google, and where a part of those same pages gets tuned regularily in dedicated attempts at confusing whichever algorithm google use more.

      Most of the cases where Google returns poor results these days, it's obvious to a human observer that the bad results on top are *purposely* made to confuse Google. I've even seen pages that return one set of content if your user-agent is "Googlebot", and another, totally different content (dialer, etc) if your user-agent is anything else.

    2. Re:Pattern Recognition by fermion · · Score: 2, Insightful
      The other problem with a statistical approach the general assumption the the data is largely unbiased. The problem with search engines is that they assume that information gathered from a self selected unmonitored population is valid. In the pre-google days this meant that we assumed that individual keywords were meaningful. Now we assume that links are meaningful. Neither of these are strictly true as we have intelligent agents with the mean and motivation to lie.

      Statistically we should have some information gathering and analysis targeted towards assessing the validity of the information. Are the links themselves information or entropy? This is what google is working on. I think we are going to need some human processing, which is the link at the bottom of the pages asking if the results are useful.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    3. Re:Pattern Recognition by ericspinder · · Score: 2, Informative
      I've even seen pages that return one set of content if your user-agent is "Googlebot", and another, totally different content (dialer, etc) if your user-agent is anything else.
      This is probally Google's biggest problem. What they need to do is make a second pass at specific pages in a site which has recently been crawled with a more typical USER-AGENT to see if there is significant differences. They whould have to hit every page. The second crawler could also check to see what is "visiable" to the user. When sites fail the second crawler, lower (or eliminate) their ranking. If the Secondary crawler has enough problems with a domain they can lower all of the rankings on it. If they have enough problems with an IP address (say over 35% of know domains)they can do the same for that address, ditto for subnets.

      Basicly use automated "scammer checks" to see if a particular area of the web is less "reputable" and then rank that area (directory, domain, IP address, subnet, netblock) lower. These scammer checks may change based on the flavor of the month.

      --
      The grass is only greener, if you don't take care of your own lawn.
    4. Re:Pattern Recognition by ericspinder · · Score: 0
      Sorry for replying to myself, but there is a significant error...

      The second crawler would not have to hit every page.

      --
      The grass is only greener, if you don't take care of your own lawn.
    5. Re:Pattern Recognition by fupeg · · Score: 1

      This why good search engines from Yahoo and (hopefully) MSN are important. Google has a Microsoft like stranglehold on searches. So if you want to fool people, you only have to figure out Google. It's a search monoculture, and as such it will be exploited just like an open port on Windows. If there are competing search engines using significantly different algorithms, then it's a lot more difficult to spoof all these engine and search results on all three, including Google, will improve.

    6. Re:Pattern Recognition by Eivind · · Score: 1
      That would indeed help. Scammer-checks would have to come from random and not known-to-be-google ip-adresses too, to be useful. For obvious reasons I haven't observed this first-hand, but it would surprise me a lot if there aren't similar sites which return one page if your ip-adress is Google, and a completely different page if your ip-adress is anything else.

      It's like the difference between engineering and security. Engineering requires that you program murphys computer. (if anything can go wrong - it will.) It requires that you take into account all situations the program will reasonably meet in its lifetime. It is very hard to do well.

      Security requires, however, that you take into account not only things that can "go wrong", but even things that go wrong on purpose. That you deal with a world in which you don't only have "bad luck", but rather a world in which someone is doing their damnedest to force you into this particular bad luck.

      From an engineering-standpoint, a program that has a race condition if run on february 29th, on a day with full-moon, with the PID 37912, with a certain input that'll never occur in normal data, will probably never expose that race-condition, much less will it be a problem. (it'll still be bad engineering, but it'll probably be without consequences)

      In a security-context, however, the fault could be (depending on cicumstances) disastrous. People will spend considerable effort convincing your program that it's february 29th and that the moon is full, they'll feed it only the single bad input, thousands of times over if nessecary,they'll run "echo" to wrap-around the pid so that you'll always have the one single bad pid and they'll have a hundred threads in the background doing their damnedest to take advantage of the race-condition and create that file in that split-second when it must not be created.

    7. Re:Pattern Recognition by psycho · · Score: 1

      Information the inverse of entropy? More entropy means less information?
      Way to go, Shannon.

    8. Re:Pattern Recognition by Space+cowboy · · Score: 1

      Two things here:

      1) You're invoking Shannon, whose work provides a law limiting the amount of information that can be sent down a channel with a given capacity. I never mentioned capacity - I said two things were the inverse of each other, and they essentially are. Shannon's work obviously still applies, if limits are placed on capacity.

      2) Information is indeed the inverse of entropy. Consider the standard example: Two glass enclosures, one (Y) is a perfect vacuum, the other *X) has 1 million Hydrogen (H2) molecules in it and one Helium (He) atom.

      At the beginning of the thought-experiment, the probability that the He atom is in container X [p(X)] is 1.0, and the probability of it being in container Y [p(Y)] is 0.0

      Now introduce a path between the two containers. Due to statistical mechanics, (and more coarsely, thermodynamics), the gas in X will migrate to Y over time. As it does so, the probability p(X) drops, and p(Y) increases, until an equilibrium is reached where p(X) = p(Y) = 0.5. There is a linear relationship between p(X) and p(Y) throughout this time.

      The point of equilibrium is the point of maximum entropy and the point of minimum information. Conversely, the starting conditions are the point of minimum entropy, and maximum information.

      You therefore have a starting state where entropy is a minimum and information a maximum. You have a final state where entropy is a maximum and information a minimum. You have a linear progression between the two points. This is an inverse relationship between information and entropy.

      QED.

      Perhaps I should point out that my PhD was in information processing using Neural Networks. The information contained within the feature-sets used was characterised by loss-of-entropy within those feature sets as information increased. Ignoring the hand-waving argument above, I could probably still do the maths to prove the case as well... Trust me that the argument above is a *lot* easier to follow...

      Simon.

      --
      Physicists get Hadrons!
    9. Re:Pattern Recognition by perkr · · Score: 1

      Entropy in statistical mechanics and entropy in information theory are not the same thing

    10. Re:Pattern Recognition by Space+cowboy · · Score: 1

      Entropy in statistical mechanics and entropy in information theory are not the same thing


      I'm aware of that, but the mathematical derivation works fine. I was trying to demonstrate the point without resorting to some pretty unfriendly maths and the information-theory version comes to the same equations via a far less obvious route... I did say it was a 'hand-waving' argument...

      Simon
      --
      Physicists get Hadrons!
  13. First thing... by Weird+O'Puns · · Score: 0, Redundant

    ..that came to my mind after reading the headline was this.

  14. Keyword density?! by Short+Circuit · · Score: 5, Interesting

    When I search for something, I don't want to get a page that's a marketing front for what I'm trying to find, I want an informational, probably technical, page on the item I'm searching for.

    Such pages don't usually mindlessly repeat the keyword I'm searching for over and over again.

    1. Re:Keyword density?! by jayzee · · Score: 1

      From the article:

      The first major difference that jumps out in the Yahoo results is the preference Yahoo's algorithm seems to have for more words on a page.

      This seems to favour not only keyword density, but also dense wording in general. Please don't tell the marketing dept. I nearly had them believing that less is more...

      --

      Mole? 4? Cars?
    2. Re:Keyword density?! by Short+Circuit · · Score: 1

      Come to think of it, search engines used to work that way. And many intra-site search engine packages still do. (Like a lot of OEM sites out there.)

      Keeping that in mind was part of knowing how to effectively use a search engine. Perhaps Google's spoiled me. :)

  15. Re:Yahoo? by TwistedSpring · · Score: 2, Informative

    Thanks for clearing that one up. I did read that part of the article, but I was actually wondering where the results were coming from (whatever algorithm you use, you need to use it on a data set). Now I know.

    I use Teoma a lot these days, it's very much like Google was about 6 years ago. Fresh, relevant and speedy. Plus their twist on pagerank is a pretty sweet idea that's worth a look.

  16. My little test.. by CoolCat · · Score: 4, Interesting

    Just typed in the company I work for name (8 employees). First hit on google, yahoo.. I gave up after 9 pages..

    1. Re:My little test.. by tachin · · Score: 1

      Well i typed the name of the company i work for and i got a link in the first page of yahoo results but nothing on the first 10 result pages from google, so we could say your test and my test are just "anecdotal"...

    2. Re:My little test.. by Anonymous Coward · · Score: 0

      Well... We're 12 people, and we're the first hit on both Google and Yahoo. :)

    3. Re:My little test.. by meta-monkey · · Score: 1

      When I type in keywords relevant to the business I own in yahoo, my site comes up first, and my 9 or so competitors are all on the first or second page. When I type those same keywords in Google, five of us (myself included) are nowhere to be seen.

      I hope more people start using Yahoo.

      --
      We don't have a state-run media we have a media-run state.
    4. Re:My little test.. by MrHanky · · Score: 1
      My test is to search for MrHanky. If http://slashdot.org/~MrHanky is not on top of the list, the search engine is crap. This means:
      Yahoo - useless
      MSN Beta - understands that I've taken my nick from South Park (sponsored link) - bonus point, but still useless: it corrected my speling, and a real search for MrHanky still didn't get results. Interestingly, the first relevant hit was from a discussion about MSN Messenger.
      AltaVista - 2. result. I can cope with that.
      Google - We have a winner.
    5. Re:My little test.. by Afty0r · · Score: 1
      Just typed in the company I work for name (8 employees). First hit on google, yahoo.. I gave up after 9 pages..
      Does your company have a website? If not, then it's a good job you have up after 9.
    6. Re:My little test.. by squiggleslash · · Score: 1
      In fairness, I'd have thought that the occasional South Park website would stand a good chance of being #1 in your case...

      Does anyone ever do this and find it bizarre how much stuff they've written ends up on other websites? The worst offenders seem to be online bookstores with Amazon reviews - try searching for a phrase from a book review you've written on Amazon and seeing how many matches you find.

      --
      You are not alone. This is not normal. None of this is normal.
    7. Re:My little test.. by levar · · Score: 2, Interesting

      weird. I just tried my company (5 employees). First hit on yahoo and not in the top 15 pages on google.

  17. It's All Magic... by photonX · · Score: 5, Insightful

    I'm one of those greybeards who was writing college reports in the pre-BBS days, never mind the World Wide Web. Remembering back to when I used to spend a half-day of research in the library to mine info that now magically appears on my computer screen in ten seconds, well...it's hard to throw stones. I'm just happy the damned things work at all.

    --
    Anti-gravity? That was *my* little secret! But I never patented it! Boy, was *that* dumb!
    1. Re:It's All Magic... by AndroidCat · · Score: 1

      Of course, "Proof by Google" is always right. After all, if it's on a web page and Google finds it, then it must be true .. right? :^P

      --
      One line blog. I hear that they're called Twitters now.
    2. Re:It's All Magic... by Araneas · · Score: 4, Insightful
      I shave but the moustache is getting a little white. ;)

      What I miss is looking in the card catalogue under the general subject and being able to pull out all sorts of related material I hadn't thought of. Same for browseing the stacks. Grab the general Dewey number and go surf the titles.

      Wetware fuzzy logic at its best.

    3. Re:It's All Magic... by Short+Circuit · · Score: 1

      It drives me crazy that I can cite several sources online whenever I get into an argument with my parents, and all they say is "Well, if it's on the internet, it must be true." </sarcasm>

      What good is all those billions of pages, if the people you're trying to convince refuse to believe a word of it that they disagree with?

  18. Teoma vs Google by Anonymous Coward · · Score: 3, Funny

    Search for "slash" in Google and the results are:

    1) Slashdot
    2) Slash's Snakepit ...

    Put the same "slash" keyword and search with Teoma:

    1) Slash's Snakepit
    2) Slashdot ...

    Personally for this keyword search I feel Slash's Snakepit is more relevant and belongs at the top of the heap.

    1. Re:Teoma vs Google by Anonymous Coward · · Score: 0

      Slash's homepage and his first record is called "Slash's Snakepit".

    2. Re:Teoma vs Google by Snowmit · · Score: 0

      I would have thought that the most appropriate result for "slash" would have been some hot Kirk on Spock action.

      So I guess Booble wins.

      --
      I have a lot of opinions about Cyborgs and Architects
    3. Re:Teoma vs Google by Apiakun · · Score: 1

      google it :)

  19. So that's what happened! by peterdaly · · Score: 5, Interesting

    I've been on vacation and away from internet and most mass media for a week. Got back on Monday and have noticed a drop in traffic to my web sites while I was gone. Didn't have a clue why. Well, now I know.

    I'll be watching this very closely. Inktomi (sp?) sucked, which is what this is based on. I think it's too early to tell right now if the results are any good. Along the same lines, it will probably take about 6 months for marketers to learn to effectivly spam the results, which is something Google has historically been very good at keeping at bay.

    This will be interesting to watch over the next few months.

    -Pete

    1. Re:So that's what happened! by queen+of+everything · · Score: 3, Informative

      That's interesting. I've notice the reverse with mine. Slurp (Yahoo!'s bot) has been coming to my site almost hourly getting different pages for the past 2 weeks or so. I've also noticed a HUGE increase of referrers from search.yahoo.com. Usually all the referrers from search engines were from Google. Now, Yahoo! is much more frequent.

      Once yahoo changed over to Inktomi's search, I did several different searches for keywords or terms tha I want to be listed for. Surprisingly, I am ranked much higher on yahoo than Google right now for some things. I haven't changed anything in my code, its just interesting to see how the different search engines interpret the same thing.

      --
      "Wisdom is not a product of schooling but of the life-long attempt to acquire it." -Albert Einstein
    2. Re:So that's what happened! by Moeses · · Score: 1

      Lets see, you stop using the web and your website stops getting hits...maybe you're the only one that looks at them? ;)

  20. Warning: You are being watched! by walter. · · Score: 5, Interesting
    Looks like someone is counting the slashdot community. One of the links in this post points to
    http://www.searchguild.com/redir/o.php?out=http:// www.gorank.com/research/01072004_Google_Density_Re port.php
    So someone at searchguild.com is counting every slashdot visitor who clicks on that link! The unredirected link points here.
    1. Re:Warning: You are being watched! by PaschalNee · · Score: 1

      Hmmm ... searchguild.com. Could that be the guild?
      I agree that it is advisable to click on the direct link or, if clicking on the original link, at least to wear a tin foil hat to prevent falling under the mind control of the new world order.

    2. Re:Warning: You are being watched! by wine · · Score: 2, Informative

      This is no big deal. Unless you have configured your browser in a special way, it will gladly give out the URL of the page you were coming from to the page you are going to.

      The HTTP 1.1 standard includes this "referrer" statement in the headers of http. Following the direct link you posted, and watching mozilla discuss with the server through the mozilla module Live HTTP Headers, you can see your browser gives out this information:

      Referer: http://developers.slashdot.org/article.pl?sid=04/0 2/25/0857235&mode=nested

    3. Re:Warning: You are being watched! by cubic6 · · Score: 2, Insightful

      I think what original poster was referring to is that the clicks are being collected not by the article or it's authors, but by whoever submitted the link. If you went via the actual address of the article, only the article's server (gorank.com) would get that referrer. Due to the addition in the link, all visitors from slashdot get redirected through a page on searchguild.com, which may be collecting data.

      --
      Karma: Contrapositive
    4. Re:Warning: You are being watched! by Anonymous Coward · · Score: 0

      Given the subject matter, I would be more inclined to believe that it was an attempt to game Google.

      Inbound links from popular websites like Slashdot are treated as very valuable by Google. In this instance, searchguild.com is getting all the Google-love from Slashdot instead of the website that should be getting it, gorank.com.

    5. Re:Warning: You are being watched! by Anonymous Coward · · Score: 1, Insightful

      Another side effect is that gorank does NOT see the Slashdot referrer, but lots of searchguild referrers.

    6. Re:Warning: You are being watched! by tswann01 · · Score: 1

      Ha. They don't know their target market very well, do they?

      We don't RTFA.

  21. W3 compliance? by valentyn · · Score: 3, Interesting

    Slightly off topic: yesterday someone said that Google ranks W3-compliant pages higher than non-W3 compliant pages. I'm still confused. Could this be true?

    --
    my other sig is a 500 page novel
    1. Re:W3 compliance? by Aphrika · · Score: 3, Interesting

      In theory, it makes sense for Google to prioritise pages that adhere to W3C standards.

      Over-generalising here, it means you get a lot of professional sites rather than little Timmy's Frontpage creation, however, being a large corporation doesn't guarantee you a decently constructed site, and is no guarantee of it being W3C compliant.

      But then, Google probably sees this as a possible 80:20 rule - with the majority of W3C compliant sites probably offering something useful to index ,and index well, so they get priority over a page of junk that may or may not contain useful information.

    2. Re:W3 compliance? by Diplo · · Score: 2, Informative

      My personal experience with my own website says this is true. When I redesigned it so it validated as XHTML1.1 Strict the number of hits I got from Google increased by a massive amount.

      I believe Google actually respects a well-formed document and weights (for example) keywords found in header ( H1 - H6) tags above those found in, say, paragraph tags. It also extrapolates info from the much under-used TITLE and ALT tags, which a lot of WYSIWIG desined sites fail to incorporate properly.

      Plus, as anyone who has played with XML can tell you, a well-formed document is easier to parse than one that's composed of tag-soup.

    3. Re:W3 compliance? by BReflection · · Score: 2, Interesting

      This is interesting considering Google is not even W3C compliant. I guess when your on top you don't follow rules, you make them.

      --
      python -c "x='python -c %sx=%s; print x%%(chr(34),repr(x),chr(34))%s'; print x%(chr(34),repr(x),chr(34))"
    4. Re:W3 compliance? by Anonymous Coward · · Score: 0

      Maybe it's so they don't index themselves...

      Imagine how bad that would be, if Google started recursively indexing itself, it would only be a matter of hours until the world blew up, or something...

    5. Re:W3 compliance? by Anonymous Coward · · Score: 0

      Why would you use XHTML 1.1 if the current spec is only 1.0 ?

    6. Re:W3 compliance? by JimDabell · · Score: 1

      It's certainly possible, but as Google's own website is not valid, I doubt it's part of Google's strategy.

      The main thing to remember is that the Googlebot is like a web browser you cannot test in. You have no idea what parts of HTML it supports, and what bugs it can work around. You know the old problem with Netscape 4.x, where, if you leave off a closing </table> tag, it doesn't see any of the table? Who knows how many of those types of errors Google has problems with?

      The bottom line is that if you want some arbitrary user-agent to understand your website (in this case Google), then you really don't have any option but to write valid HTML. If you don't, you are gambling, plain and simple.

    7. Re:W3 compliance? by Diplo · · Score: 1

      Because It isn't.

      See?

    8. Re:W3 compliance? by AeroIllini · · Score: 1

      > This is interesting considering Google is not even W3C compliant.

      Yes, but take a look at the *nature* of their non-compliance. They don't have a !DOCTYPE declaration, but when I substitute an HTML 4.01 Transitional for them, the only errors I get are: missing "type" attribute on <style> and <script> tags, and missing quotes around all other tags' attributes. That's all. So even though they are not following the exact letter of the W3C compliance, they still write incredibly good code that renders properly in every browser on the planet, without having to resort to gimmicks like alternate stylesheets.

      Their search results page seems to have similar results. The only additional mistakes I saw were a few missing "alt" attribtes in an <img> tag, a few <img> tags without "src" attributes (they are 1px x 1px placeholders), and some nesting issues with <tr>, <td> and <form> tags. These are a little more serious, but are easily changed within the page generation script.

      Perhaps as a community, Slashdot should email suggestions@google.com and recommend W3C compliance. A few small changes in the code will help Google become a forerunner in standards compliance. Google already does many, many things right... why not add standards compliance?

      --
      For security, the MD5 hash of this message and sig is 09f911029d74e35bd84156c5635688c0.
    9. Re:W3 compliance? by Anonymous Coward · · Score: 0

      Ah, but then the question is, why in the bottom corner of the w3c page do they say their page is xhtml 1.0 compliant. If 1.1 is the standard, why would the w3c themselves not be using it?

    10. Re:W3 compliance? by mbauser2 · · Score: 2, Insightful

      Google doesn't give a rat's ass if a page complies to W3C standards. That would be a stupid way to run a search engine, because that would let junk sites boost their rank for superficial reasons while punishing relevant sites that have minor mistakes. Google is about content, first and foremost, and following standards doesn't improve content.

      When it comes to web design issues, Google does not punish naive mistakes. If somebody's HTML is so weird that it must be an attempt at manipulation (like making an entire paragraph out of H1 elements), it might get penalized. Other stuff, Google doesn't care about, because any strategy that penalizes most of the web is counterproductive to their goals.

      That said, Googlebot is computer program, so it probably does a better job of parsing pages that are well-formed (in the XML sense), and otherwise "easy to parse". Following standards is a good way to achieve "easy parsibility", so Google occasionally gives the "check your HTML" advice to people becuase it's easier than writing everything I ust wrote.

      --
      Proud to be / Smiley-free / Since Nineteen / Ninety-Three
  22. Missing the google point? by ItsIllak · · Score: 4, Interesting

    Isn't this missing the point of how google works? OK, so it measures the success, but it won't tell you anything (or much) about the actual search algorythm as google is actually basing the score not only on the page you link to but also pages that link to IT.

    Hence, it's an interesting read, and maybe you could draw your own preferences from what the weighting turns out to be in the listed cases, but it's not a very fair representation of how google works. *NB* I've no clue how Yahoo/Inktomi works, so I couldn't comment.

    1. Re:Missing the google point? by base_chakra · · Score: 1

      sn't this missing the point of how google works? OK, so it measures the success, but it won't tell you anything (or much) about the actual search algorythm as google is actually basing the score not only on the page you link to but also pages that link to IT.

      I think you're the one who missed the point. Among other things, one can use these figures to estimate optimal keyword density, and then strive to achieve that density in one's own sites. It also helps to determine what parts of a URL Google weights the most heavily.

  23. Sale sites. by Bender+Unit+22 · · Score: 2, Insightful

    I have seen that sites that does nothing but sells stuff, has gotten higher rankings lately. But maybe I just need to be more specific in my searches.

  24. If you're "wading through 10-15 pages of results" by jbellis · · Score: 2, Informative

    you need to change your google preference from 10 results displayed to something larger...

    if you have already done this and you're still wading through that many pages of results you suck at specifying what you want to search for :)

  25. They are different by samsmithnz · · Score: 3, Funny

    For example if I search for me (Sam Smith), I show up 4th on Google, but 51st on Yahoo.

    I guess Yahoo really doesn't love me after all.

    1. Re:They are different by millahtime · · Score: 1

      "...I search for me (Sam Smith), I show up 4th on Google, but 51st on Yahoo."

      I did a search for myself in yahoo and google and I came up 9th on yahoo and 19th in google. Yahoo was more accurate even if by a little for me. Interestingly though is that the page that Yahoo found was a much more obscure page for a project I worked on 3 years ago. The same page the google found at 19, yahoo found at 20.

    2. Re:They are different by meta-monkey · · Score: 2, Interesting

      It's almost scary to google yourself, isn't it? I just did it and found a newspaper article I was quoted in from six years ago, a letter to the editor I wrote to my college newspaper and listings for various research projects I was a part of a long, long time ago. Thankfully, there's nothing incriminating there.

      Also, it was interesting to see that I seem to be the only person on the Internet with my name. A search for my name in quotes, first and last, with either the long form or short form of my first name, turned up links ONLY to me. Thankfully, I've never done anything truly embarassing that wound up in the papers, so I guess I'm safe. How much would that suck to do something assinine 10 years ago, get a blurb in the online version of your town's newspaper, and then have it turn up every time somebody searched for your name for the rest of your life? Ouch.

      --
      We don't have a state-run media we have a media-run state.
    3. Re:They are different by Anonymous Coward · · Score: 0

      Also, it was interesting to see that I seem to be the only person on the Internet with my name.

      Yeah, well, I know I don't run into many guys called 'meta-monkey'.

  26. Why the search engines go from bad to worse by Anonymous Coward · · Score: 0

    'Entropy'.
    Yes! and they will even get worse. It is only a natural phenomena..Everything in this universe has to obey it including information and thus the search engines..

  27. The Problem with Search Algorithm Monocultures by G4from128k · · Score: 5, Insightful

    While I know that various search engines use various core ideas in search, I would think that a better way to search would use multiple approaches. Some combination of link-based analysis, keyword analysis, expert analysis, cluster-analysis, etc. rather than a single "this-is-how-we-do-it-here" algorithm.

    The first big challenge in search is in disambiguating what the searcher really wants without requiring a long string of inputs. A multiple-algoithmic approach would let a search engine serve up hits gathered in multiple ways (e.g., hit number 1 was top ranked using mehtod 1, hit #2 was top ranked using methd 2, etc.). The search company could then see which algorithm provides the best hits for a given search (i.e., by watching which hits the searcher clicks on).

    The second big challenge is all the nasty spammers and SEOs (Search Engine Optimizers) who will try to use knowledge of any search algorithm to game the system and artificially raise their page rank for commerical purposes. This is probably one reason why Google cannot maintain dominance - any dominant search enegine attracts the concerted efforts of SEOs, thus ruining its search quality, thus ruining its dominance.

    Yet a multi-algorithmic search engine could create a moving target that frustrates SEOs. By rotating the algorithms and even using negative weights on some algorithm results, a multi-algorithmic search company could cause high-ranked pages to plummet in rank over time. One week, a heavily keyworded site (e.g., one listing every possible keyword in metadata) might be at the top of the list, the next week it is at the bottom of the list. This raises the cost to sites trying to game the system. (The search company might even reward or penalize sites that change structure to often to either find the freshest sites or penalize the efforts of SEO).

    There never can be one right way to do search.

    --
    Two wrongs don't make a right, but three lefts do.
    1. Re:The Problem with Search Algorithm Monocultures by meta-monkey · · Score: 2

      Is search engine optimization really that evil? You create your site to serve information, what's wrong with trying to make sure that you show up well in the google rankings so you can serve that info?

      You sell widgets, so you want good placement for "widget store." Who's to say that your web site is or isn't the best place to buy widgets? Optimize it, so that when somebody searches for "widget store" they find you. The searcher is happy because he got his widget, and you're happy because you sold it to him. Everybody wins.

      --
      We don't have a state-run media we have a media-run state.
    2. Re:The Problem with Search Algorithm Monocultures by evilad · · Score: 1

      In the article's example, hit #2 for "prozac suicide" links to a page discussing mexican recipes. The page bears no relevance whatsoever to the search terms, except that you can possibly find a way to buy prozac from the page.

      It is ad-pollution, plain and simple.

    3. Re:The Problem with Search Algorithm Monocultures by 0x0d0a · · Score: 2, Interesting

      "Evil" is meaningless.

      It's clearly in search engine spammers' benefit to do so (much like email spammers).

      It also clearly disadvantages users, since PageRank is a pretty good metric (outside of people trying to game the system) of usefulness.

      You clearly have some interest in discussing SEO. The parent has some interest in discussing thwarting SEO. I'd that that the second subject has at least as much merit (as in, it benefits a large group of people a good deal), and is certainly equally interesting.

      Now, it's true that simply eliminating SEO-using sites may not be worthwhile -- it's possible that some SEO-using sites have merit, and over-penalization is possible.

      Increasing the difficulty of SEO analysis is interesting. A couple of other interesting possibilities:

      * It might be interesting to try to specifically identify users trying to "game the system" and start feeding them slightly shuffled results. As long as the shuffling isn't too heavy, it even false positives with this shouldn't be too painful.

      * It might be interesting to try to identify sites attempting to utilize SEO and penalize them. Frankly, the kind of sites that use SEO are generally the sort of thing that I *don't* want to find.

      * Not quite as nice, but it might be interesting to try to identify clouds of SEO sites. For example, Google seeds an inverse trust network by posting to an SEO site (and posing as an SEO) a particularly complex approach to SEO. A site implementents it, and it is immediately a "known using SEO" site. Google tries to identify sites that are "related to" it a la PageRank and looks for sites that adopt similar measures, considering them to be SEO-ized sites with a somewhat smaller probability.

    4. Re:The Problem with Search Algorithm Monocultures by meta-monkey · · Score: 1

      I think of SEO as "designing my site so it's easy for search engines to find and rank." There's absolutely nothing wrong with that. First, design to make it easy for people to navigate. Then, tweak to make it easy for search engines to rank you.

      For instance, you have the option of either using a graphical banner at the top of your page to display the title with some pretty graphic or font. Or, you can use H1 tags to accomplish the same thing. Which should you use? It isn't really going to matter much to the viewer, but using text allows Google to index the words in that text, so you'll get a better rank for those search terms. Uh oh, my site has now been "optimized" for a search engine. Do you not want to visit my site anymore, even though it contains the information you're after? After all, it's been search engine optimized.

      As another poster noted, there's a difference between Search Engine Optimization and Search Engine Manipulation. You go through a lot of time and effort to set up a web site to serve some information to the public. The public often uses search engines to find information. Since there IS a difference between how a person interprets a site and how a robot can interprete a site, what's wrong with tweaks that help the robot index your site if it doesn't effect a person's ability to navigate and read your site? Remember, robots can't interprete images.

      On the other hand, if you're intentionally misleading the search engine, then that's wrong. If people are searching for "chicken soup recipes" and you're trying to get your penis enlargement pills website to show up for "chicken soup recipes" then you're manipulating the search engine, in a deliberate attempt to return poor results.

      Search engine optimization is a good thing all web designers should do. We design websites to serve information to people, so let's make it easy for people to find our website. Search engine manipulation is the equilvalent of email spam, and efforts should be taken to prevent it.

      --
      We don't have a state-run media we have a media-run state.
    5. Re:The Problem with Search Algorithm Monocultures by WMSplat · · Score: 1

      Wrong. There is almost certainly a truth-telling mechanism for the search engine functionality. Truth-telling here means that the optimal behavior for a site to get hits is to tell the truth about the page's content (ie. no trickery). Of course, the mechanism would also almost certainly be very slow to run, but still...

      "Monocultures" tend to be bad, but they don't have to be.

  28. Summary by Sieni · · Score: 0

    Could the editors please put a summary of the results, so that we don't have to RTFA.

  29. SEO - SEM by peterdaly · · Score: 4, Informative

    As someone who does search engine optimization of his own sites, I believe there is an important distinction between ethical and non-ethical (spam) activities.

    Search Engine Optimization - doing all things possible to tell a search engine what your page is about while being balanced for humans to read as well. Ethical. Sometime considered spam when really the search engine returns poor results; usually due to the page you are looking for not being easy to understand for spiders.

    Search Engine Manipulation - trying to doing things to get search engines to return your page in results when the page may not otherwise be something the engine considers relevent or high quality. Showing something different for the search engine falls under this category, is commonly refered to as cloaking, and is against many search engines "rules" for designing pages. Not ethical, aka spam.

    -Pete

    1. Re:SEO - SEM by silentbozo · · Score: 4, Insightful

      The problem is that telling the public what your site is about is equivalent to telling search engines what your page is about. Aside from meta-tags (which should really be all you need in order to communicate "additional" info to search engines), any change to your website to "optimize" for a specific type of search engine, and not for the general public, has the effect upgrading your page ranking AT THE EXPENSE OF NON-OPTIMIZED SITES.

      Here we go into the slippery slope that leads to situations like the tradgedy of the commons (where people tend to use up a resource because it isn't theirs), the hiring of lawyers (statistically, if one side hires a lawyer, they get better results, but if both sides hire lawyers they get the same settlement, only smaller because of lawyers fees), etc. It's the prisoner's dilemma - defect (ie, optimize) to improve my position, at the risk of everybody else defecting and earning worse returns than non defecting in the first place (ie, everybody stops using google because the rankings are screwed up and are no longer trustworthy.)

      Put simply, the moment any site tries to game the system, even just a little bit, they ruin the usefulness of Google. As it stands, I'm getting better results with Metacrawler now than with Google - something I wouldn't have said just a year ago. Don't even get me started on websites with javascript-redirect gateway pages, or the ones that scrape search-engine/newsgroup/eBay pages for text in order to boost hit counts, and then link back to similar pages in order to get higher link relevancy, OR the ones that take over abandoned domains in order to exploit the ranking generated by pre-existing links that point to the domain name...

    2. Re:SEO - SEM by RevDobbs · · Score: 3, Interesting
      Aside from meta-tags (which should really be all you need in order to communicate "additional" info to search engines), any change to your website to "optimize" for a specific type of search engine, and not for the general public, has the effect upgrading your page ranking AT THE EXPENSE OF NON-OPTIMIZED SITES.

      But like the "SEO v. SEM" argument above, search engine optimization done right will also give better results to the end user.

      Think about it: if I'm looking for the specs on Widget A and the best damn website on Widget A makes me sit through a 135 second flash animation before I can get to any usefull content, I'm going to miss all that valuable information because I'm not wasting my time or bandwidth loading that crap.

      Now, what if the second best Widget A site is ran by people with a clue: title tags contain the important keywords ("bulk pricing", "failure modes", "Mil/Commercial/Industrial specification compliance"), easy-to-use navigation that tells me by the link text this is what I want? Well, than this is the most useful site, and should be ranked higher than the others.

      Search engines are just distiliers of information; super-quick page scanners. If you make your page human-scanable and easy to use, your relevence will rise higher than other pages. By effectivly telling people what your pages are about, you'll be effectivly telling the search engines what your pages are about.

  30. Google is good for one thing. by Anonymous Coward · · Score: 0

    Free porn.

  31. Re:Yahoo? by WhodoVoodoo · · Score: 1

    Why don't you google it?

    or yahoo it.

    No, that just doesn't sound right.
    Google: 1
    Yahoo: sounds like something I call my neighbor

  32. They are search engine spammers by Anonymous Coward · · Score: 3, Insightful


    yeah trying to figure out how to get to the top of search engines by analysing keyword density so you can then construct copy text with fake entry pages or as the se.spammers call them "gateway" pages with 302 redirects via the useragent or constructing urls/with/the/keywords using ModRewrite

    we know what they are up to, spamming search engines peddling shite with their refferer links

    fuckers, these people are the reason 90% of search engines suck and who are rapidly poising google so in 5 years no-one can find shit without being taken for circlejerks and wading through shitty websites peddling porn,viagra and whatever shit is flavour of the month, if thats what the internet i see is gonna turn into then why the fuck do i bother

    and we link em here at slashdot
    i wouldnt give these people the time of day

    A>S

  33. Re:Yahoo? by Anonymous Coward · · Score: 0
    I use Teoma a lot these days, it's very much like Google was about 6 years ago.
    Nice try, except Google wasn't around six years ago... I believe it launched in September 1998.
  34. All I know... by Ironix · · Score: 2, Funny


    Is that I'm pissed off for suddenly loosing my ranking a month ago. I used to be in almost every spot for the top 30 results for the keyword "QQQ", but now I am below 100. =(

    --
    Still #1 -- Lonely Gay Geek
  35. Cocks. by WhodoVoodoo · · Score: 5, Interesting

    Actually, I find an intersting way to rate search engines is to search for the word "cocks"

    yeah, I know what your thinking.

    You typically get a couple things from this search:

    Porn (duh)
    Chicken related things
    and the band "The Revolting Cocks"

    By looking at which ones come up first, you can infer some interesting and useful things about how an engine works. What those things are I will let you decide.
    Mostly because it's funnier.

    But seriously, folks, try it out.

    1. Re:Cocks. by Anonymous Coward · · Score: 0

      Try setting the Safesearch Yahoo Preferences to STRICT.

      I've no idea what the default is, I think its the middle setting. Google looks like it has the high setting.

  36. Re:If you're "wading through 10-15 pages of result by Anonymous Coward · · Score: 0

    There also is a problem with the internet content these days. There is many more pages with little content.

    For intstance if I want to learn about symmetry operations on the electronic structure of silicon I can search for the "symmetry electronic structure silicon", but this returns a slew of research grade articles which are tangental to the basic subject.

    So I can use the words "Basic" or "Introduction" in the search and now I've found pages and pages of intro to solid state course outlines. But there is no real content here either. At the best PPT slides, but these aren't that useful without the lecture.

    I suppose that the trick is developing AI to distinguish content from a list of names.

  37. more isnt always better by dpw2atox · · Score: 3, Insightful

    From what I have seen in the past as well as currently more results is not always better. One of the primary reasons I use google as my search engine is because it has very accurate results. I would rather have a search engine display 10 results which are accurate than 100 results which are completly wrong. This article might show that yahoo displays more results in certain areas but I plan on using both services for searches over the next few weeks to see which one is more accurate.

  38. Comapre the Algorithms manually by GoogleGuy · · Score: 4, Informative

    The challenge for Google and Yahoo is to filter out the SEO spam (Doorways, cloaking, ...)

    Check out the algorithms yourself by comparing google and yahoo search results side by side.

  39. And right below the article... by Anonymous Coward · · Score: 0

    ...is an ad for Google's AdSense!

  40. Vs Yahoo by paragon_au · · Score: 1

    Try yahoo, slashdot is all the way down to 6th.

    While google's 4th result Slash Online (Offical fanpage of slash) is now at the number 1 postion.

  41. Insightful? by paragon_au · · Score: 1

    This post is nothing more than a flame. These people are not spamming search engines, they are attempting to research the accuraccy of the two search engines. Just because some of their research can be used for 'evil' doesn't mean that these people have done anything wrong. "Stupid guy who invented the wheel, if the wheel has never been invented car bombs would never have been invented and the world would be a much better place.I wouldn't give him the time of day."

    1. Re:Insightful? by Anonymous Coward · · Score: 0

      from their webpage
      "Tools that unlock the secret of ranking 1st in the search engines."

      like bulk email and tools have nothing to do with spam

      who do you think "search optimisation" is aimed at ? optimised for who ? if my page is relavent to the persons search why do i need to optimise it ?

      if you see a spade call it a spade

    2. Re:Insightful? by 0x0d0a · · Score: 1

      It *is* a flame, but the criticism in it is legitimate. These people are clearly targeting SEO (search engine optimizers/spammers), based on pretty clear words in their article.

      Besides, why would a legitimate researcher care about word density in Google? It's just not interesting research -- you're analysing an algorithm that you know nothing of. It'd be much better to make your own research search engine and analyze *its* results, and get something useful, interesting, and possibly publishable.

      That being said, ultimately the flaw is with Google/Yahoo. They are exploitable (even if it's hard to pull off), and money can be made by people without risk. These guys are looking for data to build attacks. The fact that their attacks screw over a lot of other people sucks, but there you have it.

  42. nr 4 loves you. by leuk_he · · Score: 1

    On the other hand Number 4 on the yahoo list loves you for giving up your place on number 4. as do nr 5 to number 50.

    make up your mind...do you want 1 friend (yahoo) or 46?

  43. Re:Yahoo? by stanmann · · Score: 2, Informative

    2004-1998=~6 For more details you can google

    --
    Food not Bombs is a nice platitude but it breaks down when you notice that the Bombees are usually well fed
  44. Was doing a little test and... by l3pYr · · Score: 2, Funny

    I was running query after query to see who came up with better results. I typed in 'quote AMD' and yahoo brough up a nice little stock quote with a graph. In google a generic graph icon was on top with a link to 'Show Stock quotes for AMD' ... it linked to Yahoo's finance page. Just thought that was a bit ironic..

    --
    RTFA and cite your sources or prepare to get pwnd
  45. Google search Mozilla by bstil · · Score: 1

    I don't use Google's homepage much, now that Mozilla Firebird/Firefox has the Google search box right next to the URL box! Thanks Mozilla team.

    1. Re:Google search Mozilla by osu-neko · · Score: 1

      You can add this feature to IE with the downloadable Google search bar. It also adds popup blocking, which is really nice. OTOH, you can just both of these features with Firefox right out of the box, and not be vulnerable to every browser exploit that comes along too...

      --
      "Convictions are more dangerous enemies of truth than lies."
    2. Re:Google search Mozilla by bstil · · Score: 1

      But Google toolbar might raise some privacy concerns. Here is their privacy policy. The IE toolbar does contact Google's servers to check page rank of the pages you visit.
      Yahoo also has the Yahoo Companion toolbar.
      For right now, I'll stick with Google from Mozilla Firefox browser.

  46. The search engines just need moderation by pj2541 · · Score: 4, Insightful

    But the only choices should be "Interesting" and "Troll." If each vote added or subtracted a very small amount from the page rank, and steps were taken to prevent stuffing the ballot box, I think this would actually improve the search results for the users.

    1. Re:The search engines just need moderation by Golthur · · Score: 1

      I can just see the spammers^H^H^H^H^H^H^H^HSEOs writing automated "moderator-bots" to vote their pages "interesting"...

      --
      Hofstadter's Law: It always takes longer than you expect, even when you take into account Hofstadter's Law.
    2. Re:The search engines just need moderation by MikeBabcock · · Score: 1

      This is what Google was originally designed for, fwiw.

      --
      - Michael T. Babcock (Yes, I blog)
  47. Re:If you're "wading through 10-15 pages of result by TheLink · · Score: 2, Interesting

    Actually google has got worse.

    Now many of my web searches tend to turn up tons of mailing lists archives. If I want to search those I'd use google groups (I get about the same results for my search terms in google groups).

    I'm actually not that surprised - when I first heard they were using Page Rank some years back, I wondered how long that would keep working. It's easy to manipulate, plus it's kind of circular.

    --
  48. Clarification! by Ayanami+Rei · · Score: 4, Informative

    The article submitter is SPECIFICALLY trying to profile slashdot readership. Clearly the Anonymous Coward is either the article's author, or someone with a vested interest in our opinions on this topic, but someone who can't look at gorank's referral logs.

    This is VERY sneaky (akin to putting an Amazon referral link in a book review).

    Do NOT click on the link. If the submitter had actually bothered to use a logged in slashdot account, I would be more trusting.

    Copy Link location, open new browser window, paste.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    1. Re:Clarification! by Anonymous Coward · · Score: 0

      Or click it a zillion times and clear cookies each time. Might be cool if /. was overcounted instead of undercounted.

    2. Re:Clarification! by Matthias+Wiesmann · · Score: 2, Interesting
      Or click it a zillion times and clear cookies each time.
      Why click? If you want to put silly things in their logs, simply follow this stupid link, with cookies disabled it inlines the inline page in itself a few times before inlining the google cache of slashdot.
  49. Bring it on by tomblackwell · · Score: 0, Flamebait

    SEO

    You don't know what you're talking about. Please feel free to come here and try to ram your fist down my throat. It'll make my day.

    I'm waiting for you...

  50. what do the terms mean??? by Anonymous Coward · · Score: 1, Insightful

    Nowhere does the article explain what repeats and density mean and how they are calculated. What kind of report is this??

    The first line of the google table shows 1001 5.1 2.4% (W, R, D). An obvious guess would be that the density is the percentage of the repeats in the words (d=r/w) but that is not what it is, so what is the density calculated from?

    1. Re:what do the terms mean??? by Anonymous Coward · · Score: 0

      according to a google on "define:keyword density",

      "The number of times a keyword is used on a web page divided by the total number of words on the page. Expressed as a percentage. "
      www.google-secrets.com/google-secrets-glossary. htm

      But these tables all have the density percentage calculated incorrectly then. No one else on slashdot bothers to check the math?

      In the first line, 1001 5.1 2.4% (W, R, D), 5.1 is .51% of 1001, NOT 2.4%

      Is everyone using this guys faulty data to optimize their pages?

  51. Calling yourself an SEO... by Ayanami+Rei · · Score: 1

    is like a spammer calling himself an "eCommerce Marketing Agent"

    When I want to search for something online, I do not want to see a webpage designed to snag my eyes.

    AT ALL!

    You do me and many other previously happy web users a great disservice.

    You should encourage web page designers to make MORE USEFUL WEBPAGES rather than wasting their time trying to get search engines to notice them.

    It is the job of the search engine to determine what page is most relevant, not the page authors themselves (directly). Because everyone wants to be most relevant. But frankly that fails because of something called the pigeonhole principle.

    Guess what! 80% of people aren't better than average drivers either!

    So please, accept my gift of a virtual knuckle sandwich.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  52. That's right, mod me down. by Ayanami+Rei · · Score: 2, Insightful

    But you know I'm right. If you have similar sentiments, mod me back up. Let those fucknuts know how you feel about "Search Engine Optimization", i.e., "you'll never find an objective review about a commercial product EVER AGAIN with a search engine... HAHAHAHhAHAHHAAA"

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  53. SEO by tomblackwell · · Score: 1

    is marketing

    Marketers attempt to help a company get their message across to those who would be interested in their product. If the company doesn't know what the search engines are looking for, it won't know where to concentrate their efforts in building the site.

    An SEO will tell companies things which often end up sounding like common sense.

    A marketer will tell a company things which often end up sounding like common sense.

    They're two facets of the same thing.

    Also consider:

    without marketers, most people wouldn't have jobs.

    Now fuck off.

    1. Re:SEO by 0x0d0a · · Score: 1, Insightful

      Marketers attempt to help a company get their message across to those who would be interested in their product.

      Yes. However, marketers also have little interest in the case of targetting people who are *not* interested in said products.

      In the case of the Web, frequently marketer performance is measured by hits. Thus, marketers have incentive to mis-direct users to their site even if those users are not interested in a product.

      Marketers are not in the business of simply informing people about a product. You absolutely do not need a whole department to manage that -- it's quite easy. Most marketing is designed to exploit human irrationality and quirks in the thought process and preferences to make humans make irrational purchasing decisions.

      I don't hate SEOs any more than I hate politicians. Both are placed in a system that we designed that is exploitable. We screwed up, and the burden of fixing the system to not be exploitable lies on us -- come up with a better search engine, suggest some way to prevent people from mucking with results.

      However, SEOs cause me about as much unhappiness and discomfort as email spammers do, so I'd hardly shed a tear if they all went out of business, say.

  54. Ads good at filtering out crap by 0x0d0a · · Score: 4, Informative

    I've had excellent luck using Google's ads for one thing -- when I'm looking for a retailer to buy something. Not infrequently when trying to buy something, I come up with plenty of garbage and irrelevant results, but the paid advertisements are there because the people are trying to sell me what I want (and they are interested in not wasting impressions on people that *aren't* interested in their product, so they have a positive incentive to focus their ads).

    1. Re:Ads good at filtering out crap by Anonymous Coward · · Score: 0

      if you search for the product name, you're implying you want information about the product. if you want to find a place to buy the product, you should search for the product name + price or vendor or something to indicate that you're looking for a retailer.. i, for one, would hate to search for a product and only find vendors. usually i look for reviews first :)

  55. My advice: work hard on content by MarkWatson · · Score: 2, Interesting
    I am fortunate to be the number 1 hit for the keywords "Java consultant" on Google and Yahoo.

    I have never played any games what so ever to get there. What I do however is try very hard to place interesting and useful content on my site (mostly 'free web books').

    I don't think that it matters so much what you do in life so long as you love doing it. I have been programming computers since the early 1960s, and I still love it!

    -Mark

    1. Re:My advice: work hard on content by ionizer · · Score: 1

      Speaking of content the link to your demos page (http://www.markwatson.com/demos.php) is 404. You might want to work on your content...
      Cheers!

  56. Categorized approach by Anonymous Coward · · Score: 0

    WiseNut tries to categorize search results in order to enable you to "drill in" (I know that's probably not a good choice of words if you're searching for "cock"...) according to what category you're after.

    1. Re:Categorized approach by Anonymous Coward · · Score: 0

      WiseNut tries to categorize search results in order to enable you to "drill in" (I know that's probably not a good choice of words if you're searching for "cock"...)

      Yeas, well you're already talking about using WiseNut to search for cocks, so I wouldn't worry about terminology.

  57. Interested in WHAT product? by Ayanami+Rei · · Score: 1

    I can't thing of a single thing I bought online that I searched for directly in a search engine.

    The way most people shop is 1) They use a search engine to find a retailer that has the kind of thing they're looking for combined with good reputation or price 2) They use the internal search engine to find the product they want.

    Or they use a specialized product search engine like froogle.

    So what are you helping them sell, exactly? Care to provide an example?

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  58. HA, no by Anonymous Coward · · Score: 0

    Users to modify their sites? Hell no! Yahoo and Google should tweak their algorithms. That's the *right* way to do it.

  59. Yahoo uses more than keyword density by elflet · · Score: 5, Informative
    "Keyword density" is a favorite SEO trick for trying to get a page to rank more highly, along with engine-specific tricks (e.g. getting people to link to your page with they keywords you want in the link to drive a Google ranking higher. I just ran a handful of experiemnts with long-established (8+ years), high-ranking pages and found a few interesting things in Yahoo:
    • Incoming link popularity appears to play a far smaller role than on Google. Pages that are "top of page 1" material in Google due to their oncoming links don't even show up on top of Yahoo.
    • Yahoo is using the meta Description tag, at least in the display (but it also looks like they're using it for ranking.)
    • They're giving extreme weight to items that show up in the Yahoo directory (which has been pay-for-inclusion for the most part the past several years.) In fact, one of my pages which has changed titles shows up in yahoo search under a 6 year old title (the one used to list it in the directory, natch.)
    • Yahoo is also giving heavy weight to keywords that show up in URLs.
    • Keyword cramming seems to move sites up on Yahoo (very annoying, especially for those of us who would rather get placed via honest content.)
    To be honest, Yahoo's new engine reminds me of circa-1996 engines. Go run the same search on Yahoo and Google and see what comes back with better relevance (Google still looks better to me.)
    1. Re:Yahoo uses more than keyword density by MikeBabcock · · Score: 2, Informative

      The only people I've noticed *not* liking Google lately who are excited about Yahoo's new engine are SEOs. People who actually *use* search engines all day should be very happy with how Google works.

      I was reading an SEO discussion on a programming site earlier today and everyone was complaining about how buying keyword ads on Google didn't help their ranking for those keywords in the search results (of course not; it just buys you ad-space).

      --
      - Michael T. Babcock (Yes, I blog)
  60. Pagerank rules by Camel+Pilot · · Score: 1

    Someone help me out here. Google heavly relies on pagerank and thus keyword relevance based on volume and quality of links pointing to you. Could it be that word density is completely irrelevant?

    Is this not a classic case of correlation not equal to causation?

  61. MOD PARENT RIGHT UP by Anonymous Coward · · Score: 0

    'nuff said

  62. $25/hour? by wodelltech · · Score: 2, Interesting

    I am complete befuddled as to how/why you charge so low with so much experience and a top Google rank. What's up? Is money just not an issue?

    --
    Your monitor is staring at you.
    1. Re:$25/hour? by MarkWatson · · Score: 1
      Thanks for asking.

      I live in a really remote area (mountains of Northern Arizona), so I work mostly by telecommuting.

      A few years ago, I worked with a bunch of very talented Russian guys and also very talented Brazilian developers. They received a very low pay by our standards, and I consider a telecommuting U.S. programmer has to, in the long run, compete with foreign developers. Because of my experience, and "time zone advantage", I set my rates at about double what a very good overseas developer gets.

      Actually, with my current consulting rates and book royalties, I am very satisfied with the living that I make.

      -Mark

    2. Re:$25/hour? by wodelltech · · Score: 1

      Interesting answer...and far better than "oh no! some one hacked my site and removed the 0 from 250"...that I feared. :)

      Seriously, it's good to see motivation other than pure $ at work in the world.

      --
      Your monitor is staring at you.
  63. Yeah but... by Anonymous Coward · · Score: 0

    Can it find the Meaning of Life?

  64. Google don't use keywords to search pages (??) by rofthorax · · Score: 1

    I thought that they just used the google toolbar to determine what keyword searches lead to correct results by monitoring how long someone stays on a page and using the log of events from the toolbar to feed these findings.. So if you searched for "Pentium 4" First hand it may do a search by keyword to find the Pentium 4's but the product is faced up according to how many people before you had also done this search, and the rankings of the various results based on the success of the outcome.. Like if the multitude of searchers ended up browsing the DELL page, then Dell would appear above all else at the top of the page. So no matter how many keywords you have in your page, it won't have a chance against the pages that people tend to look at.. However if you can flood the database with pages with all relevant keywords, then maybe you can keep people from ever getting favorable results for those keyword searches, then nothing ever gets ranked.. That's the experience you've had if you've gone looking for (ahem) "swimsuit" and you get all the porn pages with 10,000 keywords in them. Unless people actually go weeding out the swimsuit pages from the crap, nobody will see anything relevant at the top of the page.. This is why it seems google can read our minds, as yahoo doesn't even come close.. Yahoo doesn't use the outcome of the search in the computation of future searches.. So Faaaa to Yahoo, keyword searches in pages is only half the problem..

    --
    Just say no to license servers!!
  65. SE Dictatorship by Via_Patrino · · Score: 1

    I've a nice information webpage, it's in fact a free (as in speech) online book.

    It's very informative, but how it's about a relative popular subjective it doesn't show up on google's top 10, and depending of the searching terms not even on top 50.

    On the other side I see a lot of pages selling products (often the same product) in front of mine, a lot of them use:
    - marketing terms: advertise a product that isn't exactly what it says, but it's what people search for (the magical solution). Example: "remove all your virus!"
    - SEO: you know what's that, I won't teach people how because I don't like the way it's usually used.

    As nowadays about 90% of the referes to a webpage is a SE if I don't fit my page for them I'll have a very small number of viewers and the information I spent time wroting will be useless.

    I don't like that idea, I prefer my page human friendly not SE friendly, but I'll need to make my page look like a "scamming page". That doesn't mean I'll use dirt tricks or being unethical, but it definitly won't look the way I liked it.

    Welcome to SE dictatorship!

  66. How to get to top of google!! For spammers!! by rofthorax · · Score: 1


    First get 100 of your best friends,
    get each with a google toolbar on their browser..

    Have each search for the same set of keywords, but have them all click on your page..

    Have then spend about 5 minutes on that page..
    Then have them all close their browser..

    Do this again and again for several other keyword combinations..

    Do this procedure as much as possible, and you will find your page ranking at the top of the results of the keyword searches you performed..

    What would be better is to automate this.. Write some viralent programs that infect people's machines then perform these rigged searches and send fake outcomes to google.. Why I suggest making a virus or a trojan hoarse? The searches are bound to be keyed to IP addresses, if you have one IP address acting like 100 users, the google search database could place less significance on result outcomes that affect only a small portion of the Internet community.. It could be the algorithm uses a ratio of users with unique IP addresses to all the IP addresses to determine relevance as well..

    If you say "That's impossible, there is no way I can do that.." Precisely, that's why this whole discussion of keyword searches on a page is moot..

    --
    Just say no to license servers!!
  67. Missing Domain Name Data Points by PetoskeyGuy · · Score: 2, Interesting

    Domain Names.

    Search Engines definately give rank to domains which contain your keyword in them. Tons of sites out there seem to have figured this out to make searches useless. There are tons of "keyword.useless-site.com" dictionary pages out there.

    I would really like to see the search engines be able to figure out that certain pages make no sense. They read like something from the old SNL subliminal man skits. Or site that bounce you somewhere else as soon as you arrive.

  68. Not magic -- it's the Batcave! by pjack76 · · Score: 1
    When I was a kid there was some ridiculous superhero cartoon on which Batman and Robin would ask their computer something, and the computer would just magically know the answer...

    And mother always said "No, pjack76, you can't have a Bat computer, there's no such thing as an all-knowning machine..."

    --

    Wow, a lucrative publishing contract! I don't have to be evil anymore. --Meteor

  69. Look Out For Yahoo! Lawyers... by herrvinny · · Score: 2, Interesting

    According to Whois information (CAPTCHA required), yahooslurp.com is owned by a flower store site. How long until Yahoo figures this out and hammers the store into the ground?

  70. Re: Searching by name by WuphonsReach · · Score: 1

    Which is exactly how this lad got himself caught. His prospective date decided to search his name on the web and got a nice hit from the FBI's wanted list.

    --
    Wolde you bothe eate your cake, and have your cake?
  71. The conclusion for the article by danila · · Score: 1

    For those too lazy to RTFA:

    Conclusion

    Well, we planned to do a completely different report this month, but figured Yahoo is probably what is on the mind of many SEO experts at the moment. Hopefully this research provides some useful data you might be able to incorporate into your SEO efforts.We have plans for our next report but it will probably change in the coming weeks. If you have any ideas please let us know. You can email us at info at gorank.com. We plan to do many more reports. If you would like to join our mailing list to be notified when we issue a new report please enter your email address below.

    Please feel free to mod down (-1: Uninformative)... :)

    --
    Future Wiki -- If you don't think about the future, you cannot have one.
  72. Rich-media Relevance by drpentode · · Score: 1

    I think if Google and Yahoo! log SCORM and semantic web information, searches will be a lot a lot more relevant, especially image, video and audio searches.

  73. Troll. by Anonymous Coward · · Score: 0

    This thing doesn't work. Type in anything
    other than slashdot and it doens't do anything.

  74. analysis not impressive by wjzhu · · Score: 1

    I tried the web page analysis: they include html words as normal words. Pretty lame website. But they know how to make useless data look good, with lots of entries and percentages in a table.

  75. sloppy averaging by pudgybuddy · · Score: 1

    the simplest explanation is that averaging is done in different places. using the top line from above: 2.4% is the density value (repeats/words) averaged over the 2000 first ranked pages; while words and repeats are total words/2000 and total repeats/2000. *if* this is the case, density is more useful than repeats/words. just consider an outlier page that is 100% repeats: it will boost the repeats/words ratio by a huge amount if the page is, say, one million words long, but the density number wont be affected---its just averaged in with the other 1999 as one data point of 100%, totally regardless of page length. notwithstanding, this article is a pretty half-ass "analysis"....

  76. Google Versus Yahoo -- And Results by etLux · · Score: 2, Interesting

    As an operation with several dozen websites with fairly substantial traffic, we tend to look at all this from the other direction. Google consistently delivers a whopping THIRTY TIMES more traffic than Yahoo, network-wide. Guess whose "algorithm" we like better...

  77. Re:Hello, Guns 'N' Roses by Anonymous Coward · · Score: 0

    You remember this band, they were huge back in the late 80s and early 90s? The guitarist, Slash, started his own band Slash's Snakepit but he has ditched that band as it was not going anywhere and he is now part of Velvet Revolver with other ex-G'N'R bandmates(but not Axl Rose), with Scott Weiland as the frontman.

    I think it is more relevant for Slash's Snakepit to be higher because it actually fits for the keyword. Slashdot is more of a tangent, I assuming alot of people are linking to slashdot like this:

    Slash

    Slashdot and Slashcode are compound words that contain the word Slash, but they are not Slash, so I think it is clear that Teoma wins this little faux comparison.

  78. $20 a month!?! by nuntius · · Score: 1

    My roommate and I each spend $22/month ($44/month total) for broadband access. And you're saying another $20/month for a better Google is worthwhile?

    $5/month is pushing the limit. Most users would balk at even $2/month simply because of the hassle and risks of more spam/credit card theft.

  79. congratulations by clsc · · Score: 1

    You've all been participating in an interactive marketing campaign for a search engine optimization company... innovative way to do it, i'd say.

  80. code to index text by Anonymous Coward · · Score: 0

    begin 664 idiocy.c.gz
    M'XL(`.1:24`"`ZU9>V_;R!'_WY]BHL(.*5.V Y/BNO=`R8#0YY-J\D/10M$DJ
    M4-12W)I<$MRE%27P=^_,/O@ 0)1M-&P0V.3L[L_/ZS2Q]/CZ",?`5+^+M<TBX
    M6,&FJ%82J> ='?^`BSNH5@RNYE>=J6S)YEE[OD*6*5)^:Q$)E.XP*-0Q(&5\.
    MY>5Y)/K46G!D[M-B.LVNP(J+-=&.SL=H2L9`JCI)R)(C+A0 4)1.+HEJL.//B
    M-*I@G(@H9P'0XCAC8JU2'[X?@294=S#76S S+]6[QX<6[MZ__X8?(@<KJ6`$9
    MCS^(PA/P<,\53'TH6545E =GH!\"^<N7-]+Z$=B!?`"=2:8I5C,JD.I-J(?DW
    M1O2*J;H2 >(SPZ%Y;1''1$4(K.Y;=%7P%)9JN6&[-DL8D%&S,23855\Q#,I ("
    MF`6X?5741G]9*^F-1KY6HX6)HMPG",PQT@(M+@OU'%1TR R34)5P\.X8BP4,4
    MZRK*GTJH:@&*YXP.J$72L1<ZL0:2`]`< WCA>^D.UQ@!#5N2CL'G-(O06R5R@
    M3ZMFD79RH9TUARE1-BG E@J<P-A).6[>8F#VQS"<GP&64B3KWQLIW'`!#-3K<
    M>KM3,S .4>V"99%IJ*_3)7JGQTML1'(""R:XV?Z!J:E7IG^KTE%[O;?H9 )O^_
    M$-[)+(P0YFDOM:C&5BQQR;XH"ZF(XWM;``UM+-A7Y?R _*N(%7Z%XL,OA4!@I
    M94)5VYZXEMH(--&F!2>>GC&*[E6;12 J3HA::J3F4?9!TE$9T:`"BJ$`6F*,5
    MBV2!^9IR3%RL"BZ83 >W9^262&?SY_>\FG8N$W!ZM5L[LA2K:(R\RQ*DFY0]Q
    M>1T3 -ZP,.@ZSX(.1-._P9`[(,KGN6^BRJ./Z#69&'F59$7N$'T7B$= &WZ8./
    MDVLK<MX$IUDA1X/5U#J,EGLD9$'VSD+_5#W)]T?[P 6+VTV-(\:!SN[X;$R'H
    M9$?0S8VA6[M;\?AEV*=ITP@K,2T\ 6D=[2$,(AALL[<3S-,$W?O/]!D@:NCT"
    MS.?N-!H'<I;'>=E G"Z#]:>#.UOL2L_)6'\@$MSFRABR*K4V"QV+?\MNF-#*\
    M(] >7GG63Q(*,*[8.51O4K3R[-DB"R2P$C#P7=U&&\;1IA^%MMG02 ZNWOKU^;
    M?#DZ7%<>[@I<+(D;_Z.&%<N88I!410Y%MJ)RX(I C)7,!KA*-_<;CQH>HM!?`
    MT`C+BSNL^H*D";NS6QLZV4RW)L FV$ES#7-5Y^4"FTKL)5XM+H4NUT&:9E4N_
    M;&;9`-OF;19,M K3/>_HYF&$@\4;/860I6E/9B&^*'#"OB5SV5;8BCE>C@%8[
    M SG=)U>QQO(VV^]XI/@M#OV_<I9AK0Z[G>$0?E*EUB<N1MM+EIR ](_PZCI/[V
    M;8LG'"UK(<P#%_2KC.);E-M?ZO-,X;Z5.MZT$ =F4-M$EU3Y5?GEZZESS,#R=
    M&$S"7122"J.#N59BI6W*B99H /?3_D$)>WI=X.MM<9Y<\+W'VB2/)GIL.#\LM
    M9H%@.&.)`M) (I@C-RXPU4W)<5Q6*6S@P[P1D22H$,V$+33#ULXFF73XPA#YB
    M<T\VVFUGU?YI.D.JT6MW_2]3)59RJ[@_8>*2LQ5V3N*&/E/# 80>-&T&$]VGE
    M]>0'\/2S>*IMPWFLOX8$N8O:9L7OR;4SK&$ TY=U.USO:VGT3V%E"Q^C^8I1U
    M%,S;*=EV(6M9STWMAE,W_O 8]Y$;3#D@N%X]$S=V^>DPN%8:9IIVUKP9ZR:2Y
    M>I0&5$R1U #BS1[NE@/2/>+5DV/=Q"9L#=8B)*B:Z0P1F7*1^B[<&*$2VA6B )
    M$`RSZ?&$IIPE4XI5=+E6:220?#'3R1*!C$Q-%E592]APO/ O-+B'G64;-2X<1
    M,7UV\5=8<2J76)$42X^PQ_[\!O3LZMJ)/ `-]6\?IFF;75S<?7WW\[9\OX=G%
    M'R]_.5B_"S+XDV/^8H9B .NNMA/=G\)<S^#OC8LFJ-:NTVC<<P\4R>,MJO**+
    M,^#QK?6 #&>XNIL=V0J:A>3CCX>G1/\^F("0-(XA*679&)Z?HTVG*?V\.I (2]
    M`%(D:R'Y6K`5I/-I`.NF",T%$#JWO!!.3YN>1A=L+X6K *[CT<7FLVD);XU)Z
    M,OV:3,V_M@VF\*\YK*^O+R[#'J7M<_? M33V%X\;WNSC5S5KM^!\"R6'\T!+G
    M.%,F?F`/>OC?H]!*$H V.P[!JCK`/7!]!WO8#"P^;6L<+Z^$JUX;Z39RYO@0#
    MQS@WW @;>=NA'\<`D/O]BH7;OFKMHW^]QRH/PU75>WR2WRP]AT,/*:/V #/4R[
    M9-":T#\[M"O=<L[A<OK+S^$`J?W=-D(5M"-AK/<&]J ?N'H,M.R>9S/QFF^UX
    M0XXQK?M6XGYO[W?/8][6NXRWZ;:`[ 0>B$E$IBE/45M1T9:B*)2+^UD"41G06
    M57$*/WV%!"\V"%(: GR-"8(/H$&7K`B?R-`_HPQ+RYGR=TBV7(2;5