Slashdot Mirror


Are We Searching Google, Or Is Google Searching Us?

An anonymous reader writes "The folks at the Edge have published a short story by George Dyson, Engineer's Dreams. It's a piece that fiction magazines wouldn't publish because it's too technical and technical publications wouldn't print because it's too fictional. It's the story of Google's attempt to map the web turning into something else, something that should interest us. The story contains some interesting observations such as, 'This was the paradox of artificial intelligence: any system simple enough to be understandable will not be complicated enough to behave intelligently; and any system complicated enough to behave intelligently will not be simple enough to understand.' After you read it, you'll be asking the same question the author does — 'Are we searching Google, or is Google searching us?'"

35 of 346 comments (clear)

  1. depends... by Anonymous Coward · · Score: 5, Funny

    If you're in Russia, Google searches you :)

    1. Re:depends... by Candid88 · · Score: 5, Insightful

      Your version actually manages to be even less funny!

  2. This is slashdot by elguillelmo · · Score: 5, Funny

    After you read it, you'll be asking the same question the author does

    Do you mean we are supposed to read TFA? Seriously?

    --
    Dawkins Revisited: A person is shit's way of making more shit -- Steve Barnett, anthropologist.
    1. Re:This is slashdot by oodaloop · · Score: 5, Funny

      That's the slashdot paradox. Somehow linked websites' servers crash from 1 million+ geeks simultaneously failing to RTFA.

      --
      Tic-Tac-Toe, Global Thermonuclear War, and relationships all have the same winning move.
    2. Re:This is slashdot by phulegart · · Score: 5, Interesting

      Hey, we all know the unspoken rules... if you read the article, you aren't supposed to post... and if you post, you aren't supposed to read the article. That's how a million geeks can slam a site from a Slashdot link, because there surely aren't a million posts in the thread of discussion about the same article.

      Sorry about crossing the 30 word barrier though, and all the pain I caused those who have read this far...

      --
      "I love deadlines. I love the whooshing sound they make as they fly by." -D. Adams
  3. ::yawn:: by Lurker2288 · · Score: 4, Funny

    Wake me up when it starts teaching the monkeys how to use tools and kill each other. And no Republican jokes, either.

    1. Re:::yawn:: by WwWonka · · Score: 5, Funny

      Wake me up when it starts teaching the monkeys how to use tools and kill each other. And no Republican jokes, either.

      Soooooooooo, this Republican walks into a bar with a monkey under his arm....

    2. Re:::yawn:: by peragrin · · Score: 5, Funny

      Wake me up when it starts teaching the monkeys how to use tools and kill each other. And no Republican jokes, either.

      Soooooooooo, this Republican walks into a bar with a monkey under his arm....
       
        Walks into the bathroom and taps his foot under the stall door.

      --
      i thought once I was found, but it was only a dream.
    3. Re:::yawn:: by 93,000 · · Score: 5, Funny

      Bartender says: what are you doing with that jackass?

      Republican says: It's not a jackass, it's a monkey.

      Bartender says: I wasn't talking to you.

  4. And the biggest questions of all by Opportunist · · Score: 4, Funny

    Is Google searching Google some sort of self-discovery process?

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  5. Assuming that Google could reach consciousness by BadAnalogyGuy · · Score: 5, Insightful

    The best argument against this kind of ridiculous assertion that somehow random information will somehow give rise to intelligence is provided in the old movie Short Circuit. The SAINT 5 robot spends all night reading the encyclopedia and when morning comes, it is suddenly an expert on everything. But its expertise is only in pure knowledge, not the rational use of that knowledge to create something beyond mere identification.

    The only way for a robot to grow past its programming is to add the capability to do so. And simply having a system scan data and find correlations isn't going to be enough. There needs to be an action taken on the discovered correlations, and beyond that the actions need to be reprocessed back into the system in a feedback loop. And even further, it is necessary for the program to identify patterns and make intelligent decisions based on those patterns, but the intelligence necessary to make those decisions must come from external sources. I.e. the programmer.

    It's a bit outlandish to think that just because a program is constantly watching and processing inputs that it is somehow sentient.

    1. Re:Assuming that Google could reach consciousness by Opportunist · · Score: 4, Funny

      It's a bit outlandish to think that just because a program is constantly watching and processing inputs that it is somehow sentient.

      Proof: Any average couch potatoe watching TV 24/7. Constantly watching, (presumably) constantly processing, but no sentience can be identified in it.

      Your counterargument being the quality of your TV program? Gee, you know what Google is being fed constantly? See, the proof stands firm!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    2. Re:Assuming that Google could reach consciousness by phoenix321 · · Score: 5, Interesting

      Any biological intelligence does exactly the same as described: gather data (try to assess external universe model), find correlations (build internal universe model), act according to internal needs (act upon internal universe model) and repeat.

      This chain of processing is done by all brains from the fruit fly to humans. Everything else is a consequential result from this process.

      A human brain has very few hardwired constants and many of them they can be overridden.

      Feedback loops are a natural result of action to fulfill internal needs according an internal model - that is always incomplete or wrong, see Goedel - upon the external universe. In the next step data is gathered, correlations found (which constitutes the feedback loop) and then acted out according to the adapted internal model.

      A fruit fly has simple sensors, a very simple correlation engine and a tiny memory for its internal model. But that doesn't mean its following a different path than a newborn Einstein. Einstein has detailed sensors (easily surpassed by those of dogs and eagles, but still ok), a yet-unmatched correlation engine and a sufficient amount of internal model memory.

      All other inputs come from the external universe and while some of them are absolutely neccessary and come from other organisms (parents, teachers), they do not impose a hard limit on Einstein: with enough correlation power, he can easily discover new facts, unknown to any of his inputs (teachers, parents).

      Einsteins brain was never designed to do anything else than processing input signals, detecting correlations and contacting motor neurons to act upon its internal model. How did he discover Relativity then?

    3. Re:Assuming that Google could reach consciousness by rts008 · · Score: 5, Informative

      Turn in your geek card!
      It was Trinity that downloaded the program to fly the helicopter, not Neo.

      --
      Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
    4. Re:Assuming that Google could reach consciousness by QuantumG · · Score: 4, Interesting

      You make it sound so easy!

      Of course, at a certain level of abstraction, everything is easy.

      Intelligence really is that simple.. except there's one little detail you're ignoring.

      Any biological intelligence does exactly the same as described: gather data (try to assess external universe model using limited computational resources), find correlations (build internal universe model using limited computational resources), act according to internal needs (act upon internal universe model using limited computational resources) and repeat.

      That's the hard part. If you have infinite computational resources it's really trivial to act intelligently. All you need do is enumerate all possible outcomes of all possible actions with an idealized model of the world (Godel not withstanding) and pick whichever maximizes your expected reward. You can write nice long mathematical papers on this.. or even a whole book. The question is, how do you do it with a sensible amount of processing power and memory?

      All the geeks have a great laugh when Matt Groening causes Bender to become transparent and we see a 6502 inside. The joke is that Bender has about the same processing power of a C64 from the early 80s. The show is littered with additional Commodore jokes which I'm sure 90% of the viewers just don't get. But that's not what really makes it funny. What really makes it funny is that all us geeks know that you need a lot more processing power than a 6502 to do the complex things that Bender does in the complex environment he does them in. But how is that? We don't know how to do AI. We don't even have the slightest clue. For all we know, there is a tight little algorithm for AI that could run on a 6502 and produce all those crazy behaviors that Bender gets away with.

      And that's the problem with AI. The allure is that some short little algorithm exists that will magically evolve into a super-human intelligence if you just could find it and hook it up to the world. After all, nature figured out, how hard could it be? This has led many a would be mad scientist to code up a genetic algorithms implementation. In fact, most every programmer I know has given it a go. The mystery of what you'll find if you give it the right fitness function is a powerful motivator - with a little magical thinking, it could be anything!

      --
      How we know is more important than what we know.
    5. Re:Assuming that Google could reach consciousness by jollyreaper · · Score: 4, Funny

      Proof: Any average couch potatoe watching TV 24/7

      Well hello there, Mr. Vice-President.

      --
      Kwisatz Haderach
      Sell the spice to CHOAM
      This Mahdi took Shaddam's Throne
    6. Re:Assuming that Google could reach consciousness by adamofgreyskull · · Score: 4, Interesting

      Unless that's just his morality co-processor.

    7. Re:Assuming that Google could reach consciousness by ShieldW0lf · · Score: 4, Interesting

      What makes your comment really funny is, the Commodore didn't use it's CPU for everything and connect to dumb IO devices. It had a good deal more intelligence in it's various components, keeping the load on the CPU low in the same way SCSI drives don't tax the CPU like ATA does. Which is how humans work... most data doesn't ever make it to the brain, but is pre-filtered by our organs, and most complex co-ordination exhibited by our bodies is not directly orchestrated by our brain, but through various biological dumb circuits.

      The Commodore 64 had more in common with how humans work than modern computers do. I expect that once we begin grappling with the "avalanche of cores" problem in a meaningful way, modern computers will begin to be programmed in a fashion more reminiscent of how biological systems work.

      --
      -1 Uncomfortable Truth
  6. Google is definitely searching us by neokushan · · Score: 4, Insightful

    But I have to ask, is it such a bad thing?
    You know what it's like, you go to search for something completely innocent and porn comes up. It's not a fault or an idiosyncrasy of the interweb, it's google giving you what you really wanted.

    --
    +1 IDisagreeSoHeMustBeATrollOrAnAstroturferOrAShill
    1. Re:Google is definitely searching us by Opportunist · · Score: 4, Funny

      I guess I'm not the only one who doesn't mind the porn I get offered, just the kind of porn. It's that sick, twisted, perverted and utterly gross kind of porn that comes up with the searches, the kind that I certainly do NOT want. And I'm sure I'm not the only one who thinks like this, so I doubt this could be anyone's favorite kind of porn.

      Ya know, the kind that you're supposed to pay for.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  7. Well by rarel · · Score: 5, Insightful

    I don't know who's searching who, but I do know that I no longer use Google because it's "simply the best". Relevant results are always lost in a torrent of ads, fake review links and e-stores trying to sell me something that's irrelevant.

    To the point that I'm not using Google because I genuinely like it any more, but merely because I know the alternatives are even worse. In a few years' time Google went from the best to the lesser evil.

    It's... disappointing.

  8. If only.. by QuantumG · · Score: 4, Interesting

    all "magical thinking" in the field of artificial intelligence was reserved for fiction.

    There's so much rigorous mathematically described hooey in AI that its hard to tell the naive geniuses from the crackpot morons. Consider this paper by Solomonoff. Brilliant stuff! A fantastic read. Then, at the end, it says:

    In our view, however, the most interesting situation in machine learning, arises when we do not know ahead of time what program will solve a given problem and where the machine discovers the program itself. It seems to be very hard to find out much about this by theory alone. Running experiments is crucial.

    This is Solomonoff's way of reminding us that he is a mathematician and hasn't actually run any experiments. His other papers make similar pronouncements in the footnotes about the uncomputability of his math or acknowledge the requirement of perfect (aka impractical) training data, etc. He makes it abundantly clear that is work is purely theoretical and unimplementable, but does this stop enthusiastic amateurs from reading his papers and declaring that AI is "solved"? Well no, of course not.

    --
    How we know is more important than what we know.
  9. This is just inane. by KTheorem · · Score: 5, Insightful

    If a system capable of being understood could not act intelligently, then why the hell do we even bother studying the human brain? And further, any attempt at creating artificial intelligence would rely on us not knowing what the hell we are doing?

    I am tired of this kind of blanket assumption that anything humans can do that we don't understand or know how to reproduce artificially is somehow incapable of ever being understood or reproduced. We are not so special as to invalidate the existence of the mechanical processes that make us work.

  10. Google Home by MosesJones · · Score: 4, Insightful

    At JavaOne about 3 years ago there was a boffin talk with Gosling, Joy and others and one guy raise the image of hearing something drop through his letter box and then suddenly a little bot appearing in his room with a message "don't worry I'm just indexing your house for Google"

    His point was that he had two reactions to this firstly "what a huge invasion of privacy" and second "Great I'll be able to find my car keys".

    Of course Google is profiling what people do as they search, indexing everything is what they are about. The question is where this impacts on privacy and what limits we want to put on it.

    --
    An Eye for an Eye will make the whole world blind - Gandhi
    1. Re:Google Home by Tim+C · · Score: 4, Funny

      It's called a baseball bat.

  11. AI - A Myth by mlwmohawk · · Score: 4, Insightful

    We had this discussion a little while back. The mythical AI where machines "learn" how to "think" is a long way away or possibly impossible with current technology.

    The appearance of intelligence is not intelligence. A recommendations system or search engine may appear intelligent, but the part of the system that processes information "intelligently" was programmed by a person who understood the process. The computer is merely following directions.

    Some knowledge based algorithms seem unpredictable when given random data. This is not intelligence either, it is more a result of unintended consequence. You can go back and figure out why it acted a certain way.

    1. Re:AI - A Myth by BoldlyGo · · Score: 5, Insightful

      Some knowledge based algorithms seem unpredictable when given random data. This is not intelligence either, it is more a result of unintended consequence. You can go back and figure out why it acted a certain way.

      The same rules apply to people. We have a set of programming we are born with, and then we are given random data. This data and our pre-programming explains why we act a certain way. The ability to go back and figure out why we act a certain doesn't mean we aren't intelligent.

      It is a mistake to assume our intelligence is something more than a program. Our programming is just less transparent to us.

    2. Re:AI - A Myth by SpinyNorman · · Score: 4, Insightful

      The appearance of intelligence is not intelligence. A recommendations system or search engine may appear intelligent, but the part of the system that processes information "intelligently" was programmed by a person who understood the process. The computer is merely following directions.

      That not true in general. It's only true for old fashioned "code forward" computing where your code is specifying what to do with the data. With connectionist approaches, genetic computing, etc (and natural evolution), it's often the data not the code that is in control, and techniques like this are usually used specifically because you don't know how to "intelligently" solve the problem, so you instead, in essence, feed the data into an architecture where it organizes itself.

      Let's also note that even though in a software system a genetic algoritm is explicitly coded, that in nature it's not. You'll not find "the evolutionary algorithm" anywhere in any form in nature. Evolution is just the emergent behavior what happens when the necessary pre-conditions (parallelism, shared resources/competition, imperfect inheritence) exist. A reasonable way to view using the same approach in software is that you also are not really providing an algorithm - you are just setting up the preconditions/environment that will result in what you want happening, without you being aware or specifying how it is going to happen.

  12. Not everyone believes that by Namarrgon · · Score: 4, Interesting

    It's perfectly possible for insanely complex systems to arise from very simple rules. We cannot grasp the entirety of the system, but we can know exactly how to create it, or perhaps manipulate it.

    By way of example: the Mandlebrot set.

    --
    Why would anyone engrave "Elbereth"?
  13. Re:Short answer... no by wild_quinine · · Score: 4, Insightful

    We're not searching Google, we're searching the Internet. Google is a tool that can be used (and often is used) to facilitate this search.

    Nitpickers are the worst, particularly when they're wrong.

    Google searches the internet, but we don't, whilst using it. We search Google, because all the results we want are stored at Google, within Google, and we hopefully find the result we want. Only then are we directed to a site on the internet outside of Google containing the information we searched for.

    It is not entirely innaccurate to say that 'We search the internet using Google', but this assumes a logical progession: We search google > because Google searches the internet > so that we cand find what we want on the internet = We used google to search the internet. However, contrary to your misconceptions, it is MORE not LESS accurate to say 'We search Google (to find what we want on the internet).

  14. People should understand things they write about! by miketheanimal · · Score: 5, Insightful

    Turing machines were being assembled into something that was not a Turing machine The author needs a bit of theoretical computer science. However many Turing machines you assemble, you still have a Turing machine.

  15. Google's information gathering techniques. by Coolhand2120 · · Score: 4, Insightful

    While doing some debugging on some AJAX work, using tamper data (FF) and Fiddler (IEx) I stumbled upon some nefarious network communications between my mouse* events (over,move,out, click etc.) attached to every single link in googles search results. And there's more! Not only are these events present but they are silently inserted after the page is rendered. Some may say "well this is for older browsers", to that I say, they are not replacing the HREF property on the anchors, they are adding event handlers to mouse* events, and perhaps more that I'm not detecting. You can not see this stuff just by viewing the source. You would need to activate the event that creates the mouse* functions. E.g.: mouse over, and then mouse click gains a new event, so trying to look at the source before the mouse over event occurs yields an null function. Any attempt to look at the source code that google is running (the script handling the events) will be met with a really good obfuscator. Google does this to just about all their public code, e.g.: google maps. The most I can realize about the extra events is that they send a LOT of information to google whenever you click on anything. But don't take my word for it, fire up FF and the latest version of Tamper Data, click 'stop on next line' or whatever engages the debugger (I can't be bothered to look, I'm working on err. something.) and mouse over or click the links on googles search results and watch your data fly over to google, in a rather secretive manner.

    It may just be nothing. Every search engine tracks what link you click on, and I think this is one of the more elegant, albeit backwardly incompatible, ways of tracking what links are clicked on. Yahoo does something similar, but they use the 301 permanently moved header with a specially crafted HREF in the anchors, you can see this pretty plainly if you open up yahoo and mouse over the links, they all point to yahoo, then you're redirected to the search. From a coding perspective this is more compatible but annoying to the end user as the link is not what it says it is going to be, it's a yahoo redirector. This means if you try and copy the link from the result you'll get some yahoo bullshit. I like googles method better, but it leaves a lot to be desired in the 'forthcoming' area.

    Google also maintains a network of 'adsense' tracker scripts on hundreds of thousands of 3rd party sites, I have several customers that swear by their visitor tracker. It's kinda neat, and it's free, however, I'm sure google does not just ignore the statistics gathered by its tracker. These numerous sites make up a good chunk of the internet, so even if you don't visit google, google sees you indeed. They can track every site that participates, reading referrers and IP addresses, I could imagine some very simple algorithms that could, for the most part, piece together what other non-participating sites you've visited based on the information gathered when you do eventually visit a participating site.

    Google Underhandedness IMHO:
    1. Adding the even handlers after the page has loaded. There may be a technical reason, but it's just creepy.
    2. Sending volumes of information back after each click. There really needs to be a limit. Do you really need my browsing history!?
    3. Creating a GPS like grid of sensors on 3rd party sites. This is the creepiest. Google can tell where you are, where you've been and where you're probably going to go with this, and you don't even need to visit google a single time to be added to this network! in fact you don't have any choice whatsoever in the matter!

    What Google can do to fix this perception:
    1. Quit obfuscating your damn code! It just makes you look guilty when you basically say "Don't look here" in something that is "sneaking" it's way into the source. It's not like google came up with the damn cure for cancer in their JS, what are you try

    1. Re:Google's information gathering techniques. by Anonymous Coward · · Score: 4, Insightful

      >You're crossing the line of, what I think is relevant to your primary customers, searchers.

      Google's customers are _advertisers_.

  16. Re:Short answer... no by edittard · · Score: 4, Funny

    "The internet" cannot be searched, there's no protocol for it

    The real reason is that some of the tubes are bent, and you can't see round corners.

    --
    At the bottom of the /. main page it says 'Yesterday's News'. Well they got that right.
  17. Genes and self-modifying programs by tucuxi · · Score: 5, Interesting

    Parent is right. As long as there is no way for the programs running on Google's hardware to grow past their original programming (beyond optimization and load-balancing), there will be no Skynet.

    Yes, many computer programs work in a feedback loop, and so do all organisms. But as long as only the data entry part of the loop can change, and the system lacks the flexibility to change the type of processing that takes place (the 'program'), no spontaneous evolution will occur.

    Several factors are needed to get us to the bleak, dark, machine-vs-human Sci-Fi universe slashdotters know and love.

    • The would-be AI programs must be free to rewrite portions of themselves. Self-modifying code is generally frowned upon as being very hard to write and debug, and outside academia (evolutionary programming?), nobody is pushing it. Also, current approaches need massive amounts of processing for meager results.
    • The programs should be free to replicate. While Google has a lot of machines, they probably don't want runaway programs hogging the CPU cycles (they are not in the heating business). Internet-roaming malware is a much more likely than Google-sponsored code to eat over the Internets. Partly because the cheapest way to replicate is not asking for permission, and evolutionary systems will take shortcuts whenever available.
    • There must be evolutionary constraints to help weed the "unsucessful" strains. If a viral, self-modifying program manages to get everywhere and "kill the host" (bog down the net completely), it will no longer evolve. Fortunately, there's lots of different systems hooked up to the 'net, and colonization would be hard enough.

    The first point is the most difficult. It is *not* easy to take pieces out of two programs and build a third program that does things that both do. Whatever OO promises, code is not yet "easy as lego blocks" to assemble. You need very well though-out constraints to mix code in a meaningful way - any self-modifying program would need a small, hard-to-modify kernel that would take care of the mixing mechanism. Nobody knows how to design such a kernel correctly, or what exactly to include as 'genes' (mixable code modules). Computational biology (and biology itself) are hard at work on this problem.

    But mixing blocks would not be enough. A successful system would need to build new, unseen blocks by modifying existing ones -- or starting from scratch. How many different things can you say in 20 words? How many of these things make any sort of sense? And how many of those require a very, very specific context to fit into?. The way that evolution can sort this out is by, very slowly, building things that sort-of, kind-of get the job done. However you look at it, there will be huge amounts of trial-and-error involved.

    And another problem is that of intelligence "scale". Imagine a super-self-modifying internet worm. The ability to probe and infect does not automatically lead to self-consciousness. There are many, many evolutionary steps from bacteria (very good at self-modification and breeding) to humans. And the current installed base of Internet-connected computers and their "stability" (the time-frame during which a given system remains 'constant') is tiny in comparison to the resources that earths' organisms have had at their disposal for evolutionary purposes. Yes, computers are way fast and this can compensate for some parallelism issues. But I still think that emerging AI is still very, very far off.