Slashdot Mirror


Xeroxing Personal Data From Your Browsing History

grease_boy writes "Xerox has filed a patent covering a technique to recover demographic information like your age, sex and perhaps even your income by analysing the pattern of web pages you browse. They want to license the technique to online advertisers and shops. Read the full patent here."

27 of 116 comments (clear)

  1. Hehe by HomelessInLaJolla · · Score: 2, Interesting

    Because nobody could have ever thought of this before.

    --
    the NPG electrode was replaced with carbon blac
  2. More lame patents by omeomi · · Score: 4, Insightful

    Wow, great, another patent covering something completely obvious, like analyzing my browser history to find out what sorts of things I might like.

    1. Re:More lame patents by catbutt · · Score: 4, Insightful

      Typically patents cover the particular way something is accomplished. For instance, if you invent an airplane and patent it, you are not patenting flying in general (which might be said to be an obvious idea), but the specifics of how you accomplished it. Someone else, who uses a different approach to fly, will not infringe your patent.

      Did you read the 20 claims before assuming their technique is obvious?

    2. Re:More lame patents by QuantumG · · Score: 5, Funny

      Of course not. This is Slashdot, no-one here understands patents and no-one here thinks anything should be patentable.

      --
      How we know is more important than what we know.
    3. Re:More lame patents by Brickwall · · Score: 3, Interesting
      Geez, anyone who read Peter Wright's "Spycatcher" - published 20 years ago - would realize that this patent is merely a specific implementation of the techniques Wright described in the book. In particular, he developed the idea of monitoring radio communications from the Russian embassy in London. All the transmissions were in code, so he and his colleagues had no idea of the content of the messages. However, Wright deduced that when the Russian "Resident" left the embassy, coded traffic between the embassy and his car would increase. By simply matching the comings and goings of various individuals from the embassy with increases and decreases in radio traffic volume, he and his team were able to pinpoint Russian intelligence agents promptly. (The whole book is a great read, by the way, and available at your local public library, I'm sure.)

      In fact, I tried to get Nortel to implement something similar on their Meridian phone systems back in the early 90's. I thought that by tracking internal phone calls through "Call Detail Records" (CDR - which list the calling extension, called extension, date, time, and call duration), we could see patterns of calls between departments, and determine if repetitive patterns existed, such as many calls between sales and billing at specific times of the month, etc. I thought this might help identify operational issues, inefficiencies, etc. Of course, I was blown off.

      Anyone want to give me a job?

      --
      What was once true, is no longer so
    4. Re:More lame patents by baryon351 · · Score: 2, Insightful

      It sounds obvious, but also stupid.

      One reason marketing likes the whole age/sex/race/income thing, is that they can derive from that some possible information about what you might be interested in, in order to market to you. It's not perfect, as not all 16 year old males with wealthy parents will be interested in buying insanely expensive gaming systems, and not all 85 year old women on a pension will want to just email photos of family around, but it works better than guessing.

      So, what they're really after is what a specific person is interested in. What does a specific hypothetical user browse? what do they spend money on, what are they interested in, etc.

      And now in a world where it's so easy to get that specific information on hundreds of thousands people, they want to use that specific information to derive less reliable demographic statistics, in order to derive what a particular person might be interested in...

      Nuts.

    5. Re:More lame patents by MillionthMonkey · · Score: 3, Insightful

      I like this kind of software patent. This isn't like most of these annoying software patents on elementary algorithms that any student of computer science would be expected to discover. This is a patent on an algorithm we would expect only to be discovered by assholes.

      What is unfortunate is the fact that asshole companies like Xerox (and its licensees) intend to engage in this particular form of assholish behavior in the first place. But a patent is basically an instrument for preventing people from doing things. If Xerox were granted this one by the PO, it would only play a positive role in the world. It would put Xerox in the position of being able to prevent other asshole companies from mining your browser history for personal data (at least in the fashion described) except under the terms they specify. Conceivably Xerox could prevent anyone from doing it at all simply by not granting licenses. If the EFF had filed this patent with those intentions, none of you would be complaining.

    6. Re:More lame patents by QuantumG · · Score: 2, Insightful

      Oh yes they would. Because what you just did is called rational thought and the Slashdot crowd is not capable of rational thought when it comes to patents.

      --
      How we know is more important than what we know.
    7. Re:More lame patents by martin-boundary · · Score: 2, Insightful
      If it's a mathematical algorithm, it should NOT be patentable. I have no objection to Xerox patenting the part where they pay a person to snoop through the web history and make a wild guess while swivelling left and right on their swivel chair (*)

      (*) that part wouldn't be very valuable but if they felt it was worth it :)

      Most mathematics is nontrivial and highly ingenious, but just because it's non-obvious that still doesn't mean it should be patentable. EVER.

    8. Re:More lame patents by martin-boundary · · Score: 2, Funny

      With all the dupes, it's enough to read one's browsing history and predict TFA :)

  3. In order to protect my identity ... by khasim · · Score: 5, Funny

    ...I must download more lesbian pr0n.

    Get real. This is worthy of a patent? Just by the fact that you're reading this post you're most likely male, some college, etc.

    1. Re:In order to protect my identity ... by inviolet · · Score: 2, Funny

      To really confuse Xerox, you need to shop online for diapers, ladies' undergarments, walking sticks, power glasses and valentine's day cards....

      [Marge] "Whatever you've got planned tonight, count me out!"

      --
      FATMOUSE + YOU = FATMOUSE
  4. Remember, kids... by Atario · · Score: 2, Insightful

    ...pay attention to those tracking cookies.

    --
    "A great democracy must be progressive or it will soon cease to be a great democracy." --Theodore Roosevelt
  5. Can you patent an illegal process? by Anonymous Coward · · Score: 4, Insightful

    The patent may well have merit but to be used it would have to break the law. Notwithstanding that governments may keep them for national security reasons, if the law in a country prevents a third party using or selling browsing habits for commercial purposes is it possible to take out a patent that presumes illegal behaviour? Such as a method of extracting money from a bank using a shotgun? Aren't they getting a little ahead of themselves in thier race to the bottom of corporatist fascism? Or is this very revealing patent application telling us that they consider buying the necessary laws to use it a mere formality?

    1. Re:Can you patent an illegal process? by joe_cot · · Score: 2, Informative

      Point: quite a bit of spyware is perfectly legal. It's very clear in the EULA for the product it's strapped to, and it has a clear uninstaller (which breaks the program you wanted). Many people will use it anyway to use the program for free, or are just too clueless to notice.

      Point: a company does not need to do anything "invasive" to track your browsing habits. Back when DoubleClick had a majority share on banner ads, they could track what websites you went to simply based on the referring url when you got loaded of their ads on a site. Google Adsense can do the same trick, since it's a remotely loaded javascript. Why do you think Google Adsense ads are so damn targeted? They have your whole browsing (and probably email) history to work from.

      In summary, it's not unprecedented, and it happens all the time. That being said, unless this was some innovation by Xerox labs in the 80s that never saw daylight, there's more prior art here than the Patent Office can shake a stick at.

    2. Re:Can you patent an illegal process? by Ungrounded+Lightning · · Score: 3, Insightful

      Can you patent an illegal process?

      Why not?

      The law might change to make the process legal before the patent runs out. But you need to patent it right away to establish priority.

      Meanwhile you could sue everybody accused of breaking the law in your particular way.

      The police do your investigating and the prosecutor does the work of putting the information together for you, too. And the jailers keep the infringer in a known place for your process server. How convenient.

      --
      Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
    3. Re:Can you patent an illegal process? by asninn · · Score: 2, Insightful

      Point: quite a bit of spyware is perfectly legal. It's very clear in the EULA for the product it's strapped to, and it has a clear uninstaller (which breaks the program you wanted). Many people will use it anyway to use the program for free, or are just too clueless to notice.

      That doesn't necessarily make it legal. If someone wants to install a piece of spyware and realises what he's doing, that's fine, of course, but for someone who's too clueless to notice, it's not nearly as clear. I might argue that there is no meeting of minds, and thus no contract (that's assuming we're in a jurisdiction where EULAs can be binding contracts at all). You might argue that it's his duty to read the contracts he's signing/accepting, so it's his own fault. I might argue that many EULAs are written / presented in a way that deliberately makes them hard to read, and that software manufacturers shouldn't get away with relying on people not reading things after they made it as difficult as possible to actually read them. And so on...

      I don't know what the solution is, but it's certainly not clear that spyware that's mentioned somewhere deep down in an obfuscated EULA and that will get installed without people realising (and that people would NOT install IF they knew it was there) is really legal.

      --
      butter the donkey
  6. Re:So if I'm looking at bestiality pr0n by Anonymous Coward · · Score: 5, Funny

    Sheep farmer, Male, Scotland.

  7. Xerox must be doing worse than I thought by StringBlade · · Score: 4, Insightful

    They used to come up with new and innovative ideas such as, the Xerox copier, a graphical user interface using windows, and a host of other innovate technologies.
    Now they've reduced themselves to patent trolling in order to pander to marketing scum. Just, wow.

    --
    ...and that's the way the cookie crumbles.
  8. Just an application... by sotweed · · Score: 2, Insightful

    .. not yet a patent. Look for it as a patent in 2-3 years. Maybe never.

    1. Re:Just an application... by mavenguy · · Score: 2, Informative

      It's more complicated than that. This is a divisional application of another application which has issued as a patent. Furthermore, they filed another divisional application which also has been published.

      Divisional applications are made when the examiner determines that more than one invention is claimed in an application and requires the applicant to elect one of the inventions to prosecute on the merits. The claims to a non-elected invention(s) are not examined in that application and must be cancelled if the elected claims are allowed, but may be presented in a new application which will get the benefit of the filing date of the parent application if filed before its issue or abandonment date. Any claims in a patent issuing from a divisional application are immune to an attack of double patenting over the claims of the parent patent if the divisional was filed in response to a requirement made by an examiner.

      The disclosures of all the applications/patent here are all the same, since no new matter is added in the divisional application (theoretically they could do so, but such added matter would not be entitled to the filing date of the parent application); the differences among the applications is in what is claimed, and, as with any patent/application, the claims define what the invention is, not the title, abstract, etc. The terms used in the claims are, however, interpreted in light of the description in the specification.

  9. Re:So if I'm looking at bestiality pr0n by grolschie · · Score: 2, Funny

    hhhmmm... that's weird. My data-mining algorithm produces: "Sheep farmer, Male, Australia". Well, it's not an exact science. :-)

  10. Xerox, my ass. by Anonymous Coward · · Score: 2, Funny

    They've also given name to a photocopy of my buttcheeks.

    PC LOAD TP.

  11. This assumes most people passively surf by tekrat · · Score: 3, Insightful

    You know, I'm really sick of the whole "Guess your personal needs based on browsing habits". I get this enough from Amazon, recommending crap to me that I don't want, but that I sell to others.

    I run a website which sells stuff. Now, it may not be stuff I personally want, but obviously other people do. So, I go through Amazon looking for products to sell. Of course, the advantage is that Amazon recommends items to me that I might sell to the other people reading my site, so it works out, but still, Amazon has a screwed up image of what I want as an individual.

    Now imagine all these people who do searches online to find crap to feed their blogs. All the people who scour the internet in search of material for websites, stuff they are going to mention in passing, and then move on.

    All the marketing people are going to get is that 50% of the people who surf the web want to see dismemberments via locomotive accidents on YouTube. That's the "vector".

    The point I'm trying to make is that only half the people on the internet are the passive surfers this technology would work with. The other half are people who create the content online via looking for content online. (and then there's a small percentage who actually create content, but they don't surf as much).

    So, the entire concept to start with is screwed because it assumes that the web is TV.

    --
    If telephones are outlawed, then only outlaws will have telephones.
  12. Use Firefox Add-Ons; don't log into Yahoo! Google by SpecialAgentXXX · · Score: 2, Informative

    I use Firefox's NoScript and AdBlockPlus. Between them both, I've blocked out all of the Yahoo!'s and Google's ad-tracking capabilities (and nearly all ads in general). Without cookies to track from one site to the next, I don't see how this will work unless every site does URL redirection. Also, don't log into Yahoo! or Google, then surf. They can record all of your search queries and tracking via your logged in cookie.

  13. Not too much to worry about by PatentMagus · · Score: 2, Interesting

    Well, they aren't trying to patent statistical analysis or tracking cookies in general, but they are grabbing for a lot. The claims don't look too impressive either. All the claims are too broad without a single super narrow claim that might just get accepted. Maybe the person who wrote the claims didn't understand the math. Double click, some of the old shopping sites, and similar companies must have had this well before the 11/2/2001 priority date. There is nothing in the claims or the application saying it only applies across domains. The examiner is probably going to reject every claim and love it because it won't take too long.

    1. A machine-implemented method for extrapolating user profile information from user web page access patterns, comprising:

    computing bias values for a plurality of web pages; ancient and general

    assigning said bias values to the plurality of web pages; ancient and general

    detecting at least a subset of said web pages accessed by a user having an unknown user profile attribute; ancient and general - tracking cookies?

    combining said bias values of said subset of web pages to obtain a combination result; and ancient and "combining" is way general

    assigning a selected user profile attribute to said user in response to said combination result indicating a positive bias of the selected user profile attribute; ancient

    wherein computing said bias values for the plurality of web pages further comprises determining a fraction of users with the selected user profile attribute who visit a selected web page as measured over the plurality of web pages.hmmm, intersting limitation. Calculating a demoographic from the tracked pages. Pretty ancient too.



    Eventually, this app is going to land on a junior associate's desk who will try to rescue it. In the long run, there's a decent chance that Xerox will get a patent on a very specific algorithm applied to limited data that is gathered in a few different ways. Hey, inventions like electrostatic duplication don't come along every day.

    --
    I am a lawyer, but not yours. Anything I tell you might be a total lie intended to benefit my clients at your expense.
  14. It'll be used as smoke and mirrors by Moraelin · · Score: 2, Interesting

    Aggreed, it is stupid, if your purpose is actually to do marketting with it and sell more products. But my guess is that it's not how it'll get used.

    Thing is, if you think about it... it fits just neatly in the eternal 3-way total war, whee the ad provider tries to shaft both the advertising company and the web master, and in most cases the two try to shaft the ad provider too. Tons of useless metrics exist just so the ad provider can tell some company "here's why you owe us a big pile of money for serving your ads", or so they can tell the web master "here's why we owe you a pittance."

    (And just so it doesn't sound like the ad provider is the only scumbag, the whole dot-com bubble was based on the "hey, look, we can rip off the ad providers" idea. Ad rates in the beginning were based on sites which had one banner on the whole site. It tended to be somewhat targetted too, since if the web master chose just one, it tended to be somewhat related to what the site was about. And people used to even click on them occasionally. And they were worth decent money. Then some people discovered, basically, "woo, but if we put 10 ads per page, now we're owed gazillions of dollars." Whole companies went to IPO with that as their only business plan. But I digress)

    The fact is, the _only_ real criterion of whether a marketting campaign was successful, is whether you sold more stuff as a result. Everything else, eyeballs, clicks, etc, is just smoke and mirrors. It's just some useless metrics that get gamed all the time.

    E.g., it may sound like "clicks" is a relevant number, but it is only in a world where everyone clicked only because you got them equally interested in the product. Once you figure out other ways to fake it, comparing numbers of clicks becomes apples to oranges. And once you give someone a criterion like "number of clicks" to justify their salary, we're already seen the result: fake UI ads, punch the monkey ads, and outright redirects served as ads. It doesn't mean that those people became more interested in a company's product just because they got hijacked, but on a "look how many people we got to click your ads" statistic it looks the same.

    So my take is that this is what it will be used like. Some ad provider will make up some scientific-sounding "how well we matched your ad to a target demographic" metric and use it to justify why you should pay a premium to advertise through them. Never mind that the demographic was a wild guess, and it actually lost information in the process... twice. It will look neatly in the marketting materials anyway.

    --
    A polar bear is a cartesian bear after a coordinate transform.