Slashdot Mirror


Ask Slashdot: Where Can I Find "Nuts and Bolts" Info On Cookies & Tracking Mechanisms?

New submitter tanstaaf1 writes: I was thinking about the whole tracking and privacy train-wreck and I'm wondering why specific information on how it is done, and how it can be micromanaged or undone by a decent programmer (at least), isn't vastly more accessible? By searching, I can only find information on how to erase cookies using the browser. Browser level (black box) solutions aren't anywhere near good enough; if it were, the exploits would be few and far between instead everywhere everyday. Read below for the rest of tanstaaf1's question. On Amazon, I haven't found a likely good book on the topic. There are books on protocols but I'm really only interested in how I can detect and track and block, and erase, and re-write and spoof all the tracking attempts on a case by case basis. Maybe a book on how to write my own tracker — or my own tracking blocker from scratch?

In theory it wouldn't seem to be that hard to uttlerly micromanage your own computer. Here's how I think it could be done:

(1) Have an explicit on/off switch, ideally OS based and trivial to control with a mouse-twitch, which turns internet access on and off as certainly as a mechanical light switch controls lights. Along with this, maybe the whole screen can change color, red-light green-light, to keep the user always aware of incoming or outgoing traffic. I should instant be able to get detailed information on any unexpected write or read request. Think unix "ps" or better. (Actually, a file system which allowed the owner to attached detailed memos and other information would be a nice touch...once litter builds up it quickly gets easy to hide real malware everywhere; that is a common technique used by embezzlers everywhere — create chaos and then hide your exploits within it).
(2) When the browser is started, make it start in a fresh virtual space / sand-box. Then copy into that space any "cookies" or other information I explicitly care to put into that space. I would, for example, put in site specific cookies to allow sites i whitelist to identify me. A good database of all the files in my virtual space, how they got there and what they are used for, would be really nice to see.
(3) As you browse you can block or not block ads and trackers; the add-ons already exist.
(4) When you decide to exit the browsing session, at least, the computer should save important cookies from sites you frequent for later restoration.
(5) The entire virtual space is then shredded and deleted.

This could all be done at a finer grain, I'm sure, but I wanted to lay out an overall strategy — and ask:
(1) What am I missing?
(2) Has this already been done and automated, say, under Linux? (I wouldn't expect Microsoft, Apple, or Google to facilitate this sort of security under their OS systems; foxes guarding the hen house and all that. However, even under Windows and OSX I can install virtualbox...)
(3) Why is it so hard to find the specifics of, step-by-step, how (not why or if) we are being conned and raped and what, specifically, can be done to stop it? Why are we screwing around with all these endless add-ons instead of striking at the root of the problem? Or have I not really identified the root?

I would appreciate any specific feedback on my scheme or, even better, a link or three.

10 of 84 comments (clear)

  1. Verizon Stealth cookies are undeletable by QuietLagoon · · Score: 2

    Now that Verizon has hooked up with AOL to share cookie data and personal information, it sure would be nice if the Verizon stealth cookies could be deletable.

  2. Can Verizon Stealth cookies be spoofed? by Okian+Warrior · · Score: 2

    Now that Verizon has hooked up with AOL to share cookie data and personal information, it sure would be nice if the Verizon stealth cookies could be deletable.

    Just a quick question, can the browser insert its own Verizon stealth cookie into the request URL?

    And if that can be done, can it be used to poison the data, or even crash the Verizon tracking system?

  3. You're making it more complex than it is by forevermore · · Score: 4, Informative

    You should be able to find some pretty straightforward documentation on HTTP cookies, flash data storage, HTTP Local Storage, and browser fingerprinting (see https://panopticlick.eff.org/ ). The tracking services aren't doing anything fancy -- they're just sharing that identifier behind the scenes. When you visit website1 they assign an id to your browser (via a cookie, or whatever). When you visit website2, it loads a script from website1 that puts your id somewhere into the DOM that website2's scripts can read and website2 assigns that id to your browser as well. Website3 does the same, and so forth. Then, websites 1 through N share the browsing habits of your id amongst themselves and gain some insight into what your browser is doing.

    --
    Do you really need reason for beer? Wingman Brewers
    1. Re:You're making it more complex than it is by Bite+The+Pillow · · Score: 2

      Also, the correlating behind the scenes that happens when you access multiple websites that use either a CDN or something like ajax.google.com

      Everyone hosting their JQuery on Google's servers basically allow Google to correlate visits, and build up a picture of which websites you visit. Combine that with direct access to GMail, Youtube, or Google searches, and they pretty much know what you do at least half the time on the internet.

      You're going to have to wipe everything, including your IP address, in order to avoid the kind of correlation that Google does, or Verizon and AOL, or any number of big data providers.

      Visit one website, no multitasking, torch everything, and start over. Best done via proxy. One that allocates IP addresses randomly. And switch proxies every time you visit another website.

      Or, block everything and only visit bookmarks and don't allow JavaScript and never give any information and... yeah, there's a whole lot more behind the scenes that does not involve delivering information to your browser as the delivery mechanism.

  4. Evercookie & uBlockOrigin by Anonymous Coward · · Score: 3, Informative

    Evercookie is how it is being done.
    UBlock Origin is how to block a ton of stuff.
    Both are open source so you can have a look at it.

    But in case of Verizon, you're talking about tracking on a whole other level.
    And since your MAC or IMEI device number are needed to allow you access on their network, there are no options to spoof that.
    The best you can do is block what they throw at you as a result of that data.
    Adding a VPN would make life for them pretty hard because no server side deep packet inspection can be applied.

  5. A number of ways... by CloneRanger · · Score: 2

    You can be tracked and identified by a large number of ways. Its not just cookies, its anything you click on, its hidden variables, its the URL, applets, javascript, and even your IP address. Have you heard of a Firefox plugin called Ghostery? Look at all the things it blocks. That will give you more clues about how you are being tracked. Cookies are not in themselves bad. They were designed for developers to cache information so that they could remember what the user was doing when they clicked. Advertisers decided to use them for different purposes. Then agains, the web sites are partly to blame. They want to know what you were doing, what pages you liked, where you spend time. It lets them know what interests people. But the sites have found that by signing up for programs that track users across multiple sites, they can get a deeper understanding of their customer. So, they deploy tracking code/cookies/pictures so that the companies who track across multiple sites can get info to share with them. Its really complicated.

  6. safe browsing by btb1 · · Score: 3

    A good Samaritan did the heaving lifting... https://gist.github.com/atcuno...

  7. Easy by lucm · · Score: 3, Informative

    If you are that preoccupied by evil cookies, use Linux Tails like Edward Snowden recommends.

    But really, you overestimate your value for those who allegedly "rape and con" you with cookies. They care about trends and patterns, not about you as a person, so browsing the web in a virtual space that you "shred" afterwards is more of a hobby than a necessity. Modern browsers are well-equipped to provide a decent level of privacy, there's no need to go thinfoil hat over this.

    --
    lucm, indeed.
  8. Re:Can Verizon Stealth by raymorris · · Score: 5, Interesting

    You can easily add your own X-UIDH header. It is likely that Verizon's proxy wouldn't add another if one were already present. It's also possible that the request would be sent on with two (or more) X-UIDH headers. Most programming is sloppy programming, so they probably didn't account for this correctly. It's extremely likely that random strings in the X-UIDH header would confuse the system.

    As I mentioned, most programming is sloppy programming. People keep making the same mistakes. One common mistake is, what if that string that's supposed to be about 16 characters is instead 500,000 characters (500KB)? Or 2MB? Things might break. What if it contains null characters (ascii value 0)? A lot of things break when strings have embedded nulls. Strings that are used to query a database to get a user's information often break when single quotes and semicolons are present.

    That said, it's also likely they use popular off-the-shelf, premade software for the proxy, and it's protected against the most obvious attacks. Their database query routines are probably written by their own programmers, and those programmers probably aren't security experts.

    Obviously, trying to harm their systems could very well be unlawful, even criminal. "I just sent web requests" might be about as convincing to a judge as "I just waved my arms around (while holding a knife)".
    I hope the bad guys don't mess with them too much.

  9. Re:Where can I find an extension by in10se · · Score: 2

    Except media queries are performed locally within the browser - not on the server. Even if the media query is specified in a link element in the head, the CSS file is still downloaded even if it doesn't fulfill the query requirement.

    https://scottjehl.github.io/CS...

    --
    Popisms.com - Connecting pop culture