MIT Creates Tor Alternative That Floods Networks With Fake Data (softpedia.com)
An anonymous reader writes with word that MIT researchers "created an alternative to Tor, a network messaging system called Vuvuzela that pollutes the network with dummy data so the NSA won't know who's talking to who." Initial tests show the systems overhead adding a 44-second delay, but the network can work fine and preserve anonymity even it has more than 50% of servers compromised.
More wasted bandwidth!
This is potentially good for an obfuscated messaging service, not an encrypted internet proxy for all traffic.
Silence is a state of mime.
I wonder what % of Tor servers are compromised. I abandoned Tor when I realised it didn't mix in junk traffic like this, as traffic analysis through compromised nodes/routers is such an obvious vulnerability that it seems to render Tor worthless.
I was talking to my Google-employee brother the other day and voicing my prediction that 'virtual camouflage' would become a defense against data mining and spying, similar to as described in the article. He thought the idea was ridiculous, and even if it were to come to pass, would be defeated by statistical means. Regardless, secure p2p communication is an arms race, and the virtual environment closely resembles nature in unexpected ways.
MIT was once the number one non-profit Department of Defence contractor in the nation. Don't know how much funding they get these days but it certainly seems as though this solution is provided to you by and for the U.S. Government.
This is just like in Neal Stephenson's novel Anathem. Except when the system became fragile, the noise was mixed with the signal so most communications became worthless.
Any bozo could write random garbage and waste bandwidth. Write something that can split encrypted data at the client through multiple nodes and recombine encrypted packets at the server. And make it an IP level protocol! Idiots!
i wash my hands with a firehose
So they're just tooting their own horn.
I eat only the real part of complex carbohydrates.
If this is designed to flood a network with junk data to conceal the relevant data, could this be interpreted as a form of a denial of service attack if it decreases network performance?
I see this as the proverbial "big stick" to push back against the conglomeration of TLAs and communication oligarchies.
"You don't want strong encryption? Then we will do this!"
Bent, folded, spindled, and mutilated.
Generating random message traffic to thwart message analysis and hide true communications is an old trick. It's really a form of steganography, just not a very efficient one. By participating in one of these networks, you draw suspicion.
People who really want to communicate clandestinely probably just use public forums and image sharing sites as digital dead drops for steganographically hidden messages. There are many steganograhic systems for a medium of your choice, many of them even auditable and open source.
Just get Netflix, Amazon, Hulu, etc... to stand up Tor exit nodes. Chum the pipeline with things like Gigli and The Last Airbender and let the NSA filter through all that. Maybe they'll just kill themselves - I know I would.
It must have been something you assimilated. . . .
> pollutes the network with dummy data
probably not so different from internet as we know it, isn't it ?!?
Who's talking to *whom*.
I agree. It would be better to disguise your data as normal traffic!
Bittorrent might be an ideal choice, since we've seen evidence that the sp00ks discard that traffic wholesale. You'd set up a legit bittorrent repository with a few thousand popular files, on a server which publicly appears to enforce a strict up/download ratio, which would help explain why so many clients stayed connected for long times downloading small chunks. Your own client would essentially join a cloud of encrypted proxies and contribute relay services.
Other good choices would be disguising traffic as https browsing blogs, forums and decentralized social networks. Your obfuscation server would be like a plugin to these kinds of legitimate public servers. It would look just like people were just catching up on news and posts. The higher the traffic the site has, the better. Occasionally, we might see major websites get hacked to include the obfuscation feature for a few days here and there. Sp00ks will have to log all https traffic to and from cnn, yahoo, google, microsoft, lolz!
Since sp00ks count bytes to determine what page you're looking at, even when using https, the obfuscation server could pad the packets to simulate a walk through various actual threads or web pages it hosts.
It's a perfect name for it, regardless if it works. Tells you exactly what it does... "WHAT??? WHAT?!?!?! FUCKING VUVUZELAS!!!" https://www.youtube.com/watch?...
That was in use as well. It really depends on what the traffic is being optimized for. Sometimes the shortest distance is not the quickest route nor is it always the most efficient route. There are other issues like throughput and capacity. We eventually expanded to do pedestrian traffic modeling which was quite different and meant a whole lot of new learning and research. It's akin to modeling chaos, we humans aren't entirely predictable.
"So long and thanks for all the fish."
That we do... My Ph.D is in Applied Mathematics and I truly stood on the shoulders of giants. There were many, many brilliant people not just before me but, by grace of luck, around me. Traffic is a fickle thing, we humans aren't that bright. One of the reasons I am skeptical about the speed with which we'll get fully autonomous vehicles is because of my familiarity with traffic. There are many things that people do not consider, the biggest one of which is privacy.
However, 'tis late and I've not slept much. I'm working on a project (a bet with a friend) so I've been busy coding a site and, probably, will work on that some more. Otherwise, it's me... I'd be happy to type a novella concerning traffic modeling but I've typed them all before and can think of nothing new to add. Not all traffic engineering is done with the goal of increasing efficiency or throughput. If you look into pedestrian traffic, specifically in areas such as retail environments, the goal is to direct, pace, and keep it orderly.
Even with vehicular traffic, there's a lot of psychology that goes into it. There are so many elements, so many variables... It'd take, well, a life's work to describe it in detail. I did have, at one time, the absolute greatest traffic sim game on the planet (I'm biased) but it did lack much in the way of graphics. Well, it had graphics later but it sure didn't start that way. I can't even begin to imagine the compute cycles that we'd have needed in order to add graphics back then. By the late 1990s we had disk arrays that enabled us to work with data sets that were nearing the 1 TB mark. There's a whole lot of fun involved. ;-)
"So long and thanks for all the fish."
Source code available here: https://github.com/davidlazar/...
That system seems to require a lot of random data. What is the plan to gave good enough entropy sources so that it is not broken by being predictable?