Slashdot Mirror


MIT Uses Machine Learning Algorithm To Make TCP Twice As Fast

An anonymous reader writes "MIT is claiming they can make the Internet faster if we let computers redesign TCP/IP instead of coding it by hand. They used machine learning to design a version of TCP that's twice the speed and causes half the delay, even with modern bufferbloated networks. They also claim it's more 'fair.' The researchers have put up a lengthy FAQ and source code where they admit they don't know why the system works, only that it goes faster than normal TCP."

23 of 250 comments (clear)

  1. I'd be wary. by Fishchip · · Score: 5, Funny

    This is how things like Skynet get started.

    1. Re:I'd be wary. by Sqr(twg) · · Score: 5, Funny
    2. Re:I'd be wary. by jamesh · · Score: 5, Funny

      At the very least i'd be doing a grep for things like "kill all humans" in the source code.

  2. Uh Oh... by Anonymous Coward · · Score: 5, Funny

    they admit they don't know why the system works, only that it goes faster than normal TCP

    And so it begins...

  3. All Jokes Aside... Still No. by Jane+Q.+Public · · Score: 5, Insightful

    Allow a computer to design a faster TCP? Sure!

    Let them actually implement it without knowing how it works? Oh, Hell no!

    I'm not talking "Skynet" or anything here... but if it breaks, who's going to fix it?

    1. Re:All Jokes Aside... Still No. by Intropy · · Score: 5, Interesting

      We're already in that boat. One of the reasons it's so hard to make changes is that nobody really knows why the internet works. We know how and why individual networks work. We can understand and model RIP and OSPF just fine. And we know how BGP operates too. But that large scale structure is a mess. It's unstable. The techniques we use could easily create disconnected or even weakly connected networks. But they don't except for occasionally a single autonomous system falling off. We've built ourselves a nice big gordian knot already. We know what it's made of, and we know how it operates, but good luck actually describing the thing.

    2. Re:All Jokes Aside... Still No. by Clarious · · Score: 4, Interesting

      Think of it as solving a multiobjective optimization problem using heuristic algorithm/machine learning. You can't solve the congestion problem completely as it is computionally infeasible, now they just use machine learning to find the (supposedly) optimal solution. Read TFA, it is quite interesting, I wonder if we can apply that to Linux writeback algo to avoid the current latency problem (trying copying 8 Gb of data into a slow storage medium such as SD card or USB flash, prepare for 15+ seconds stalls!), the underlying is the same anyway.

    3. Re:All Jokes Aside... Still No. by Clarious · · Score: 4, Insightful

      A bit offtopic, roughtly 10 years ago I came to /. and was amazed by the technological insight/information in the comments here. And now more than half of the comments are jokes about skynet without any insight of understanding what TFA is about. Of course, informative posts still can be found often, but slashdot has fallen quite low...

    4. Re:All Jokes Aside... Still No. by techhead79 · · Score: 5, Insightful

      we don't have to know WHY it works better, just that it does and how to build a working copy

      But the fact that it does work better means we're either missing a part of the picture that is obviously important or the AI version is leveraging quirks with the system that no current model we have represents. I'm shocked to read that anyone would be comfortable just ignoring the why of something just so we can progress beyond our understanding. If we don't understand the why then we're missing something very important that could lead to breakthroughs in many other areas. Do not let go of the curiosity that got us here to begin with.

    5. Re:All Jokes Aside... Still No. by blankinthefill · · Score: 5, Insightful

      I don't think they just drop the questions and run with it. I'm pretty sure that, when we don't understand how things that are useful work, we just implement them... and study them at the same time. I guarantee you that SOMEONE, at least, is studying why an AI antenna works better than our man-designed ones, and they're doing it for the very reasons that you mention. But I think the point the GP was trying to get at is that we've never let out ability to not understand things hinder our adoption of those very things in the past, and as long as we have good evidence that this thing performs correctly, and we can replicate it, then why wouldn't we use it at the same time we study it?

    6. Re:All Jokes Aside... Still No. by Ichijo · · Score: 4, Interesting

      So we built a computer that figured out the answer. Now we just need to build an even bigger computer to figure out the question!

      --
      Any sufficiently unpopular but cohesive argument is indistinguishable from trolling.
    7. Re:All Jokes Aside... Still No. by Animats · · Score: 5, Interesting

      One of the reasons it's so hard to make changes is that nobody really knows why the internet works.

      We still don't know how to deal with congestion in the middle of a pure datagram network. The Internet works because last-mile congestion is worse than backbone congestion. If you have a backbone switch today with more traffic coming in than the output links can handle, the switch is unlikely to do anything intelligent about which packets to drop. Fortunately, fiber optic links are cheap enough that the backbone can be over-provisioned.

      The problem with this is video over the Internet. Netflix is a third of peak Internet traffic. Netflix plus Youtube is about half of Internet traffic during prime time. This is creating demand for more and more bandwidth to home links. Fortunately the backbone companies are keeping up. Where there's been backbone trouble, it's been more political than technical. It also helps that there are so few big sources. Those sources are handled as special cases. Most of the bandwidth used today is one-to-many. That can be handled. If everybody was making HDTV video calls, we'd have a real problem.

      (I was involved with Internet congestion control from 1983-1986, and the big worry was congestion in the middle of the network. The ARPANET backbone links were 56Kb/s. Leased lines typically maxed out at 9600 baud. Backbone congestion was a big deal back then. This is partly why TCP was designed to avoid it at all costs.)

    8. Re:All Jokes Aside... Still No. by jkflying · · Score: 5, Informative

      It's not that we don't understand *why* something like a genetic-algorithm designed antenna works so well. We can evaluate its performance using Maxwell's equations and say, "Yes, it works well." without ever having to build the thing. What we don't have is a set of guidelines or 'rules of thumb' that can result in an antenna design that works just as well.

      The difference is that the computer evaluates a billion antennas for us, doing some sort of high-dimensional genetic optimisation on the various lengths of the antenna components. It doesn't 'understand' why it gets the results it does. We do, because we understand Maxwell's equations and we understand how genetic optimisation works. But Maxwell's equations only work for evaluating a design, not for giving a tweak which will improve it. And we're dealing with too many variables that are unknown to have a closed-form solution.

      As for this algorithm, they basically did the same thing. They defined a fitness function and then repeatedly varied the code they were using to find the best sequence of code. However, unlike the antenna analogy, they used actual equipment to evaluate the fitness function, not just a model. This means that they don't have an accurate model, which means that your complaint that we don't know why this works is entirely valid, and the antenna analogy is not =)

      --
      Help I am stuck in a signature factory!
    9. Re:All Jokes Aside... Still No. by seandiggity · · Score: 5, Interesting

      We should keep investigating why it works but, to be fair, the history of communications is implementing tech before we understand it (e.g. the first trans-Atlantic cable, implemented before we understood wave-particle duality, and therefore couldn't troubleshoot it well when it broke).

      Let's not forget this important quote: "I frame no hypotheses; for whatever is not deduced from the phenomena is to be called a hypothesis; and hypotheses, whether metaphysical or physical, whether of occult qualities or mechanical, have no place in experimental philosophy."

      ...that's Isaac Newton telling us, "I can explain the effects of gravity but I have no clue WTF it is."

      --
      Geeks like to think that they can ignore politics, you can leave politics alone, but politics won't leave you alone.-rms
    10. Re:All Jokes Aside... Still No. by Daniel+Dvorkin · · Score: 4, Interesting

      I'm shocked to read that anyone would be comfortable just ignoring the why of something just so we can progress beyond our understanding.

      If you insist that we know why something works before we make use of it, you're discarding a large portion of engineering. We're still nowhere near a complete understanding of the laws of physics, and yet we make machines that operate quite nicely according to the laws we do know (or at least, of which we have reasonable approximations). The same goes for the relationship between medicine and basic biology, and probably for lots of other stuff as well.

      If we don't understand the why then we're missing something very important that could lead to breakthroughs in many other areas. Do not let go of the curiosity that got us here to begin with.

      I don't think anyone's talking about letting go of the curiosity. They're not saying, "It works, let's just accept that and move on," but rather, "It works, and we might as well make use of it while we're trying to understand it." Or, from TFA: "Remy's algorithms have more than 150 rules, and will need to be reverse-engineered to figure out how and why they work. We suspect that there is considerable benefit to being able to combine window-based congestion control with pacing where appropriate, but we'll have to trace through these things as they execute to really understand why they work."

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    11. Re:All Jokes Aside... Still No. by Rockoon · · Score: 4, Informative

      You should go talk to Intel or AMD about your opinions on the matter, because I assure you that the specific layout of their chips is based on machine learning algorithms. No human can realistically optimize circuits containing a billion transistors.

      As a matter of fact, I recall genetic algorithms being thrown at rather small circuit design problems and producing solutions that were better than any human had come up with. Ah yes, here it is: Sorting networks and the END search algorithm.

      -- "Even a 25-year old result for the upper bound on the number of comparators for the 13-input case has been improved by one comparator"

      --
      "His name was James Damore."
    12. Re:All Jokes Aside... Still No. by Immerman · · Score: 4, Interesting

      I have heard claims along that line - something like one of the protective layers was effectively thermite? There seem to be as many theories as there are people making them, but it's hard to argue that the hydrogen wasn't at least an added accelerant.

      Personally I blame the television crews for the real disaster. Without them it would've just been a newspaper story about a German airship burning up and killing some people. With the dramatic visuals though it was the death-nell of the airship industry, for no good reason. The sinking of the Titanic was at least as big a disaster and had negligible effect on the oceanliner industry.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
  4. err, can you walk me through it? by tloh · · Score: 4, Insightful

    they admit they don't know why the system works

    I'm guessing the next big revolution in AI is the quest to figure out how to get digital problem solvers to teach us meat heads how they actually figured this stuff out.

    --
    Stay sentient. Don't drink bad milk.
  5. Re:For a second there by cold+fjord · · Score: 5, Informative
    --
    much of left-wing thought is a kind of playing with fire by people who don't even know that fire is hot - George Orwell
  6. Re:Headline epic fails. by Lord_Naikon · · Score: 5, Interesting

    Huh? Did you read the same article as I did? As far as I can tell, the article is about a TCP congestion control algorithm, which runs on both endpoints of the connection, and has nothing to do with QoS on intermediate routers. The algorithm generates a set of rules based on three parameters resulting in a set of actions to take like increasing advertised receive window and tx rate control. The result of which is a vastly improved total network throughput (and lower latency) without changing the network itself.

    I fail to see the relevance of predictive/adaptive caching. It isn't even mentioned in the article.

  7. Come on now by dbIII · · Score: 5, Insightful

    As complex systems goes there are far worse. Go ask an engineer or a scientist.

    1. Re:Come on now by Daniel+Dvorkin · · Score: 5, Interesting

      As complex systems goes there are far worse. Go ask an engineer or a scientist.

      I am a scientist--specifically, a bioinformaticist, which means I try to build mathematical and computational models of processes in living organisms, which are kind of the canonical example of complex systems. And I will cheerfully admit that the internet, taken as a whole, is at least as complex as anything I deal with.

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  8. Re:The blurb is flat out wrong. by paithuk · · Score: 4, Informative

    The blurb says it "redesigns TCP/IP", and the article itself specifically says "congestion control". Which is NOT part of TCP/IP design. Congestion control is a routing feature.

    Seriously, it's both incredible how wrong you are with that statement and that somebody rated it as informative. I suggest you read up on the subject: http://en.wikipedia.org/wiki/TCP_congestion_avoidance_algorithm http://en.wikipedia.org/wiki/Congestion_window http://tools.ietf.org/html/rfc5681