Slashdot Mirror


A New Way to Look at Networking

Van Jacobson gave a Google Tech Talk on some of his ideas of how a modern, global network could work more effectively, and with more trust in the data which changes many hands on its journey to its final destination. Watch the talk on Google's site The man is very smart and his ideas are fascinating. He has the experience and knowledge to see the big picture and what can be done to solve some of the new problems we have. He starts with the beginning of the phone networks and then goes on to briefly explain the origins of the ARPAnet and its evolution into the Internet we use today. He explains the problems that were faced while using the phone networks for data, and how they were solved by realizing that a new problem had risen and needed a new, different solution. He then goes to explain how the Internet has changed significantly from the time it started off in research centres, schools, and government offices into what it is today (lots of identical bytes being redundantly pushed to many consumers, where broadcast would be more appropriate and efficient).

90 comments

  1. One to many. by Anonymous Coward · · Score: 0, Informative

    "He then goes to explain how the Internet has changed significantly from the time it started off in research centres, schools, and government offices into what it is today (lots of identical bytes being redundantly pushed to many consumers, where broadcast would be more appropriate and efficient)."

    Multicast.

  2. 8 months ago by Lars+T. · · Score: 1, Informative

    The talk was held on Aug 30, 2006.

    --

    Lars T.

    To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    1. Re:8 months ago by MarkByers · · Score: 3, Funny

      8 months old?! Shame. I guess that most of the information about the history of the telephone network is out-of-date already.

      --
      I'll probably be modded down for this...
    2. Re:8 months ago by eugene+ts+wong · · Score: 2, Insightful

      8 months seems pretty new to me. I notice that many of our discussions seem to focus on 1984. Wake up people! A lot has happened since then, and now it's a brave new world.

    3. Re:8 months ago by AKELA501 · · Score: 1

      Thanks for submitting your article! For me it has given me a much broader view of what is meant by the phrase "Microsoft Windows Beyond". I'm a tad bit slow at this so if I make my way back here again I'll do my utmost best to stay on the stick at catching up. Try and affix a message to My TechNet page to bring me up to speed. Poss Unknown/?+nt9 EnterpriseDB Advanced Server is a relational database management system (RDBMS)

  3. eh... by djupedal · · Score: 1

    "...where broadcast would be more appropriate and efficient"

    If this means airwaves, same as TV, sure. Why not, since the whole thing is one big info-mercial swamp already. Otherwise, it also means guaranteed next -packet delivery, without any pauses, resends, spinning cursors. And the internet is not going to deliver that, sorry.

  4. I'm confused.. by Nodamnnicknamesavial · · Score: 2, Informative

    How is this anything new? Everything in that summary was already covered by Tanenbaum in his excellent book on networks - of course it's easier to hear it from someone if you're too lazy to read :p

    --
    I have spoken'eth.
  5. MODS ON CRACK by Anonymous Coward · · Score: 0

    How is the 3rd comment redundant?

    Idiot.

  6. Multicasting on a segmented network by charnov · · Score: 4, Insightful

    There is no reason you can't multicast across a large segmented network, i.e. the internet, and get good delivery. Radio, television, audio, phone, movies are all latency sensitive but not particularly bit sensitive so you can drop some packets here and there. That also means that some things would need QoS (VoIP) while others would need intelligent caching and buffering (movies, etc.).

    --
    [RIAA] says its concern is artists. That's true, in just the sense that a cattle rancher is concerned about its cattle.
  7. Decline of text by Anonymous Coward · · Score: 0
    Speaking of laziness and reading... I'm too lazy to watch these long talks as videos. I might have checked an article on this topic, but I'm certainly not going to watch a one and a half hour video.

    So stop posting videos! I wish we still had those 14kbps modems...

    1. Re:Decline of text by iminplaya · · Score: 1

      You know I have to agree with that. I wish there was a text transcript. I won't sit through a video either. It's so obnoxious. I quit watching news on the TV for that reason. I don't want to sit through twenty minutes of tripe to hear an interesting story or wait for the weather segment. Text is the way...maybe with a pretty picture or two.

      --
      What?
    2. Re:Decline of text by Peaker · · Score: 5, Informative

      I thought I was going to skim through that video when I first saw it a while ago.
      Then I started watching, and at some point noticed I watched the whole thing, without skipping anything.
      I think he gives a good talk, and it kept me interested the whole way.

      Its a very nice insight he has there, too bad it flies way over Slashdotters head (well, its just that almost all of them probably didn't even read the whole thing).

      By the way, I summarized his ideas (as I understood them, which may not be the same as he explained them).

    3. Re:Decline of text by iminplaya · · Score: 0, Offtopic

      Thanks. I saved the page for later. I also had another reason not to watch the video. I seem to be suffering some malfunction with the flash player in Mozilla that I don't feel like dealing with. And I need the "Google Player" to see it if I download it? No thank you. Make it more generic so I don't have to load more third party junk onto my machine. I wish Slashdot wouldn't link to so much ad ridden and proprietary garbage, but I guess the money has to flow.

      --
      What?
    4. Re:Decline of text by jp10558 · · Score: 1

      Wait, isn't this basically what freenet 0.5 was doing? I have no idea how much of the issue with that was the anonyminity part, and how much was the distributed server part, but it was painfully slow. Maybe if everyone was using it it would be faster...

      But doesn't this have several major issues?

      1) all the freenet problems - that is:

      1a)what if people don't want to share bandwidth or just some specific content? At one end, you have the freenet solution where you either share or don't use it (with that driving people away), at the other, there is another huge administrative issue for anyone who wants to manage what they share (which also will serve to drive people away).

      1b) slow as hell - even non anonymous protocols that handle the search like gnutella are pretty slow and finding stuff, and getting the actual transfer setup. bittorrent is slightly better, but there's still the overhead time of the out of band/protocol search. Metalinks might solve this however. I just worry that for most data, by the time a torrent got started, you'll have loaded the page over HTTP... Unless he's just suggesting squid style proxies, which have all of their own caching problems.

      2) Do people want to share their resources? Sometimes, but not everyone.

      3) Why is copyright considered a bogus question? I'm sorry, but with the current way the MAFIAA is working, I can already smell the lawsuits.

      --
      Opera, Proxomitron-Grypen,GPG 0x0A1C6EE3
    5. Re:Decline of text by Peaker · · Score: 1

      1a)what if people don't want to share bandwidth or just some specific content? At one end, you have the freenet solution where you either share or don't use it (with that driving people away), at the other, there is another huge administrative issue for anyone who wants to manage what they share (which also will serve to drive people away).

      The idea is that the "network stack" would implement this not just in end-points but in the network infrastructure as well.
      Routers and peers alike.

      Today you already have questions such as: What if somebody will modify his TCP stack to use more aggressive retransmitting and waste everybody's bandwidth?

      There's already an issue of being a "good netizen" and this makes it a bit more of an issue.

      Routers/etc are already trusted to be good netizens and are paid to be so.

      1b) slow as hell - even non anonymous protocols that handle the search like gnutella are pretty slow and finding stuff, and getting the actual transfer setup. bittorrent is slightly better, but there's still the overhead time of the out of band/protocol search. Metalinks might solve this however. I just worry that for most data, by the time a torrent got started, you'll have loaded the page over HTTP... Unless he's just suggesting squid style proxies, which have all of their own caching problems.

      If I didn't make it clear in my summary, he does make it clear: He is not proposing "caching". Caching is merely an approximation for what he is suggesting. "Searches" would still be the domain of search engines and such. The mapping between "keywords" or "generic names" and a specific "label" (which is a uniquely identified immutable piece of data you can get from anywhere) can be done by querying central search servers, asking nearby routers, or a "domain name server" equivalent. It does not have to be a "search network" like gnutella. The only thing it must be, is crypto-signed to verify the link between the query and the data is indeed done by those who made the data, or someone you trust with categorizing it.

      2) Do people want to share their resources? Sometimes, but not everyone.

      See reply about good netizens.

      3) Why is copyright considered a bogus question? I'm sorry, but with the current way the MAFIAA is working, I can already smell the lawsuits.

      The "Bogus questions" is simply my addition (for purposes of the Enough project), and wasn't at all mentioned in the talk.

      I believe it is bogus because it is legal and not technical, and because routers already duplicate information. If they also store it, its a mere technical difference.
  8. old news by Anonymous Coward · · Score: 0

    This was something we watched in the networking class last semester (fall 2006). Very interesting though.

  9. Internet is not TV by Dun+Malg · · Score: 4, Insightful

    "(lots of identical bytes being redundantly pushed to many consumers, where broadcast would be more appropriate and efficient)"

    The first part is true, but does not necessarily lead to the conclusion in the second. There is a huge, very important IF that belongs between them. Specifically, "if the recipients are all prepared to receive those bytes at the same time". The problem with the conclusion is that the evaluation of the "if" part is nearly always "they're not". This is yet another case of "if the internet were like television, it'd be more efficient". Yes, but it would then no longer be the internet people like. The great promise of the internet is information on demand. All this bullcrap about broadcast, push, and the like, it's all the efforts of 20th century throwbacks trying to fit the internet into their outdated worldview of "producers" and "consumers". They need to quit it. Broadcast is a square peg and the internet is a round hole. Every time anyone suggests putting the two together, they simply look like a bloody idiot.

    --
    If a job's not worth doing, it's not worth doing right.
    1. Re:Internet is not TV by linvir · · Score: 2, Interesting

      Just because he says broadcast would be more efficient, doesn't mean that he thinks we should go back to television. Believe it or not, Van Jacobson isn't a 20th century throwback shill for Big Media, or a bloody idiot.

    2. Re:Internet is not TV by rthille · · Score: 1


      I haven't watched the video (hey, this is slashdot), but a system which required me to have ~500GB of local cache wouldn't be out of the question for me. My pipe isn't too big because of where I live, but I've got plenty of storage. If I could keep that pipe full all the time (basically by having my system automatically receive stuff I'm likely to be interested in), that could work.

      On the other hand, given how oversold the networks are, it definitely would have to be broadcast/multicast based.

      --
      Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
    3. Re:Internet is not TV by Yvanhoe · · Score: 3, Insightful

      Internet is information on demand, but given a large amount of demands, some of the demands are redundant. For instance, it would make a lot of sense for a local ISP to cache the google homepage. Also, when making a modification on said homepage, it would make sense for Google to broadcast a signal to all ISPs to update their caches, or even to broadcast the new homepage to everyone. It is even more interesting in the case of the homepage of news websites.

      I think that in order to see the benefits of the broadcasting of data, you have to take the ISPs and service providers point of vie, not the final user's. Today, the ISP transmit every request from their users to the service provider, and the service providers answer to each user request. In the case of a dynamic web like online shops or search engines, there are no alternatives. But in the case of semi-static websites like news sites, having a system of cache synchronized at the ISP level thanks to a regular broadcast from the server can actually save a lot of bandwidth to the ISP and the service provider.
      Remember the problem slashdot had with softwares like NewsTicker when it first provided a RSS feed. This is the kind of problems this wants to solve if I understand correctly.

      Disclaimer : I didn't watch the one-hour long video with no transcript. Give me a text and save this bandwidth already, dammit !

      --
      The Wise adapts himself to the world. The Fool adapts the world to himself. Therefore, all progress depends on the Fool.
    4. Re:Internet is not TV by MarkWatson · · Score: 1

      Watch the last 3 minutes: he is answering a question on multicast: why local multicast is sufficient: global UDP multicast is impossible, but not necessary; ideas for local multicast: multicast at room level, enterprise level, etc. I have always liked UDP for some applications (e.g., broadcasting distributed game or VR data), but I always used it locally.

    5. Re:Internet is not TV by PatrickThomson · · Score: 2, Informative

      I run an IRC server of sorts, and over 90% of my outgoing bandwidth bill is due to identical information being sent at the same time to many clients. Not a day goes by I don't wish there was some sort of error-correcting multicast protocol.

      --
      I am one of many. My idea is not unique, nor do I expect my voice alone to sway you. I speak in a chorus of opinion.
    6. Re:Internet is not TV by boteeka · · Score: 3, Interesting

      if the recipients are all prepared to receive those bytes at the same time

      You are totally right that the recipients will want those bytes at different times, BUT this is not like current television or radio broadcasts, where once broad casted they no longer exist. The data which is intended to be broad casted (in the example, an Olympic event) is stored at the same time it is broad casted and not only by the producer or creator of the data, but by every other party (or at least the proxy-like ones) who is receiving the data. It's a bit like BitTorrent: the more interested in getting the data, the more available sources to get from as the transfer progresses in time. It's kind of like cashing today: for some amount of time the data chunks will get cashed on multiple locations (where they go trough) and this way the data itself "lives" longer. The more clients request the data, the data itself will be available from multiple locations, and you get it faster when you ask for it.

      And if you're thinking that this way there will be stored redundant information on a horrible amount of locations and this means an even more horrible quantity of stored data, then you're wrong. This way the cached data remains in cache only if there is interest for it, and if there isn't then after some time it gets replaced by something new, which has a much more interest for.

      This whole idea is very similar to what BitTorrent is, just even more decentralized, and in order to work the needed protocols and tools need to be implemented.
      And the good thing is this whole stuff could run on the back of existing TCP/IP architecture.

    7. Re:Internet is not TV by samjensen · · Score: 1

      Note that neither him nor myself said that broadcasting all bits was the ideal solution. In retrospect I probably shouldn't have mentioned the word at all.

      --
      this space intentionally left blank
    8. Re:Internet is not TV by Tim+Locke · · Score: 1

      Seeing as you posted 45 minutes after the Slashdot story was posted, and the video is an hour and 21 minutes long, may I assume you didn't watch the whole thing?

      Your concern regarding everyone accessing the data at different times is dealt with in the video, and a lot of other interesting ideas are suggested as well.

      --
      *** On the Internet, no one knows you're using a VIC-20
    9. Re:Internet is not TV by RajivSLK · · Score: 1

      ISP to cache the google homepage. Also, when making a modification on said homepage, it would make sense for Google to broadcast a signal to all ISPs to update their caches, or even to broadcast the new homepage to everyone.

      Congratulations, your just invented the proxy! Yay! And it doesn't make sense because the "popular part" of the web is not static anymore. Even google's simple homepage lets me sign in and customizes the page for ME. Most news sites do this as well. Google's logo on the orther hand is worth caching and maybe the most popular youtube movie but proxies are already pretty good at caching static data so no need to reinvent the wheel there.

  10. Good ideas by MarkWatson · · Score: 3, Interesting

    I 'browsed' some of this video and book marked it for later: Van Jacobson's background is awesome.

    A bit off topic, but there are two things that I want to see happen: a complete upgrade to IPv6 and the creation of an alternative 'public Internet' based on emerging long distance wifi and software that lets people volunteer to be part of this new open grid, and optionally share some bandwidth bridging the 'real' Internet.

    It may seem pointless to want both higher performance (multi-casting UDP, essentially infinite IP address space) and low performance and ad-hoc systems, but please consider: the UK and USA seem to be going down the wrong path of surveillance and citizen control, the Internet may someday be viewed as something that the public just should not have because it is too free a source of information. I hope that I am wrong about this, but this unpleasant possible repressive future is a possibility.

    1. Re:Good ideas by MarkWatson · · Score: 1

      Something particularly cool, starting around the 1 hour, 4 minute time index on the video: the idea of both naming data resources and versioning: when you "put something out there" on the internet, it is immutable, but you can supersede it with versioning, but older versions are still there - sort of making data on the Internet like our personal or work group subversion repositories.

    2. Re:Good ideas by LighterShadeOfBlack · · Score: 1

      It may seem pointless to want both higher performance (multi-casting UDP, essentially infinite IP address space) and low performance and ad-hoc systems, but please consider: the UK and USA seem to be going down the wrong path of surveillance and citizen control, the Internet may someday be viewed as something that the public just should not have because it is too free a source of information. I hope that I am wrong about this, but this unpleasant possible repressive future is a possibility. Although finding alternative methods of networking and sharing information is always a welcome step, ultimately the only thing that will stop the progression of Orwellian "security at all costs" government control is education and subsequent action by the general populace. Otherwise the second your ad-hoc network challenges the censored internet they'll be driving past your home with wi-fi scanners and knocking on your door shortly thereafter. More to the point, it shouldn't be necessary for such action. We'll never have truly free communication as long as the people believe it is their lot to try and work around their government's draconian measures instead of standing against them to stop it. I'm not quite advocating blowing up parliament just yet but it would be nice if people at least realised that they can do more with their voices then just complain to each other about the state of our country.
      --
      Spelling mistakes, grammatical errors, and stupid comments are intentional.
    3. Re:Good ideas by MarkWatson · · Score: 1

      Good point on the ease of censorship of an aletrnative Internet. Thanks.

      I disagree with last second point however: I try to find good sources of information to share with people I know that contradict the spin that we see on the news. A small effect, but enough people take the effort it is effective.

      I also make it a habit to frequently contact my elected representatives for both things that they do that I don't like, and even the rare compliment when they get something right :-)

  11. MODS on CRACK by Anonymous Coward · · Score: 0

    Of all things, how the hell is this post redundant?

  12. bittorrent by mangu · · Score: 1
    Broadcast is a square peg and the internet is a round hole. Every time anyone suggests putting the two together, they simply look like a bloody idiot


    I think bittorrent is the internet answer to the broadcast problem. Bittorrent is intrinsically adapted to the way the internet works. Data which is most sought by people will be found on more nodes around the net, less popular data can be downloaded directly from the primary servers.
     

  13. Broadcasting by iminplaya · · Score: 1

    This is something that's been on my mind for a long time. In fact, I thought that's how streaming was done because I couldn't understand why the load would increase so much as more people watched. This should be especially true for internet radio/TV, and for ads that really suck up the bandwidth, which why I block them (the ads that is). I didn't know that each stream was being fed to only one viewer/listener. It always seemed kind of odd to do it that way.

    --
    What?
  14. Van Jacobson's quotes by diegocgteleline.es · · Score: 4, Interesting

    Based on all the measurements I'm aware of, Linux has the fastest & most complete stack of any OS (source)

  15. the closed captions are hysterical by Anonymous Coward · · Score: 0

    The talk is 1 h 21 m long. It's actually quite fascinating, Van Jacobson is teh bomb.

    But a few things caught my OCD in the closed caption stream ...

    awk => oc
    combinatorics => common torch

    The whole time I'm watching this thing, I'm thinking to myself, why doesn't that girl at the front of the audience just shut up and listen? She's gesturing wildly to her neighbor and not paying attention at all! Then the revelation hits me, she's signing for the hearing impaired. Pass the "insensitive clod" hat, I'm out ...

    1. Re:the closed captions are hysterical by Leebert · · Score: 1

      You missed fart => part. (Yes, I'm serious, at 7 minutes).

    2. Re:the closed captions are hysterical by Anonymous Coward · · Score: 0

      There are a lot of small errors, including interpunction that doesn't make sense. It doesn't read like a human transcription -- a human would make different mistakes. But if this was done by speech recognition software, it's pretty impressing. Does anybody know how Google makes these subtitles?

  16. I Remember Multicast by Anonymous Coward · · Score: 0

    It was to be the next big thing in the 90's. I think it was just before push technology that everyone was talking about multicast turning our PC's into TV's with "high quality" video through dial-up speeds.

  17. Someone educate me please. by The+Living+Fractal · · Score: 2, Insightful

    lots of identical bytes being redundantly pushed to many consumers, where broadcast would be more appropriate and efficient

    So, sending identical packets to everyone is somehow more bandwidth efficient than sending packets to only those who want them? Doesn't that seem backwards to anyone else? Furthermore, couldn't you define broadcasting as precisely the act of sending identical bytes to many consumers?! I'm teh confused.

    TLF
    --
    I do not respond to cowards. Especially anonymous ones.
    1. Re:Someone educate me please. by TeamSPAM · · Score: 3, Informative

      I think he's trying to push the internet into a bittorrent/usenet type of model. Instead of everyone grabbing a copy from the original server and eating up the bandwidth on the major backbones, we get the information from a more local server that have a cached copy. I believe from an ISP level, he's trying to reduce the WAN usage and keep things on the LAN. To an extent I think ISP are favoring this already, bittorrent is kinda frowned on, but they allow you download tv shows off the usenet server with a nzb file.

      --
      Brought to you by Team SPAM! where we believe: "Information in the noise!"
    2. Re:Someone educate me please. by bas.westerbaan · · Score: 1

      It's indeed a bit like P2P but then at the IP nodes level: normal P2P doesn't reduce traffic, it shares it amongst peers. I believe he wants to get rid of the ISP altogether. The problem, obiously, is how to finance it all and enforce people to store others' data.

    3. Re:Someone educate me please. by francium+de+neobie · · Score: 1

      Broadcasting makes sense when you're talking about wireless networks instead of wired networks. Isn't everybody getting wireless these days?

    4. Re:Someone educate me please. by kripkenstein · · Score: 1

      I think he's trying to push the internet into a bittorrent/usenet type of model.
      Yeah, this was sort of my reaction as I watched the talk - it does sound like Bittorrent. Basically instead of everyone fetching a page from a server, the server acts like a sort of 'permanent seed', while the data can be had from any location on the 'net where the data is located. Then it is hash-checked, and so forth. Quite like Bittorrent, but with a single worldwide tracker for everything... ok, perhaps I'm taking the analogy too far. Anyhow, an interesting talk.

      P.S. Am I the only one annoyed by the quality of subtitles? Not that I need to read them, but when people talk at normal speed I tend to read them anyhow out of boredom. Anyhow, the subtitles were apparently written by an uneducated nongeek, judging by the errors.
  18. Re:Hahaha Funny by ribuck · · Score: 0

    Well yes, he does.

    Also at 7:01 he says "I'm an old fart" but the subtitle gives this as "I'm an old part".

    Who cares?

  19. Transcript? by ManiaX+Killerian · · Score: 1

    Is there a transcript of the video available (e.g. just the subtitles pulled out)? It's a bit tedious watching this when reading it will take 1/10th of the time of the video...

  20. lots of identical bytes being redundantly pushed by dominious · · Score: 1

    haven't we solved that with proxies?

  21. What the heck? by fredrated · · Score: 0, Offtopic

    FTA: and with more trust in the data which changes many hands on its journey to its final destination

    I hope it doesn't change my hands!

  22. Re:!!!! by giorgosts · · Score: 1

    Everything he says applies to server>client. Producers>consumers. And he proposes a change to the current model of conversation to a model of multicast the same data to many consumers. And support this by findings that 99% of data is structured that way. I guess that's wishful thinking though, because he works at google, the massive meta-producer of data we all consume.

    What about bittorrent though? Uses the same TCP/IP protocol, trusts the data, not the source (like he says we must do), answers to the contemporary question "Who has the time" (ditto) AND DELIVERS.

    They are also other decentralization options to bittorrent, like trackerless torrents (DHT) and torrentless torrents (Magnet links)

    Ditto there are implementations for p2p phones, p2p tv, etc.

    I think google and other big companies are just pushing a way to change the Internet to suit their business models and nothing more.

  23. 3 hours ago by Nullav · · Score: 1

    The summary was posted on May 06, 2007 at 08:20.

    --
    I just read Slashdot for the articles.
  24. Re:!!!! by kubalaa · · Score: 2, Informative

    Everything he says applies to server>client. Producers>consumers.
    On the contrary, it sounds to me like he's describing an egalitarian network where anybody who is connected to the internet can inject data into it with very little hosting overhead (because the data will be cached inside the network).

    What about bittorrent though?
    The first question at the end of the presentation was, what does he think of Bittorrent? And his problems with it are:
    1. Only people actively downloading or seeding content are available to redistribute it, so there's a built-in time-dependence which makes in unsuitable for small pieces of data.
    2. It has no knowledge of topology, so it can't take advantage of topology to cache data closer to endpoints.
    I wish someone had asked about freenet, since that seems much closer to what he describes.
    --

    "If you look 'round the table and can't tell who the sucker is, it's you." -- Quiz Show

  25. Freenet? by Anonymous Coward · · Score: 0

    The architecture he describes is reminds me surprisingly closely to freenet.... other than the ability to do really short term routing, and any sort of 'fairness' built into the network (he talks about nodes that offer more connectivity or storage should somehow be given priority so that there's a feedback loop that encourages people to share more resources)

    I wonder if he has an answer for the latency involved in the freenet implementation... ...does anyone have an answer?

  26. Item potent, oc, you hear? by Handyman · · Score: 1

    Wow, some great subtitling here. The guy programs in oc and apparently speaks about item potent data packets...

    1. Re:Item potent, oc, you hear? by Handyman · · Score: 1

      More gems:

      ...if I'm connected to SourceForge.net, then the version of nome that I'm pulling over is the most recent nome, because...

      And if it did try distributing trackers, you'd be in the Nutella world, where...

    2. Re:Item potent, oc, you hear? by oh_the_humanity · · Score: 1

      There were tons of errors and a few times the sub titler just used a - because they had no idea what he said. The person doing the subtitles was definitely not a networker. Like SHA1 , they totally skipped in the subtitles.

      --
      "When they invent bitch slaps that can go through a monitor you better f'ing duck" --deft (253558)
    3. Re:Item potent, oc, you hear? by nharmon · · Score: 1

      I liked this one:

      "When Copernicus first wrote his paper on planetary motions, the predictions that he gave were really crummy, compared to the tomaic[...] predictions"

      Somebody forgot the 'p'. :-)

  27. Superb talk: "data dissemination" not mcast/cache by Morgaine · · Score: 5, Informative

    I enjoyed this talk very much. It was more than just a statement of Van Jacobson's thoughts on data dissemination. It showed his analysis of the relationship between infrastructure and application across two generations of networking, and it pointed out very nicely why it's time now for phase 3: we've moved our usage goalposts compared to when the IP network was designed. Great stuff, and I agree completely.

    The article submitter didn't seem to "get" what Van Jacobson was saying though, as the talk had almost nothing to do with broadcasting or multicasting. Indeed, Van Jacobson actually pointed out why multicasting and broadcasting were inappropriate in most situations in this new world (they carry implicit time sync), so only use them as accelerators on LANs or in other special cases. The slightly wrong article description may have misdirected some of the posts here since not everybody reads TFA, and even fewer sit through an extended talk. It wasn't about broadcast or multicast at all, except in passing. :-)

    Maybe it'll help to summarize his thrust briefly.

    What he said was that the network underneath doesn't actually matter, and that the wires and fibre underneath don't actually matter either -- TCP/IP has abstracted away from them. However, the client-server model on which TCP/IP is based is no longer strictly relevant either, because it is founded on a somewhat obsolete concept, the "conversation". The vast bulk of our Internet traffic is no longer "conversations", but "data dissemination" (the migration of identified data objects from place to place), and actual conversations are just a special case of that.

    Data dissemination is utterly different to conversation as a communications paradigm, and that's what he's getting at. Fully identified, self-validating items of data as discrete entities are really where our focus needs to be, and how they get to us is rather immaterial, or abstracted away. *Where* they come from (ie. the actual server to which we connect) is quite immaterial too --- getting it from a passing plane would be as good as from a known server, when you can rely on data identity. Furthermore, if the data items were fully self-descriptive then many of the current problems like spam would go away as well. What's more, the nodes of the network would be able to work more intelligently too (and hence efficiently), if they were aware of data identity rather than just treat everything as a conversation.

    That's a very brief summary and can't hope to do the talk justice. Go listen! He's dead right. :-)

    --
    "The question of whether machines can think is no more interesting than [] whether submarines can swim" - Dijkstra
  28. mod parent up! by enjahova · · Score: 3, Informative

    You give a pretty good short summary of a long and interesting talk.

    One thing I pulled most out of it was the analogy to 60s and 70s networking and how it is only after technology has been adopted that we see what its used for.

    When the telephone was invented Bell didn't know what it would be used for, its a strange concept but he really didn't know what a "phone call" was. He just knew he could transmit voice. Not only that but you had to have wires to connect people, so there was this very expensive business of putting wires everywhere. What happened was that people used those wires to make conversations. To establish a conversation you had to have a path between two nodes. This encouraged a monopoly because the best known way to make paths was to have control of all the wires.
    When the idea of what TCP/IP was to become was introduced people thought it was lunacy. What they were proposing was adding all this crap onto your data to explicitly name your destination so that it could travel any path to get to its conversation partner. All the networking researchers didn't get it because they already had implicit addresses by way of making the path. Turns out that the supposed innefficiency solved several problems simply by construction. Being able to take any path meant not caring about the underlying topology.

    What Van Jacobsen is proposing is another abstraction. Essentially adding another layer of "crap" that will allow us to ignore the underlying network. He mentions how several technologies are working towards these ends to some degree like bittorrent and akami CDN, but I think he is advocating for something like a new protocol. This new protocol would then end up solving some of our current problems simply by construction. Broadcast and one-to-one will become the same thing. Whether you are sending a secure email (pgp signed and named) or downloading the front page of the nytimes you could rely on the nature of the new protocol to deliver you authentic data, no matter where it comes from.

    Personally I think its genius, I'd like to follow the progress of such a protocol if it exists. I just got done watching the talk so I'll be googling around for a little I suppose.

    --
    "how can they call it a MINE if everything here is THEIRS?!?!" -Straight Jacket
    1. Re:mod parent up! by owlstead · · Score: 1

      "Personally I think its genius, I'd like to follow the progress of such a protocol if it exists. I just got done watching the talk so I'll be googling around for a little I suppose."

      It's not a protocol. It's a bunch of ideas meant for researchers (or grad students in his talk). It's meant to get people to think about what they are doing - or, to be more precise - what their goal should be. He mentioned some protocols as examples of partial implementations of his ideas, maybe you could start there.

  29. Fairly interesting talk... by Kjella · · Score: 2, Insightful

    ...but the more he talked, the more it reminded me of some halfbreed between akamai and freenet.
    Basicly, he's speaking of named resources, that an URL would be key like KSKs in Freenet
    Content would self-verify, that's basicly CHKs in Freenet
    Then you need add security into it which pretty much amount to SSKs

    Only in his case, it wasn't talk about making the end nodes treat information this way but rather the core of internet, and it didn't involve anonymity. But the general idea was the same, to grab content from a persistant swarm of hosts who doesn't need a connection to the original source. Unfortunately, most of the examples he gives are simply false, like the NY Times front page. If I want up-to-the-minute news everybody need to pull fresh copies off the original source all the time, reducing it down to a caching proxy. Any sort of user-specifc content, or interactive content won't work. For example take slashdot. I've got my reading preferences set up, which means my content isn't the same as yours. Also my front page contains a link to my home page, which is not the same as yours. Getting a posting form and making a comment wouldn't be possible. Making any kind of feedback like digg, youtube, article feedback etc. isn't possible. Counters wouldn't be possible. The only thing where it'd work is reasonably static fire-and-forget content, and even then there's the problem of knowing what junk to keep. Notice that when asked about BT he said that only worked for big files, so the idea is that everyone will have some datastore where they keep small files until someone needs them. The only good example is the Olympic broadcast, which is exactly the same content at exactly the same time. Oh wait, that's classic broadcast. Classic broadcast works best in a broadcast model? Who'd think that.

    --
    Live today, because you never know what tomorrow brings
    1. Re:Fairly interesting talk... by enjahova · · Score: 1

      I don't think his examples are false. You aren't looking at what this shift would mean. It essentially eliminates the idea of broadcaster AND one-to-one "conversations." Everybody would have the capability of broadcasting or sending to just one person. The data would be named and signed so as long as you trust the signature and you know what you are looking for you can get the data.
      So take nytimes for example, if you are looking for the uptodate information you could get it from anyone who has it. Just because you are imposing a time constraint doesn't mean you are outside the capabilities of the system. You may be the first person to download todays version of the times, but since its signed you can distribute it to others who can then trust you didnt' tamper with it. So what if it expires in one day? Its still more efficient. Even more so, if the content is very specific you aren't losing anything by this new protocol because it will act just like the underlying tcp/ip connection thats one-to-one. If you are the only one getting special content from slashdot, sure it will be just like it is now, but if everybody is loading the front page then it will be more efficient.
      It boils down to a bunch of networking "freebies." We would have efficient caching, data integrity and security along with all the old things we know and love like direct connecting AND broadcasting.

      When you abstract out you don't lose any of the original functionality, but if you do it right you gain extra. It's a really nice idea.

      --
      "how can they call it a MINE if everything here is THEIRS?!?!" -Straight Jacket
    2. Re:Fairly interesting talk... by aprilsound · · Score: 1

      I think you kind of missed the point. He first talked about how originally networking research tried to use the phone system as it was to deliver data. That didn't work so well because in the amount of time it takes to set up a phone call, you can send gigabits of data, so if your conversation is only ms long, it's horribly inefficient. What we needed was a more appropriate model, i.e. packet switched networks.

      Now today, over 90% of internet traffic is for named content. Yes there is interactive/conversational stuff like posting comments blogging, doing taxes, etc. but the rest is smei-static data. He is saying that, like when the internet was first starting, we need a new model for the new ways we've found to use the internet.

      He is not advocating throwing out the old conversational internet. We still have the phone system, and we still use it because it serves a specific purpose very well.

      The internet originally ran over the phone system as an overlay. Eventually the network grew to be more like the overlay so that it could be more efficient. Today we have a significant number of peer-to-peer overlays doing exactly what he is describing: treating user objects as the first class entity, not endpoints. The internet's architecture will have to change to accommodate the modern usage, which is data centric. The point is that the future of networking is data centric networks, so trying to 'replace' or 'update' TCP is just like trying to use phone calls to have extremely short conversations, simply not appropriate.

    3. Re:Fairly interesting talk... by schnablebg · · Score: 1

      The parent here does a good job explaining that VJ's model is a *superset* of the conversational model we have now. The GP, however, has a very good point that dynamic content relies on the conversational model.

      What is the next logical step then?

      We can break down content into these named, secure objects. I may contact /. to get the home page using the conversational model we are used to. The page I get back, however, can have my customized content wrapped around references to the common, static pieces. For example, the article summaries could simply be named references. If I'm on a college campus, there is a good chance that my neighbors have already downloaded (and cached) the article summary text, so I can retrieve that data from them (based on the object name), and my browser can plug it into the dynamic content that had to come from /. itself.

      It is sort of like taking the idea of a CDN and applying BT or Freenet democratization to it. I understand that this is a shift in perspective compared to tradition IP, but BT and Freenet have been successfully applied at the application layer of the stack. Why do we need new architecture? We can build an application that understands these named references and use that on top of what we already have.

    4. Re:Fairly interesting talk... by killfixx · · Score: 1

      "Any sort of user-specifc content, or interactive content won't work. For example take slashdot. I've got my reading preferences set up, which means my content isn't the same as yours."

      Remember...that's server side communications in a conversation based system...

      If I walk into a mall I don't have to buy everything...I pick what I want...

      --
      "Helping to keep you two steps ahead of the Thought Police!"
  30. Re:Superb talk: "data dissemination" not mcast/cac by btaylor · · Score: 2, Insightful

    I see one problem with his idea of ignoring where data comes from.

    Corporations make money by restricting access to information.

    It doesn't seem that it will be possible for them to continue to do that with this model, so I don't think any of this will come to pass any time in the near future.

  31. My personal take on this... by Ex-MislTech · · Score: 1

    1) Optical only routing at the backbone, and traffic monitoring to ensure Tier1's don't
    let major choke points choke due to not wanting to spend money on equipment,
    share the costs like a Co-op if necessary amongst Tier1's to keep the backbones
    and choke points scaled up. People who run trace routes see tier1 choke points now.

    2) In the Tier1 core make sure it is DWDM folding many layers of Sonet/ATM into
    a multi-channel frequency spectrum, at some point plan on phasing out asynchronous
    communications and go with on time in order delivery of packets, aka Sonet and its like protocols.
    *** VoIP/streams works best with synchronous***

    3) Each Metro Area Network has its own Squid box cluster for reducing repetitive traffic,
    filling all identical requests for data vs. pulling it long haul a 2nd time,
    alot of ISPs have their own squid like system now, but unify and optimize it.

    4) Create a spam registry, like ACLs for routers, don't blacklist by single IP,
    blacklist by ISP, then Inclusively activate by white list single IP's within that range.
    Offenders would essential end up shutoff from most of the world.

    5) Create a modified Torrent protocol that can multicast via VPN like hamachi.cc,
    so that a single peer can send the same data at 50kb to a million ppl,
    make it like tuning into net radio audio stream in progress picking up the
    data in progress, and getting the first to middle bits on the 2nd "play" of the data stream,
    set a threshold for the modified torrent protocol to switch to this multicast mode.
    for ppl with privacy concerns make it virtual ip's on a VPM network like hamachi.cc

    6) All monetary transaction sites require end to end encryption and authentication,
    and some means of client IDS that verifies the client isn't compromised,
    a cursory one would be patch level and malware/virii client acknowledgement.
    CERN could maintain a central footprint repository or another agency could
    do it, but have open peer review and input.
    Verifying legitimacy of all in memory processes would get rid of the obvious
    intruders, but the ones that can 'checksum' mimic known registered processes
    would be more difficult to detect. Verifying client integrity particularly for
    Windows OS would require some brilliance beyond what we have now.

    7) Zombie / IDS Net Spiders that roams the internet looking for compromised systems
    and notifying the owner, and the ISP, and reports it to all other ISP's and
    is added to a blacklist at ISP of origination. Systems routing to known
    zombie network reporting IP's or security threat sites would be placed on watch lists
    and client warnings of possibly being compromised.

    8) Global and Nation based NOC's that monitor for network issues,
    and traffic control system that allows truly functioning
    reroutes via higher layer routing protocols for alternate routes.

    Just a few ideas I had, some it may not be workable with what we have now.

    --
    google "32 trillion offshore needs IRS attention"
  32. Re:Superb talk: "data dissemination" not mcast/cac by The_DoubleU · · Score: 1

    The data contains all the information, so why not an authentication system. DRM for data. If you can validate if the data is from Company X then the data should be able to validate that you are allowed to see it. You still can have a copy of the data on your server, just can't use it.

    --
    What power has law where only money rules.
  33. I think he only addresses one part of the problem. by flyingrobots · · Score: 1

    I think the idea of having a fully transparent networking paradigm is what is of paramount importance to both data and software that manages that data. Adding application layer logic to routers that will effectively cause data I want (or need) to be available everywhere would (in my mind) restrict the things that we can do with the net. I'm a bottom up kind of guy.

    We need not only the data but the intelligence to manage that data spread around. I can't think of any news or information site that doesn't require direct interaction with the user of that data. I mean, take Slashdot for example, it tracks your usage of the site and gives you moderator points when you meet certain criteria. How is that going to work if somewhere along the line you don't have some kind of interaction with Slashdot's software? How does that get distributed as well? Are we going to allow server algorithms to run everywhere as well? I guess if we solve the secuirty problem, that would actually be quite nice.

    I agree there needs to be a better mechanism for delivering content everyone wants more efficiently. I'm reminded of this as well when I go to watch a live event and the response is very slow (not to mention 30 to 45 seconds behind), or I try to download a large file. Also with VOIP routing headers taking up 30% of wireless bandwidth alone, something has to be done with our protocols so they are lighter weight.

    Just as mechanical switches had to be programmed and were part of the problem, I think TCP/IP is part of the problem here as well. There are still (as he mentioned) all kinds of barriers that have to be known about and planned for. As he said, it is a very static system requiring a day or at least minutes to pass before you can be seen by someone else on the net. I use a networking system that is tightly integrated into an operating system (QNX) that I like very much (still has its weaknesses) but it goes a long way to making the entire computing platform very transparent with the ability to add security.

    Content can be very static or it can be extremely dynamic (sensor data in the dynamic example). I think his ideas seem focus around managing very static data or data that can be "late" and making it more readily available. When it comes to dynamic content (such as live video or VOIP) he didn't seem to offer much in the way of substance, or I may have just missed his point. I guess I see the need for direct "conversation" like connections as well as data "dissemination" type connections.

    In fact, I'm not sure things like the NY Times daily is a good example, as dynamic advertising is very much a part of what they sell, and they need to have their individual subscribers connect to their servers so they can analyze what advertising to focus on the poor unsuspecting reader.

    I personally think that the dynamics of the internet are moving towards very dynamic content and this requires direct connections to the servers which manage that content. Unless this content management (software) can be distributed to other devices along with the data, I'm not sure its going to fly. Microsoft already tried that and now most interactive content is blocked (unless specifically enabled) because of security and other concerns.

    However, these are all application layer problems, do they really belong at the bit level of data being bounced around? I don't know and it is an interesting question.

    We can add layers that do all kinds of fancy things with data to get it to places that were harder for it to get to previously (mirrors, caching, etc). However, in order to make work what he is indicating makes the internet look like nothing more than a big informational resource (file versions, news, video, etc). Like a big file system or database, like a big static storage system. However, the internet is much more dynamic than that and I don't have the feeling that he addressed that aspect well enough.

    What we need is something that will enable us to make "conversation" connections

  34. Re:Superb talk: "data dissemination" not mcast/cac by samjensen · · Score: 1

    I got it, I just posted this late at night with a poor description. In any case people are watching and talking about it, which was my goal.

    --
    this space intentionally left blank
  35. Akamai by amalcon · · Score: 1

    That was one of very few useful talks I've *ever* seen on shortcomings in the Internet.

    Akamai Technologies is really very much in the business of solving the main problem Jacobson describes. Yes, lots of people want the same information. Jacobson is a very bright man, and he got pretty much everything right except: "You can't Akamize dynamic content." Yes, you can -- unless live feeds of sporting events (NCAA March Madness) aren't considered dynamic enough.

    That said, there probably is room for a truly open platform in this case. If it can do half of what the Akamai network can, I wholeheartedly congratulate the creators their amazing feat of engineering. To my knowledge, the closest thing these days is the coral cache, which actually is restricted to static content.

    --
    -Amalcon
    1. Re:Akamai by rreyelts · · Score: 1

      he got pretty much everything right except: "You can't Akamize dynamic content." Yes, you can -- unless live feeds of sporting events (NCAA March Madness) aren't considered dynamic enough.

      I didn't watch the video, but usually when people talk about "dynamic content" they mean content that is generated on the fly, personalized to a particular user. So, as an example, you typically can't wholesale cache a page generated that way for a user with Akamai - it doesn't make any sense.
  36. Re:Superb talk: "data dissemination" not mcast/cac by Anonymous Coward · · Score: 0

    Corporations make money by restricting access to information.

    Judging by the many jabs he took at Microsoft and at the PTTs and the RIAA etc, I'd say that Van Jacobson is very much in touch with companies and their shenanigans. He almost certainly expects them to have learned nothing from their experience so far, and to stick DRM into the data objects distributed by his "3rd phase" network as well.

    In any case, his data dissemination approach would work regardless of what's actually in each data object, and whether it's DRM'd or not. He seems to be very agnostic on content as befits a networking scientist, but on the issue of control then he's strongly in favor of empowering the user. Spammers won't like that, and nor of course will control freaks like media corporations. Tough. :-)

  37. Suddenly the Senator Makes More Sense by Aaron_Pike · · Score: 1

    At one point in his talk, Van Jacobsen talks about segmenting. He mentions that segmenting solves a problem that is metaphorically like having trains and cars on the same city streets; one doesn't want to wait around in a car for a train to clear the intersection. Then he says something that I've heard before, but never made the connection:

    "It would be nice here to not have big trucks, just little cars."

    And suddenly I realized that Senator Stevens had gotten a lecture that he completely misunderstood.

    Granted, the man shouldn't have tried to give a speech on something about which he was completely clueless, but now I think I know where the heck that truck he mentioned came from.

  38. Works fine, no such problems by Anonymous Coward · · Score: 0

    I've got my reading preferences set up, which means my content isn't the same as yours.

    It's partially the same as mine. I have a different set of preferences to you but there is some overlap between us --- that's extremely likely to be the norm.

    Consequently any data dissemination router on the path between Slashdot and our two machines needs to fetch those data items that we both want from upstream only once, and then we both get it sent to us out of its item store. As he said, it's somewhat like a cache, but differs in not being tied to any specific upstream source.

    Making any kind of feedback like digg, youtube, article feedback etc. isn't possible.

    Of course it is. All feedback does is create a new data item. If the result is private then it's no different to now, and if it's shared then this merely adds your data item to the set of current items on the server. Anyone who is interested in items of your type will then see it the next time that they request the list of new or updated objects.

    It's worth noting that even if this list of updates comes from the central server, the data items themselves needn't --- they may already be stored in intermediate nodes on the path, as a result of someone else's slightly earlier query. In effect, this is a data-centric distributed cache with the role of the servers reduced to state updates and data injection only.

  39. Re:I think he only addresses one part of the probl by rbrewer123 · · Score: 1

    When I'm logged into slashdot and browsing news, there's only a very small part of the page that is customized specifically to me. Everything else is the exact same content that everyone else is seeing. Currently the web browser does separate queries to pull down the images in the page, which are mostly the same for everyone. Perhaps under VJ's scheme the text parts that are specific to me would have to come pretty much directly from slashdot's site, but it could contain references to all the common content on the site which would be fully cacheable by his dissemination model.

    Instead of slashdot's content authors writing templates that are processed on their servers before giving the web page to my browser, something similar to their template would be shipped to my browser, which would then ask for all the common (cacheable) content to fill out the page. Image-heavy pages would already work well with his scheme if the browser just requested the images using the new semantic.

  40. Re:I think he only addresses one part of the probl by bsquizzato · · Score: 1

    I don't think he plans for the "request/respond" protocols to be used everywhere in a network. The issue is when many users are requesting the same data. Take for example when a site gets "slashdotted," it'd be more efficient if there weren't several thousand requests for the same exact index.html from some host being individually transmitted through the network.

    In your example of tracking users on Slashdot, parts of that could fall back on the current conversational protocols. It should probably be possible for clients to decide on whether they want to request data by means of the broadcast system or the conversational system. Then I guess the issue is how does the client decide when it should use one or when it should use the other?

    Or, maybe custom homepages and whatnot would have to start being initiated by the client sending a request indicating it needs a specific set of data, instead of the server side realizing that a specific client is requesting data and that it should send the modified data. So in other words, you'd still have to know you want to receive a certain set of data.

    There's definitely a lot of finer details to this...

  41. Water by killfixx · · Score: 1

    This is the water system...translated for data...that's the only way to make it work...All the information (non-specific-user-defined) would have to be in the system at all times...

    If you want a drink you don't specify where the water has to come from...you open the tap and out it comes...but when you want to make lemonade or distilled water lets say...you have to have software on your end (pot and fire) to get out of it what you want...

    But that presents the problem of too much bandwidth usage at old usage methods...unless we go to completely over the air data transfer which takes the phone system out of the loop and allows a more radio style internet...where the data is already being disseminated just waiting for you to tune to that station...you're not asking for the information you're allowing your device to consume it...

    hrmmm

    --
    "Helping to keep you two steps ahead of the Thought Police!"
  42. Dissemination by ant_tmwx · · Score: 1

    Some of these issues seem to be addressed (or are being attempted, at least in the early stages) by metalink which was discussed at http://slashdot.org/article.pl?sid=07/02/25/144209 a few months ago, but I don't think people really understood what it was.

  43. It doesn't solve the spam problem. by Anonymous Coward · · Score: 0


    In his talk, he alludes to it solving "the spam problem", as did a couple of commenters here.


    It doesn't.


    Nor does it solve the "phishing problem".


    Why?


    First, spam.


    Spam is unwanted email that is being delivered to you. Spam isn't resitricted to email, we also get snail mail spam that is also known as "junk mail" - advertising pamphlets from stores, your credit card company, etc. To eliminate spam requires the computer to undertsand what we think we want and don't want. In some cases this is easy - anyone that we've sent email to we probably want to receive email from. Easy. The problem is how do I get email from strangers that I want to read? And the problem for spam software is how to distinguish that from the email coming from strangers that I don't want to read? Changing the model of the internet to require data to always be authenticated by someone or something doesn't solve the problem of "do I want it?"


    Next is phishing. Phishing scams take advantage of naive people by presenting them with something that looks like something else and tricking them into doing something they shouldn't. This is equally achievable using surface mail except that it may require the offender to leave more soid footprints. If I give you an image that says "nytimes.com" on it, but I present it as being from x.com and it validates as being from x.com, then no red flags are raised - it is left to the user to be somehow discover that what they're looking at comes from x.com and not "nytimes.com". Software won't be designed to do that too prominently because it will interfere with the delivery or real content where people don't want the user's screen cluttered with "this came from blah" everywhere.


    But if you can solve the spam problem then you're on track to also solve phishing because phishing is spam.


    And as has been seen so far with SPF for SMTP, spammers are quite willing to do whatever is necessary technologically to overcome simple hurdles.

  44. meapplicationdeveloper by AKELA501 · · Score: 1

    <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
    <html>
    <head>
      <style type="text/css">
    <!-- /* <![CDATA[ */
    @import "/branding/css/tigris.css";
    @import "/inst.css";

    /* ]]> */
    -->

    1. Re:meapplicationdeveloper by eugene+ts+wong · · Score: 1

      I was intending to be humourous with my comment. Were you kidding around too? Were you just trying to emphasize that CSS is good? I agree that CSS is good.