Peer-to-Peer for Academia
Andy Oram has a good speech online about peer-to-peer and universities. He discusses a variety of possible research topics under the p2p umbrella and urges university administrators to promote this instead of squashing it.
A Peer2Peer application that traded, and then firewalled IP addresses known to belong to RIAA and MPAA companies, or the 'IP Watchdog' companies that work for them?
The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
...peer-to-peer file sharing in a purely academic sense is not discouraged or directly banned. However, internet file sharing programs (beginning with Napster) were banned due to the hit that they put on my school's available bandwidth. With over 12,000 100Mbps dorm room connections, it proved a little bit too easy for the student body to overrun the entire network by queueing up 100+ songs on Napster.
I would imagine that it is the same for most universities...they don't discourage file sharing in a more academic capacity, but they know that it's going to be used for Napster-esque file sharing, and thus they are forced to implement an overall ban.
Anyone know the status of this amendment? Did it get tacked onto the bill that passed a few days ago?
Peer-to-peer, like many other technologies, has it's advantages and disadvantages. For some purposes (not just file-swapping), it's absolutely ideal (OK, OK so I'm a fan of SETI@Home :) ).
:) As I was taking my mandatory networking classes (which I wish I had paid more attention during), we discussed p2p quite a bit. By my senior year (Waaaaay back in...'98 :) ) there had already been several groups of students who created p2p final projects.
I just find it rather surprising that academia has taken this long to embrace p2p. It's not as if p2p has been an unknown or undiscussed topic in the realm of computer science. When I was in college, it seemed that the university was eager to stress the importance of object-oriented programming and relational databases...well, as soon as the market stressed their importance.
Is the market the core of the issue? Do colleges only adapt to teaching new technologies quickly when the market demands it? If that's the case, it would seem like more CS degress would be the equivalent of training at a vocational/technical school.
My sigs always suck.
I have often thought of P2P as being similiar to neural networks or the brain. Nodes that are structurally similiar and carry info to and fro.
Do our brains have bandwidth issues? No, because supposedly we only use 10%. Gnutella is always ridiculed because of it's overhead though. But Napster and the rest don't really count though because they are centralized, so how does our brain not get overwhelmed and how can this be applied to P2P.
the flavour of the month is kaza and morpheus. Both install with sharing turned on by default so regardless of whether or not the student understands the implications of this, (they often don't) significant school bandwidth is consumed by people who have no affiliation whatsoever with the school.
And then people complain about transfer rates.
- Toby
There's nothing quite as satisfying as having academics argue for perfectly logical systems that will also allow me to continue downloading porn, warez and mp3s.
m00.
Meanwhile, you could possibly get some serious p2p going, at this catchy web address, if you are shrewd enough to follow some simple directions.
Have you seen these face scans, etc..., of the REAL .commIEs? I thought so.
"I'm not surprised that colleges would complain about Napster bandwidth requirements because I hear the same wringing of hands over education in general. I hear there are too many applicants to top colleges. Excuse me, but wouldn't it be good to educate more students? Instead of saying there are too many applicants, why don't you work on increasing the availability of high-quality course offerings? I know you don't have tenure-track positions for all the people awarded doctorates, but it's not your job to offer everyone a position; it's your job to educate them."
Excellent point.
P2P is a topic of some interest for networking and distributed systems researchers. There are people all over the world already working hard on this topic for a few years and probably for more than that without even recognizing or uttering the word p2p. The speech strikes me as not terribly timely. Researchers don't give a damn about whether or not Napster is banned on the campus, at least with respect to working on, say, distributed file systems or reliable and efficient p2p routing. For example, at my school, I am working on an open reimplementation of a P2P routing scheme for a project course (the original implementation is closed as it was developed while working for the bad people). How many small teams of graduate students and seniors do you think are doing the same thing at other universities? I would guess a half dozen around the world. Academia is already busy fixing the problems with P2P and it has access to manpower and hardware (cluster machines and network simulators). We don't have to worry that people will be scared to think about this topic.
For your information, here at UM, It's not discouraged. However, I've noticed a few things while here.
This year @ UM, they dropped each port down to 10mbps in the dorm rooms. They also put a cap on the lines at 5mbps
Interestingly enough, this year they have put everything behind a NAT/Proxy. This has prooved to be somewhat of a problem.
Supposedly the reason behind this move was as follows:
1. Someone set up a server and leased multiple IP's in their dorm room, and hosted porn sites. I have yet to figure out how this person got an authority on DNS, but that information we don't know.
2. Many people were setting up FTP's to serve multiple clients.
It wasn't the peertopeer programs that did it, it was those who actually took advantage of our non locked down network. It was nice then, those who knew the possibilities, could use it as they felt necessary, while not taking advantage of it. But, they decided to take advantage of the system and flood the line with unnecessary bandwidth. Thus ruining it for all of us.
Thus, everything was put behind a proxy/NAT. Now, the connection in the dorms *suck* because they aren't doing it properly. I'm loosing like 30-50% of all packets sent out, within the campus network. Their servers pipe'ing all the traffic must be bogged down here at the U.
I did notice however (when performance wasn't varying from 0kbps to 3mbps, mostly towards the lower end..) that in the beginning of the year ports that Napster, and other programs use were blocked, and that wasn't too nice. I wound up using my own Proxy on one of my servers, off site.
The point is, that I think they won't be banning these things all together, but many providers will be implimenting strategies to imply discouragement of these services. IE. The implimentation of QoS and limiting everyone. Or, you will see more active flaging of people who use lots of bandwidth quite frequently.
------------
Sase
"It's the opposite of that."
The real sticking point, however, is what happens when general file-sharing software becomes popular, and people are sending each other pictures of the kids, notes, and all other sorts of digital goodies in addition to music.
Napster was banned for two reasons: bandwidth and copyright infringement. What's likely to happen in the case of general purposes P2P apps is that universities and ISPs will start to block out the software(such as gnutella) rather than individual users when they get complaints of copyright infringement, making the public suffer for the actions of the few. Worse, all of those legitimate users of P2P software will be labeled as "pirates."
The society for a thought-free internet welcomes you.
College administrators have fallen into the same rut as telephone companies that are slow to roll out high-bandwidth lines, or the recording industry that is shutting down Napster. These institutions all find it more profitable to manage scarcity than to offer abundance.
(emphasis mine)
That's the problem right there. As resources become abundant, price should drop, availablility goes up, the product reaches a wider audience. It took how many years (lack of competition) for Microsoft to ship a decent product? How many DSL providers dissapeared? The RIAA and MPAA want to strangle any revolutions in the distribution of their product. What kind of market model is that!?!
When companies can hold back on the resources they control to keep profits rising, there's a problem.
The thing is that occasionally there does arise a legitimate need to send a large file outside the university. It's really frustrating to have to wait several hours for a file transfer that could have taken 20 minutes. What's odd is that this in no way reduces piracy - people can still download whatever they want at ungodly speeds. I don't understand why they only blocked the sending.
So far, I don't know of any way to get around the cap, though I've tried a few little things. I don't know how it's implemented, but do let me know if you have any ideas. Or you can just rant at me.
Ceci n'est pas une sig
I'm going into CS graduate school next year and my proposed research area is something close to P2P optimisation. Does anyone know of professors already doing research in P2P?
Growing Trend in Peer-to-Peer Girlfriends
Stamford, CT - Internet consulting firm Gartner Group predicts that growth in peer-to-peer girlfriends will explode in the coming months. "Right now the P2P girlfriends are in the hands of early adopters in the tech community. We think that by the end of the year they will have reached critical mass and move into the mainstream. We forecast that by 2003, 65% of girlfriends will be peer-to-peer," said consultant Dawn Haisley.
One of the first movers was Computer Science student Neil Joseph, "I was pretty pissed when she told me she slept with someone else, but when I found out she was one of the new peer-to-peer girlfriends I was geeked. I love being a beta-tester. My friends are telling me I should leave her, but I know they are just jealous."
The beauty of a peer-to-peer girlfriend is that one peer doesn't know what the other peer is doing. Anonymity is extremely important in maintaining the integrity the network. Most girlfriends report that the speed between peers is more satisfying in a local network, but anonymity is easier to keep in a world wide network.
Some techies aren't pleased with P2P girlfriends. "These consultants throw around terms like peer-to-peer and they don't even know what the phrase means," said networking guru Mitch Mead, "P2P girlfriends aren't even a true peer-to-peer network. They are just a client-server model trying to jump on the P2P bandwagon."
Tom Mansfield agrees, "I had a so-called P2P girlfriend, but she was more like a lyin', cheatin' slut."
... I'd believe that no "technology" or "idea" should be squashed. Isn't that the point of academia? To learn?
I think all the systems and networks at a university should have a splashing of all the old & new technologies, throughout.
Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
thinkgeek has turned really lame. is it just me, or are the advertisements they use on slashdot just fucking stupid? who would pay this much for this shit? hello pricewatch.com?
who gives a shit about caffeinated drinks...coke works great and it's only 99 cents per two liter with no shipping.
the latency associated with some dot-com "companies" has apparently hit thinkgeek. thankfully it will die very soon.
I know this is OT but since it comes up so often I thought we would all benefit by knowing that the idea "We only use 10% of our brain!" is a myth.
The two points snipped from the article:
1.) Brain imaging research techniques such as PET scans (positron emission tomography) and fMRI (functional magnetic resonance imaging) clearly show that the vast majority of the brain does not lie fallow. Indeed, although certain minor functions may use only a small part of the brain at one time, any sufficiently complex set of activities or thought patterns will indeed use many parts of the brain. Just as people don't use all of their muscle groups at one time, they also don't use all of their brain at once. For any given activity, such as eating, watching television, making love, or reading Skeptical Inquirer, you may use a few specific parts of your brain. Over the course of a whole day, however, just about all of the brain is used at one time or another.
2.) The myth presupposes an extreme localization of functions in the brain. If the "used" or "necessary" parts of the brain were scattered all around the organ, that would imply that much of the brain is in fact necessary. But the myth implies that the "used" part of the brain is a discrete area, and the "unused" part is like an appendix or tonsil, taking up space but essentially unnecessary. But if all those parts of the brain are unused, removal or damage to the "unused" part of the brain should be minor or unnoticed. Yet people who have suffered head trauma, a stroke, or other brain injury are frequently severely impaired. Have you ever heard a doctor say, ". . . But luckily when that bullet entered his skull, it only damaged the 90 percent of his brain he didn't use"? Of course not.
As the article says "For a much more thorough and detailed analysis of the subject, see Barry Beyerstein's chapter in the new book Mind Myths: Exploring Everyday Mysteries of the Mind [1999]"
What is music when you despise all sound?
It is often that I read knowledgeless prattle on Slashdot ... usually only from fellow commentors. This is not a troll, it is serious criticism of an article that is blatently wrong. Let's examine Mr. Oram's discussions of P2P ...
Did Universities try to stop P2P? Napster, certainly. Probably many other file sharing systems too. Why on earth would they do that? Bandwidth, security, liability. I'll elaborate later.
Mr. Oram asserts that P2P is a great way to overcome limited resources. Then expounds on how Internet2 and IPv6 are going to remove the resource barriers to P2P.
Is P2P new? No. IRC's DCC extensions have been around for at least 8 years; ytalk is even older. The idea of dsitributing information on a whole lot of servers without central control is, surprise surprise, the basis for the Web. P2P simply involves direct communication between clients, at most using a server to mediate discovery.
I'm going to ignore the anti-DMCA dissertation, because its been heard before. It also has nothing to do with P2P; just a few specialised services that use P2P as a means to swap copyright information. If it wasn't for people like Mr Oram confusing P2P with specific P2P applications, then P2P as a whole wouldn't have a bad name.
A little later we hit the "IPv6 will help" argument, to which I can only say: security. Sure, you get rid of NAT. But at the risk of placing your device in the line of fire. Even if it is "secure by default" (so end users don't have to worry too much), it is still accessible from everywhere. That means DOS vulnerable, attack vulnerable when a security hole is found, and each and every individual is responsible for their own security. That doesn't work in corporate of group/organization networking. A central point needs primary control over security for the entire network. NAT, firewalls, and prevention of arbitary data coming IN to the network unsolicited are significant defenses against attack.
Which brings up the strongest point for universities to deny P2P: they would have to allow access to P2P services (yes, P2P is actually a client and a server on each machine) behind their firewalls, causing a security risk. Typically universities have a limited number of computers providing services behind firewalls, and take care to guard them against attack, and quarantine them in case of breach. With P2P, this approach goes out of the window.
For the same reason Mr Oram has ignored the security communities hatred of SOAP, a protocol explicitly designed to penetrate those nasty firewalls that administrators put up. Tell me, why don't we just set up a public inbound IP-over-TCPIP tunnel available on all firewalls so that we can get past them?
Now Mr Oram turns to debunking the security argument. Totally missing the point of course. You can encrypt and sign until your CPU is blue in the face, and still have zero security because your computer has been compromised. Unless you can adequately secure ALL services on your computer, you are insecure. One of the best ways to secure a service is to shut it down. The more services, the more ports of entry. Not surprisingly, P2P is a service.
Sendmail and apache serve massive amount of network traffic every day. They have taken years to mature to a point where they are mostly secure, yet new hacks are found for them every so often. How long until P2P implementations reach this level of maturity, and security?
The McAfee example is laughable, to say the least. Multitier client-server technology isn't P2P, not matter what this supposed expert wants to believe. Oh yes -- what was that announcement two weeks ago about an attack on the McAfee auto-upgrade feature?
While most of the assertions regarding bandwidth are true (shock!), Mr Oram is WAY OUT on the University issue. You see, students may be downloading the same amount irrespective of whether they use P2P or FTP ... but there is the issue of UPLOADING. Having administered a network for just a small company at the time of Napsterism, I saw a massive increase in bandwidth use just from Napster fielding and responsing to queries, even before local users started downloading the music.
Finally we conclude by returning to nonsense: Seti@home is P2P?!? In what universe does distributed computing offloaded by a central server and in which none of the computing nodes communicate with each other get classified as P2P?
Please, Mr Oram. Understand at least the vaguest basics of a topic before spewing garbage about it.
i-name =twylite [http://public.xdi.org/=twylite], see idcommons.net
Second, P2P may work fine within the university with current equipment for current applications. Now add in P2P video, streaming audio, you name it. Now you're talking about multiples, or decades, of new traffic for your new P2P applications. (Almost nobody wants to do great new things with ASCII text, alas!) Soon you will need new switches, routers, all within your on-site network. A $12,000 router may not be too bad, until you need one for every 1,000 users. And if traffic keeps growing, you may need to replace it in 3-5 years. Flat rate fees? Going up!
You get what you pay for.
TANSTAAFL.
Now just how bad do you want more bandwidth?
As I recall, there was an attempt made to tack on a provision that would allow these organizations to inspect your machine, but that the attempt was retracted. There are serious issues in enacting a provision like this - how do they know which are legal copies, for instance. There was a /. post about this a few weeks back. Any other readers may feel free to correct me on this, but I believe that was the 'state of the union' so to speak.
First off, academic bandwidth should be used for academic purposes. Sure, limited personal use is fine but its main purpose isn't entertainment. That being said, I know my university doesn't care if you run SETI@Home but do care when you run Kazzaa or other file sharing software. I think that's a good stand by the administrators. My university doesn't force anyone to use their network services. (They actually encourage you to get @Home.) If you want to use P2P software, get @Home or DSL and do whatever you want.
I'm not sure in the States, but in Canada Universities have very small budgets that are being cut yearly. I'd rather the University had a decent network and focus spending on research rather than worry about supporting P2P stuff.
Isn't P2P just the current Internet reversed? I see alot of articles that seem to focus on the software, which is no small part. However, isn't p2p just changing the communications model from a network of dumb terminals attached to smart backbones, to that of a network of intelligent terminals attached to less intelligent backbones?
Instead of my stupid computer being passive when retrieving information, I can have passive retrieval while agressively distributing information too. This means that we are all content providers, with high redundancy. This is supposed to be a Good Thing.
The problem there of course is that that opens up a can of worms on intellectual property, and copyright all that crap.
But certain powerful groups want to curtail this, much like the church despaired when literacy reared it's ugly head. Too bad for them. We all know that information should be democritized. And only civil disobediance will be able to counter pressures against that democritization.
Things have gotten worse since television, when our entertainment and our news/information became entwined a little too close together. P2P allows us to change this. But lableleing people pirates and copyright thieves is the Old Way. It really is. Forget about the dot-bomb and IPOs, a new found ability to communicate amongst one another is at risk right now.
We all need to pay for the goods and services we use to access information, and those who work hard to build that infrastructure need to reap the benefits. I think that Freenets and private neighbourhood nets are a good thing, as are commercial ventures, but the actual money value of information will go down simply because it is now so easily reproducible. Profit should be made in it's distribution and not in the hoarding of easily gotten patents and copyrights. That does no one any good.
ramble ramble ramble....gnashing of teeth
IPv6 will definitely help. It will, we hope, bring users' systems out into the open, eliminating the current Network Address Translation system that hides the users.
I think that if there is anything that will make users systems less obscurely identified on a network, it will *not* be IPv6. With the power that the general public will have over IP addresses, NAT may be only slightly less useful, and IP's will change so frequently that nobody will be able to figure out where the 'ghost host' went. I for one, prefer the miniscule amount of obscurity my wireless NAT'd connection provides me when browsing.
Try setting up a machine that's completely open to cookies and the like, but only use it to occasionally browse the type of sites you normally wouldn't- say Pokemon and Barney sites. Just watch the spam and pop-ups accumulate relative to those subjects. Nah, I'd rather not "log in automatically" or "save your username and password" - disable all those people tracking devices, and change IP's / MACs on a regular basis.
I apparently forgot that sig != uptime...
I'd be more worried about the tendency of some universities to build strong firewalls around their networks that filter out all incoming traffic, thus preventing the use of any private servers and peer-to-peer clients of students as well as researchers.
Our university did this, which has annoyed especially many computer science students. For me, it closed down my largeish website, together with many CGI programs for research (such as a data equalizer for neural net research) and personal purposes.
I wrote a long complaint (in Finnish sorry) about the problem, but since most people don't need (or don't know they need) the service, they don't care. The students still can put up their web page to a poorly administered and always outdated main server, which doesn't have any DB or other softwares, and has very severe restrictions on disk space (on the order of 10 megs while I'd need some 10 gigs).
I see this also as a serious threat to the development of new Internet services. If you look at most of the existing Internet technologies (http, nntp, smtp, bind...), they were all created in universities as "gray research", often by students. In a tightly firewalled Internet, they might never have made it out.
Sure, researchers and deparments of our university can theoretically have their own servers, if the department's head takes personal official responsibility and the department officially allocates money for the upkeep. This means absolute ban for almost all "gray research" projects (often part of larger projects.)
In our case, firewalling was explained with need for tighter security. However, an easy-to-use unofficial port registration would have solved most of the security problems. It's difficult to say what's the real reason; perhaps over-enthusiasm for "high-end security tech", or perhaps just low interest to administer the system - if the net isn't used it doesn't cause so much work, right?
Oh, and we pay for our connections, although they are partly subvented. Well, it might even be profitable for the university. (Note that studying doesn't cost anything here.)
Universities generally aren't concerned with P2P file sharing over Internet2. We have plenty of capacity. No Internet2 core circuit was ever saturated. Congestion on campus connections to GigaPoPs and GigaPoP connections to Abilene is very infrequent and easy to deal with (usually, by upgrading the circuit).
What universities are concerned about is Internet1 usage. They generally have metered commodity connections that cost a lot of money and are often congested.
Many universities have unwittingly become information producers for home users on cable and DSL connections, who download a lot of stuff from university dorms. This costs universities serious money while it's hard to argue that it furthers any educational goals.
-- Stanislav Shalunov
I'm one of the developers on a new p2p network system called pods.
Pods is a decentralised P2P network for sharing abstracted resources over nodes that need not supply or know of the resource through the use of XML.
The Disk and Processor resources are already in place and working well.
Have a look at it here
CFS and PAST are P2P readonly file systems a la Napster/Gnutella/Freenet. Both had papers in this year's SOSP. Both are based on log(N) P2P overlay routing/lookup substrates.
OceanStore seeks to be a more general (writable) global storage system.
And several P2P conferences have formed and will continue to form.
Some of these projects have been going on for years. So you shouldn't buy the "Academic networking/CS researchers are a bunch of P2P haters" line without a few grains of your favorite seasoning.
Another big reason not to squash the P2P servers is because usually transferring files within the university network more efficient causing it to go faster because it easies up on the external hand off, which for most university is very slow and expensive.
Granted P2P does supply a breeding ground for piracy, but in the p2p programs that have chat capabilities it provides another community where information can spread such as concerts and fundraisers. It becomes a local internet community which is something that really hasn't really been seen before
I agree with the others who said that there IS a lot of P2P research going on in Universities. I know because I am writing a paper on it and was forced to find journal articles on it. The phrase Peer-to-peer or P2P is not always used - sometimes its 'Serverless Distributed File Systems', etc, I think in part because P2P is so mis-used.
For more info check out these implementations:
Farsite, xFS, Frangipani, Intermemory, OceanStore, Eternity Service, India, PAST, Free Haven Gnutella, Freenet, Pastry, Tapestry, CHORD and CAN. (Not Napster!)
Oram's speech is interesting, but offers little that is new.
His assertion that academia was uninterested in p2p technologies because university administrations acted (responsibly, imho) to prevent the use of tools designed for the illegal redistribution of copyright works is a little disingenuous. Certain areas of computer science research are extremely interested in these technologies (I count myself amongst these - my PhD research involved p2p resource discovery techniques), in particular those which deal with developments in distributed systems.
However, there is a great deal of hype about the efficacy and efficiency of p2p systems (I consider Oram's article to be an example of such), so it is right that academia should judge these systems on their merits rather than simply accepting the claims at face value.
For example, Oram's article contains the (uncontroversial) statement that peer-to-peer technologies cannot only distribute files, it can also distribute the burden of supporting network connections and then goes on to claim that the overall bandwidth remains the same as in centralised systems - which is rarely the case (in the p2p domain I have studied - resource and service discovery - even the more efficient decentralised systems have a significantly greater communication complexity than do centralised systems).
Valid research cannot be founded on spurious claims such as these. p2p technologies may have a number of advantages, but forgive us in academia if we don't get very excited about their disadvantages.
Wired News yesterday ran an interesting story about how Intel adopted the Napter model to distribute its own multimedia material to its various far flung offices around the world. They found the system was ten times cheaper than sending the file out from big central servers, and a lot faster as well.
It was interesting to see the p2p idea moved beyond academic theory and actually implemented in real world situations by a commercial entity with beneficial and measurable results.
Trickster Coyote
Reality isn't all its cracked up to be.
Ideology is for ideots.
Here at my school we only have a 3 T-1 connection to the internet for about 3000 users, and since we're in the middle of no where, they are pegged almost 24/7 since we have nothing better to do. So to help this problem a group of friends and I got together and hacked together a Gnutella clone that only works in our class B that we call Stotella
has anyone else done anything like this?
P2P is more than swapping files efficiently, sweating about copyright and bashing the RIAA. at P2PQ for example they are doing somehting that uses P2P but which is unrelated to filesharing. This is a perfect example of P2P put to a pure academic purpose. If all the people holed up in dorms in colleges around the country used it as much as they use file swapping P2P, we would not hear so much of the tired arguments against P2P being trottedd around whenever those three characters are strung together.
Hey Anonymous Coward, You're right, P2P is more than just bashing the RIAA etc... it is a whole way of thinking, beyond "information should be free"... It's almost like an addition to the saying... Information wants to be free, BUT In order for information systems to work at all, we must tend to their problems and optimize the 'bad code'. I just finished an article, which happens to be on a related train of thought at Shift.com called "P2P Terror: The People Are The Network" and the subheading is "Although the RIAA won the war on Napster, peer-to-peer file sharing survives. Now the U.S. is going to war against a decentralized terrorist threat. Can they win?" I'd really appreciate some feedback, so Checker oot if you can at P2P Terror thanks :)
geeks are cats who dig a certain kind of cool