P2P Web searches
prostoalex writes "Researchers at UCLA are looking for easier ways to implement Web searches by using peer-to-peer techniques to decrease the workload. 'Queries need to be passed along only a few links rather than flooded throughout the network, which keeps search-related traffic low,' reports Technology Research News."
The searching load on servers might be reduced i suppose. But from my experiences with P2P searches are long and slow. How would this help exactly?
Maybe in future Google will implement a small server in our "Gmail notifier" application, and each time we search for something on google, it will cache some of the results, and should anyone close by ask for it, just forward the old results to them.
:D
Save the server load on the main google server!
**Plus maybe some smart guy will figure out how to trade mp3s over the GoOgLe-P2p network!
Online backup with Mozy, sounds like Ozzie, but more!
Google still works.
Results 1 - 10 of about 6,290,000 for p2p [definition]. (0.19 seconds)
webpage
The old ways of doing things still work.
Which is why I still prefer walking everywhere, using chalk and slate for taking notes, and refuse to use a zipper...
In other words, wtf?!?!
From a quick read of the article it sounds like what they've done is implemented a slightly more sophistcated/less deterministic version of the ultrapeer/hub system already in use by Gnutella/G2 Basicaly quereies are routed such that they are guarenteed to reach a "highly-connected node" which is the equivalent of an ultrapeer/hub node. The main difference is the folks at UCLA have come up with a novel method of picking ultrapeers, but the end result isn't much different.
Q: What is $search_term and how does it work?
A: A simple google search shows that $search_term is $blahblah and you use it like $this (repeated a hundred times)
Add another hundred replies about how the poster should search before submitting, and how AskSlashdot is degenerating into AskPeopleToGoogleForYou, and there you have it. P2P searching in all its glory.
Ceci n'est pas une sig
That wouldn't solve the problem of local areas of users that are disconnected from everyone but themselves. I know this is an issue with other p2p apps. You can only connect to someone who's in your area, and sometimes that just isn't good enough. I know China is in many respects isolated from the rest of the internet.
Si la vida me da palo, yo la voy a soportar Si la vida me da palo, yo la voy a espabilar
Which is why I still prefer walking everywhere, using chalk and slate for taking notes, and refuse to use a zipper...
And why I like using my RPN calculator to change the TV station...
No, they won't. You need tons of server hardware to cope with the bandwidth of anything even remotely popular. Thus free services tend to be spoiled with ads and whatnot.
The magic of p2p is that you can build the same out of 'thin air'. There are no expensive server rooms and gigabit lines but just a bunch of nodes that are slightly more complicated than simple clients. You use it, you provide it. Fair game and you get exactly the kind of service you want without strings attached. At least theoretically - the reality still seems to be something different.
Infrasearch was working on this, until Sun paid $8M for the company, them had them work on something else, then Gene Kan committed suicide. Be careful what you work on.
Google, Yahoo etc of course crawl the web at large, but even if you want to throw a peer network at crawling, aren't you mitigating freshness?
What I can see is a DNS-like system for propogating metadata in to the interior of the network, and maybe a caching mechanism as a result...not sure if this is what they mean.
It's called grub.
Feel free to shoot full of holes as needed....
Every website has DNS servers so what if that same company that ran the DNS servers indexed the pages of the sites that it hosted? Daily?
Wouldn't that then provide a complete index of the web?
Start a search and somehow get the results back through that distributed method. Haven't figure that out yet...... but if you can...
PROFIT!!!!!
It's an ariticle describing a new p2p query routing method. Nothing more. There's already a lot of such algorithms out there. This one seems to exhibit some nice completness properties that hold in idealized scale free networks. But I'm not convinced such a theoretical property would hold in the real world. While p2p networks tend to be roughly scale free, the "roughly" and "tend to be" qualifiers are what make such theoretical properties unlikely to hold in practice.
Nice to see they plan to release some software based on the technique though.
A peer to peer program Ants P2P has just implimented a Distributed Search Engine .Ants P2P is Based on Ant Routing Anlgorithms so it needed a solution to finding files on its network it found a solution that works .The Network also has a HTTP tunneling feature and its developer Roberto Rossi is creating a search solution based on simmilar methoods to search Web Pages published on the network .
Ants P2P is designed to protect the identity of its users by using a series of middle-men nodes to transfer files from the source to destination. As additional security, transfers are Point to Point secured and EndPoint to EndPoint secured.
1. Distributed search Engine - Each node performs periodic random queries over the network and keeps an indexed table of the results it gets. When you do a query you will get files with or without sources. If you get files simply indexed (without a source), you can schedule the download. As soon as Ants finds a valid source, it will begin the download. This will also solve the problem of unprocessed queries. This way you will get almost all the files in the network that match your query with a single search.
http://sourceforge.net/projects/antsp2p/
If you'd take some time to actually read the article, you'd see that the story is about research that addresses congestion problems with existing p2p methods.
Besides, much if not most traffic from p2p networks is from file downloads, not query routing. Moving files to a centralized server isn't going to reduce that traffic at all. In fact, the bottlenecks that result can make congestion even worse.
Moving files to central servers only seems to help congestion because central servers with anything interesting to download tend to be shut down quickly.
I'm so sick of companies wanting to push off their crap onto us. If I want something from them they should offer it me on terms I find acceptable.
In this case a couple of text links which may intrest me (Google refrence : check).
I don't want to have to share my bandwith with 50 other people so they can do the same. If you want to run a service, website or game server you should pay for it. Don't start passing off the bandwith bill onto us users.
Either get used to the heat (price) or get out of the kitchen (market).
I like muppets.
You're right, but consider this:
The entertainment industry is trying very hard to convince the US government that all P2P can be used for is copyright infringement, so it should be banned completely.
Any non-infringing use obviously proves them wrong, no matter how out there it is.
Right now, I think we need as many off-the-wall uses as possible for P2P, even if it's not the most efficient way to accomplish the task.
Calling mass attention to these uses wouldn't hurt, either.
"City hall" in German is "Rathaus" Kinda explains a few things......
?
... so what problem are they trying to solve? an academic exercise? well, that's okay ... but let's call it what it is.
the point is that new technologies are adopted when they improve on an existing method. we already have super-fast, super robust, complete search technologies that are not p2p
google is already so fast as i would not notice it is if were any faster. the best a p2p search technology could achieve would be equivalent speed with the addition of the consumption of my bandwidth.
Step 1) Find established technology which is working more or less happily as-is
Step 2) Add the word 'p2p' in front of it.
Step 3) ???
Step 4) Profit
I assume Step 3) is now as simple as "show name of new product with 'p2p' in the subject and explain how its NOT related to pirating movies or music" (to increase investor confidence they're not going to get taken to town by the RIAA/MPAA), then its just sit back and watch the fat investment/grant dollars roll in!