Web Page Entanglement
jason continues:
"By viewing the web through a tangle proxy, you can see the connections and associations left by those who surfed the web before you. By surfing the web using tangle, you also leave behind connections and associations for others who will surf in the future.
When you exit one page and enter another (by clicking a link or performing a search), a two-way link is created between the pages. As users surf through a particular page over time, tangle keeps track of popular ways to get to the page and popular places to go next. These entry and exit links are displayed at the top of each page, sorted by popularity.
Clicking on one of these entry/exit links tells tangle that you think the link is relevant and useful (like a vote for the link) and increases the link's popularity. In other words, if a user thinks of something relevant while reading a page and performs a search for it from that page, tangle gauges how others react to that association over time.
tangle is similar in some ways to the closed-loop hypertext system Everything2, though tangle works for the web at large.
We have several tangle proxies up and running. The tangle proxy software is also available for download.
A note for the paranoid:
Though tangle keeps track of web usage patterns, the focus is not on tracking the habits of individual users, but on tracking the trends of an entire community of users. tangle is GPL'd open source [source here], so you can see for yourself: clicking a link through a tangle proxy simply bumps up the links popularity---user IP addresses are completely ignored."
FP baby!
You'd get the a goatse.cx link on top of every page.
I consistently get timeouts when trying to goto that ucsc.edu link :(
--- If I were a fish, I'd be wet
Through goatse.cx, and If we all play our part, we can get gnu.org associated with goatse.cx!
Does this mean that once quantum computers arrive, we will experience quantum entanglement?
Thank you, I'll be here all week :P
"Einstein argued that [...] God is not capricious or arbitrary. No such faith comforts the software engineer." ~ Brooks
It seems pretty poiintless to me...
What?!? Pointless? Think of how the porn industry can apply this technology...
Microsoft does something similar with their Smart Tags. That is, they modify your page without you realizing it. Only with entanglement, it's done on the server, rather than on the browser.
Is there a way to block entanglement?
I trust this will enable me to surf the web at a speed Faster Than Light. Otherwise I want a refund.
-- Thou hast strayed far from the path of the Avatar.
how does it run on FreeBSD. Can we get it in the ports collection?
--
pants ahoy
Wow a new record for being modded down. Keep up the good work fucktards!
Brilliant, I can't believe someone hasn't come up with this before. It reminds me of the traveling salesment implementation that models the way ants work. Most ants go the way most ants go, everyonce and a while some ants stray to find a better path.
If this isn't abused by users, I see the net becoming much more efficient for searching for information. You won't have to wait for the search engines to catch up while looking for the most popular page on a topic, because the best (or should I say most popular) pages on a topic will automatically link to each other based on user flow.
Am I missing something here, or am I right in thinking this will revolutionize the way we surf (that is if enough sites do it.)?
Why, o why must the sky fall when I've learned to fly?
If the more popular links are shown first, doesn't it just reinforce their popularity? Once a link becomes popular, is there any way to vote it down?
Microsoft Corporation
The page brings up your header "more play, your way"
Sort of ironic
How long before this goes the way of the search engines with people abusing this to promote their own links?
This looks a lot like Everything2's automatic links. I wonder if people won't start using it to express their dislike in an anonymous manner (like, outlinking to "pieceofcrap.com" if they don't like the page)...
Excluding mutually authenticated ssl sessions, how can I trust that the document I'm reading is the document I tried to download? The tangle service is already modifying the page to add its navigation links, so why not change the content too ( e.g. remove content that users might find offensive, replace ads on popular pages with ads that you've sold, change links to documents you host, etc. )? The same really goes for any proxy or cache service, and I'm not accusing these good people of doing this, but how do we protect ourselves from services that would as more of them appear?
Combine this idea with Google News and we'll really have it made:
TOP STORIES: US
Fark has posted three new boobies links. Ten million kittens reported dead. (5,398,298 related)
It sounds cool, but might prove to be useless... the phenomenon will happen that popular sites will be the ones getting the most hits and just perpetuating that way just because they are popular. More useful but less popular sites will be overlooked because they haven't been looked at much.
Comment removed based on user account deletion
If this caught on, I can imagine that it might be possible that people would tend to depend on it. It seems that information would become stagnate and new information ignored since nowone would have exited to it initally. Then again, maybe not. Just a thought.
Im not here now... Im out KILLING pepperoni
I'm noticing a downward trend here. The web is becoming increasingly overloaded. Once critical mass is reached, the entire Internet user database will be beseached. As I'm sure all music stealers are familiar with, peer-to-peer is becoming increasingly more practical. I see Tangle as a great step towards a peer-to-peer HTTP, but its only a step. One small step for HTTP. One giant leep for P2P. Fact is, not everyone is going to run Tangle. Could it be possible to shoehorn on P2P possibly using multiple DNS A RR's, or in such a way as to bring P2P to a more broad pool of users?
"The lesson to be learned is not to take the comments on slashdot too literally." --Vinnie Falco, BearShare
Well, we know exactly what the first entry link at NineNine's and autopr0n's sites will be.
bytesmythe
Hypocrisy is the resin that holds the plywood of society together.
-- Scott Meyer
Tangle is also the name of a literate programming utility by Donald Knuth. Along with WEB.
Responstimes are close to a minute right now on the linked proxy. How would it stack up, if you ran a local entanglement proxy? Would response times still be high, due to negotiations with other nodes?
We do not live in the 21st century. We live in the 20 second century.
Click on this link to associate the GNU hippies with Goatse :-)
Click this link to associate the GNU hippies with Goatse :-)
OK - not really like it, but if they start letting people leave comments, it'll be like thirdvoice (man, I feel OLD in internet time and thirdvoice wasn't even all that long ago!)
creation science book
I'm currently trying to figure out why people visit /. most often after visiting this link, which the entangle system tells me is a popular entry link:
/. to see if a similar article was posted. Weird.
CNN: Iraq Weighs U.N. Resolution
I can only guess that a lot of people rushed over to
Cogito ergo sum in Slashdot.
You should be using this (http://zip.cse.ucsc.edu:8080/request?inform_about _proxy=&link_from_page_title=&link_from_page_url=h ttp://slashdot.org/&link_to_page_url=http://www.gn u.org/ for those who don't trust me) link instead so the referrer will be Slashdot, so the referrer will be correct.
--j
put links on your web pages based on what the web page is about
http://www.vanillaafro.com - take me seriously and I will shoot you
This appears to use the same idea as referer-links on weblogs. Here's the progression from idea to uselessness:
- Obtain data from visitors as they browse.
- Post data obtained form visitors on the same site.
- Watch as three new internet startups market a tool to spam pr0n links on all the pages that use (1) and (2), above.
Only let your users post shit on your site if you want it to all be pr0n spam or goatse links.The point is that it's live interactive software building links between urls. You can't cache that!
....Spam links start appearing?
Another question... When does Alexa get involved in doing "web page entanglement"... It would sort of complement their existing spyware infested "toolbar".
Hey, I just checked the entangled version of the Microsoft.com site, and all the entry and exist links seem to go to Slashdot, Free Software Foundation, or other places that Microsoft stands against. Looks like Slashdot has done its job. Pretty funny.
Ceci n'est pas un post
This appears as an exit link:
"anarax.net - easier to use than a virgin on prom night"
Not very tasteful for a professional site.
Don't get me wrong though, this is a very creative and useful thing. For example, this would be extremely useful for searching through technical support knowledge bases or for a large company's document archive system. I would just rather they leave my web surfing alone. ;)
A major backbone provider sets up automatic proxy redirection for http traffic, and uses proxies like this to gather links between pages, and with that, create a better search engine than google?
I am preempting the likely posts about misspelling, grammar etc.
several tangle proxies that you can try"
and they're all, you guessed it, slashdotted
you mean carrier pigeons right?
This sure sounds a lot like softlinks on Everything2.
Really, rather interesting things. Kind of makes a "nueron net" of the database (or web, for tangle). You get to see everyone's thought patterns, from the relevant links to the one or two offbeat ones.
Stupid like a fox!
I just added something along those lines to my website. I agree it's a cool idea. Of course mine is way more simplistic ::P
-- taking over the world, we are.
no slashdotting anymore? is that it?...
this sounds like another tool for aome Farking asshat advertiser to use...Sorry things are bad enough as it is out here without this BS. I imagine a week befoerr the source has been modified and in use by someone to track and record every piece of infor they can get their grubby hands on...
yeah, that's slashdot, modding it up as insightful instead of funny ;)
This is like the concept of node links at the bottom of everything2 nodes, isn't it? It's a neat idea, but it's easily abused (as seen by the goatse posts above). It can make surfing fun, though. I often spend hours at everything2 following links I find interesting.
if(!cool) exit(-1);
While entangle is useful for killing fleeing units (or peasants at mines), his Web Page Thorns aura is much better... Not to mention Web Page tranquility... but you have to creep slashdot a whole lot to get enough experience.
I know more than you drink.
This is another example of how P2P will eventually take over the everyday tasks of running the Internet. As corporations, such as Worldcoms UUnet, go under in bad economic times it is inevitable that the Internet as a whole becomes P2P.
Where the Music Matters
4. Profit!!
Its nice.....Maybe I am mistaken but isn't this similar to most search technologies a.g. [after google:)] That is to say what other people prefer is automatically tagged the most relevant - google uses it for pageranking, these people display it and some more features.....
Also as another poster suggested what if I virtually stamp all over the place like goto a page and then immediately goto mine - ad inifnitum. Potential to abuse is always there I guess?
Thanks,
vv
Is it just me or did the proxies listed on the site already get /. ed?
"Not knowing when the dawn will come, I open every door." - Emily Dickinson
Mods, this isn't redundant, it's true... and old news since Everything2 is already around.
Of course the problem they've experienced on Everything2 is that some cool or sexy sounding link is irresistible to click on, causing these links to rise to the top regardless of their relevance. Thus, it decreases the usefulness of the "entanglement".
Sex memes really are the most pernicious out there... can you honestly tell me you could resist clicking on "The Screensavers - Nude Episode"? The cost (clicking) to possible benefit (grrrrrrrrrrrrrrr) ratio is just too small not to expend the click.
Pop-up hell might increase cost, thereby disciplining hormonal clickers, but even then. The Onion used to have an ad called "Naked Scottish Weathergirls" -- one of the most clicked on on the web. It led to a messageboard eventually where people posted digitized women in Scotland -- so many people must have arrived there and posted messages asking about the naked women it was unreal.
I'm reminded of the idea of leaving your campus grounds unpaved, and then waiting for the "natural" grooves to appear in the ground where people walk, and then paving over those to make the sidewalks. You've probably seen an example of where there's a sidewalk connecting two points but then there's a worn-out groove nearby that's better, or connects from a more popular location.
Some people think it's rude or immature for people to create these grooves by not walking on the sidewalk, but I see it as an example of an arrogant designer who thinks he knows the best way simply by studying a piece of paper. It's amazing sometimes, the groove just appears almost magically in an optimal place, given the layout of buildings and traffic patterns.
This applies to web pages too. But, unlike sidewalks and buildings, you can't see your other destinations when you're sitting on a web page, so how do you know where to go next? This seems like it will just constantly reinforce the previous set of links, whatever they are.
I didn't fully read the documents (/. strikes again) but what I saw says you move from page to page either by 1) following an existing link or 2) using a search function. #1 is not going to create fresh paths.
It seems to me, a better idea would be to present a user with all possible links, or a subset of possible links, the first few times they visit. Then as they click through the site, add their arcs to the database.
After the first few visits, you can stop showing all links, and show them the "most popular" links. If you just show the popular links up front, new paths may not be discovered.
So perhaps this technique could be seen as a way to remove unpopular links, to trim the fat from a page. Then again, it might not be good to change a page after a person has gotten used to it.
It's very interesting though. As the web matures, you'll see more of this sort of analysis to move beyond static web pages.
3.5. ???
so now every subject in the world is going to
lead to cheap porn
yet another case of someone taking a fish and
calling it a chicken
how about you fucking hippies cut out the weed
and do something useful?
1. Server load.
2. Limited feedback. Would be much more interesting as a tool for discovery if users could grade their findings. Presumably annotation would allow memos to be posted.
3a. Privacy concerns, i.e. this would seem to provide more transparency to crowds. And Slashdotters might become more predictable. (Nah!)
3b. Privacy concerns II. By announcing statistics of aggregate use it might be possible for a repressive regime (China, Scientology) to gain ammunition against individual websites by being able to prove how many visitors they had and (by purchasing an advertisement on an associated server like yahoo) what their IP addresses and demographic profile are (as impled by 3a above). ActiveX or Javascript exploits may also target heavy traffic streams with relatively little effort.
4. Confusing intent. Adding visible backlinks seems quite valuable. However the client still cannot look more than one ply above its current location in what is still an undirected tangle. Is the tangle team (nice name by the way) aware of the large body of work already accomplished in annotation, syntactic web, Xanadu, etc.? What pressures exist to get people to take the less-travelled routes, or is the purpose to increase the traffic of popular sites? In that case are annotations superfluous? More docs please.
5. (?) a bug in slash they note.
Does anyone else remember the What's Related feature that was in Netscape? It's still in Mozilla, but as a sidebar - pop open the sidebar, and there should be a tab labeled "What's Related." It's a list of links between the current page and webpages that people most frequently either leave from the site or use to arrive at the site (I think). Sounds very similar, but since it's already been Slashdotted, I can't compare the two. An interesting idea, but based on having played with What's Related, it isn't really all that useful - you wind up with a common set of sites, and the less well-known sites just get lost in the flood of popular ones.
You are in a maze of twisty little relative jumps, all alike.
I wonder how long it will take shady marketers to hone this power (Gator anyone?)?
So what happens when someone adds a line in their hosts file for gnu.org that points to a local server, adds a link to a modified version of the site with a link of their choice and clicks it?
doesn't this infringe on this?
http://www.tangletoys.com/
as a trademark?
What if your brower doesn't ever send referer headers? How does the system cope with that? Or do simply pass through without voting?
We're going to make information free Mr. Anderson, whether you like it, or not.
"Socket Error
Connection refused by Remote Host"
Awesome use of P2P there to create a robust, scaleable Web. Good thing we dont have to rely on that slow, unreliable old client-server Web architecture anymore!
A note for the paranoid:
Though tangle keeps track of web usage patterns, the focus is not on tracking the habits of individual users, but on tracking the trends of an entire community of users. tangle is GPL'd open source [source here], so you can see for yourself...
Yes, but since this runs on the server, how do I know you're really running the source that's available?.
Or maybe I'm worrying too much, and the check really is in the mail, my information really won't be sold to 3rd parties, that really does happen to all guys at one time or another, and it's not me, it's you.
Ever notice that most comments starting with "Excellent" or "Brilliant", etc tend to be trolls? I almost overlooked this one because of it.
I still see a problem with the described methods though. That being, I don't think that the second-best search page selling product X would want a link running to the next-up competitor selling same product X.
The same is definately true for the second-best... do you really want users checking out where everybody else is looking for better deals?
If you knew that your prices beat the competition it would be a no-brainer, but otherwise it would be in some ways virtual suicide.
Isn't that the same thing that Google is right now?
I do that in the morning. First, the news from europe.cnn.com. Then onto slashdot.org. After that, it's drudgereport.com and a couple of other sites if I have time.
Gosh, these Slashdot people really should check that the links they post go to working websites!
:-)=
Don't hate me coz I'm funny.
bits and peace
Nicholas Daley
I get a 'connection was refused' message on all servers. Is it
a. mozilla
b. a firewall
c. slashdotted
d. they don't want me to go to that site!
I think b. But I'm not sure.
I can see "popular" sites being added for generating income, warez sites aka porn sites/ad sites overwhelming everything, and even "advertising" by having websites removed.
People will naturally click on the top-ranked link(s) on a page in the hope that they're useful. If they're not, you've just voted for them, making them even higher ranked.
Google has a much better method for this - it looks to see how many links there are on the web at large to a page. People don't tend to link to stuff unless they like it. Although it's open to some abuse, it's a much better solution.
I am typing this in my French cybercafe, which has 10 linux terminals on a broadband connection and an ageing Minitel (1200/75 baud, 9" monochrome screen, Cornflakes packet keyboard...). Scary thing is, to find a specific (and reliable) bit of information, it is often faster to use the Minitel. One of the main reasons is that the Minitel is structured in a way that is relatively intuitive for most people.
Tracking which paths people follow is very clever, but I can't help thinking that it would be better if website designers put more effort into their navigation aids, link pages, and - gasp - maybe listened to their visitors a bit more.
The real genius of the Minitel is that it got thin client technology into millions of French homes long before anyone in France or the USA had heard of the Internet, because it is as easy to use as a telephone. The Internet has a long way to go on that score, and I don't think being able to see how everyone else gets lost is going to help in this respect.
Virtually serving coffee
Has anyone actually found a source tarball anywhere?
:8(
What's with the sourceforge.net page saying the project "has not released any files"?
If you download the tangle module in CVS, you get a bunch of C++ that looks like it's missing rather a lot...
Has anyone built this from source yet?
~Tim
--
Rushing on down to the circle of the turn
I've seen the same effect happen with Google and Referer (sic) logs. Consider:
Once the loop gets started, it may keep going even after the entry that started the loop falls off the front page, endlessly perpetuating itself.
Content providers don't have control of what happens to their content after it leaves their server (other than not publishing it to the web in the first place). A link between two similar products is to the benefit of the visitor. They can do comparisions between products, and make a better educated decision. This benefits the visitor - the people who make the Web a thriving community.
If a company doesn't want a link on "their" page to a competitors better product, then they can catch a wake-up and improve their product, instead of rallying against freedom of information (in this case links) and the freedom of user choice.
A company has no problems with being indexed by Google and ranked lower than their competition - so they should have no problem with this method of ranking.
The original article can be found here
mt
No, I didn't. I was hit by a van a few years ago, but I didn't die.
Karma: Undead.
Softlinks have escaped from E2 to the rest of the Web! No one is safe!
About four years ago I came up with a similar idea (with no idea how to implement it. I've learned a little since then, but... glad to see some else doing this) However, it had a couple of additional features.
1. Browser Integration - Server the 'tangle' seprately from the page (perhaps as an option) that the browser can implement as a sort of map. The browser could then have both 'up' and 'down' buttons as well as 'back' and 'forward'. Up and down would serve as a sort of zoom out/in feature, narrowing or widening the context of the 'tangle'. 'back' would go where you came from. 'forward' would take you to the highest ranked link.
2. Selectable (modular?) ranking functions - the default function for page ranking sounds like it is 'popularity'. That is one good metric, but as you can read from the responses on slashdot, it's not the only useful one and may not be useful at all on its own. Other possible metrics could be 'age', 'number of links' (similar to page rank), editor specific (i.e., 'Joe's Tangle'), and various combinations (ie, customize my 'tangle' to rank things acording it rank in multiple other 'tangle's - 1st pop, 3rd newset, 2nd on 'Joe's Tangle' = a score of 6. Rank by score.)
3. Client/Server model - there may be various reasons that I cannot or do not want to do my websurfing through a Tangle Proxy but still need/want the functionality. Perhaps I'm already running through another proxy and don't want to change. Perhaps I'm Joe User and don't know how to navigate through a proxy server. In many ways this is just a rehash and extention of suggestion 1. The browser can contain your client code, downloading 'tangle's and sending updates to the 'tangle' server. Additionally, assuming the client is open source, this allows the user to control what ranking information his client is collecting and sending back to the server. Separating 'tangle' information from the page itself will also allow it to be used with signed content.
(The original idea was also ment to be a browser integrated into the OS to manage files and launch applications.)
Aaron Madsen
aaronmadsen@netscape.ne t
This was five years ago (+/-), and "grooves from walkers on a campus" was given as an analogy by Brewster as he showed off the alpha version. I recall that people's choices were only one of six factors going into the calculations, so popularity wouldn't create a positive feedback loop of overly deep grooveness (my paraphrase).
modconf (0.2.37) stable unstable; urgency=medium
[...]
* Eduard Bloch:
- fixed Makefile broken Marcin Owsiany a while ago. The default manpage
has been overwritten with the polish translation. I still wonder why
nobody noticed this before. Closes: #117474
[...]
-- Eduard Bloch Sun, 28 Oct 2001 12:53:27 +0100
- this post brought to you by the Automated Last Post Generator...