Slashdot Mirror


Web Page Entanglement

jason writes "tangle is a system for what we call "web page entanglement". tangle creates links between pages automatically based on how users move from one page to another. tangle proxies connect together in a peer-to-peer network for scalability: as users surf the entangled web, they are passed from proxy to proxy. Each proxy serves as an expert for a particular subset of web pages. For example, you can take a look at the entangled version of the GNU homepage as seen through a tangle proxy. tangle alpha2, the first public version, has just been released. See http://tangle.sourceforge.net for more information, or read on..."

jason continues:

"By viewing the web through a tangle proxy, you can see the connections and associations left by those who surfed the web before you. By surfing the web using tangle, you also leave behind connections and associations for others who will surf in the future.

When you exit one page and enter another (by clicking a link or performing a search), a two-way link is created between the pages. As users surf through a particular page over time, tangle keeps track of popular ways to get to the page and popular places to go next. These entry and exit links are displayed at the top of each page, sorted by popularity.

Clicking on one of these entry/exit links tells tangle that you think the link is relevant and useful (like a vote for the link) and increases the link's popularity. In other words, if a user thinks of something relevant while reading a page and performs a search for it from that page, tangle gauges how others react to that association over time.

tangle is similar in some ways to the closed-loop hypertext system Everything2, though tangle works for the web at large.

We have several tangle proxies up and running. The tangle proxy software is also available for download.

A note for the paranoid:
Though tangle keeps track of web usage patterns, the focus is not on tracking the habits of individual users, but on tracking the trends of an entire community of users. tangle is GPL'd open source [source here], so you can see for yourself: clicking a link through a tangle proxy simply bumps up the links popularity---user IP addresses are completely ignored."

21 of 176 comments (clear)

  1. isn't this done already? by Anonymous Coward · · Score: 5, Interesting

    Microsoft does something similar with their Smart Tags. That is, they modify your page without you realizing it. Only with entanglement, it's done on the server, rather than on the browser.

    Is there a way to block entanglement?

    1. Re:isn't this done already? by Mnemia · · Score: 4, Interesting

      I don't think you shouldn't be able to use mod_rewrite to alter all your URLs so people can't access things in way you didn't allow. There's nothing legally stopping you from doing that; after all, you own the server. But I do think this is unethical behavior if it is done for some reason other than security. It undermines the reason the Web is a powerful medium and not just clickable television or an electronic magazine. Linking and relinking is at the heart of a peer publishing world where anyone can put their work out there on an equal plane with the professionals and where anyone can comment, criticize, or critique the contents of other people's information.

      My view is that when you make a public website you are contributing your views and information to the massive global community of links and related information. This ecosystem feeds off of openness and places the quality of the content above marketing and branding. I think that you should be willing to accept that when you make a public website, unless you are worried you can't compete on merit.

      Basically, you're free to make whatever you want available, but you can't control what OTHER people do with that content once it leaves your site (within the bounds of copyright law, which has no bearing IMHO on the copy in the browser cache). That's the price you pay for using the Web to publish: you have to let everyone else have the same rights as you, and that includes the right to link. That's why you shouldn't use mod_rewrite to prevent deep linking, etc, though that's certainly preferable to sending out legal threats. You can do this if you want, but you're not being a responsible member of the Internet community.

  2. Wow... by dubious9 · · Score: 5, Interesting

    Brilliant, I can't believe someone hasn't come up with this before. It reminds me of the traveling salesment implementation that models the way ants work. Most ants go the way most ants go, everyonce and a while some ants stray to find a better path.

    If this isn't abused by users, I see the net becoming much more efficient for searching for information. You won't have to wait for the search engines to catch up while looking for the most popular page on a topic, because the best (or should I say most popular) pages on a topic will automatically link to each other based on user flow.

    Am I missing something here, or am I right in thinking this will revolutionize the way we surf (that is if enough sites do it.)?

    --
    Why, o why must the sky fall when I've learned to fly?
    1. Re:Wow... by wurp · · Score: 3, Interesting

      That's what I thought about crit.org in all of its incarnations. crit.org is a decorating proxy, like the entangler. But instead of tracking linking, it let you mark up web pages to make corrections, suggest links, request clarifications, etc. I used it for a while, then I used the ThirdVoice toolbar which did the same thing but was proprietary. AFAIK, virtually no one else used it. Even on the sites associated with the creators, it was rare to find anyone posting or get a response to your issues.

      Until there's a plugin you can put in your browser so that every page you visit is automatically viewed through these decorating proxies, they won't revolutionize anything. : (

    2. Re:Wow... by Anonymous Coward · · Score: 2, Interesting

      Actaully, there are algorithms for solving problems related to the traveling salesman problem that model the way ants work.

      Read more here:
      http://iridia.ulb.ac.be/~mdorigo/ACO/about. html

      or here:
      http://citeseer.nj.nec.com/421147.html

      Ironically, you could have typed four words into Google and understood what he was referring to, rather than typing in several dozen insulting him unfairly.

    3. Re:Wow... by dubious9 · · Score: 4, Interesting

      -1 Ignorant. I happen to have a degree in CS.

      I may not have gotten the exact idea down, but yes a very good approximative traveling salesman algorithm is based on ant behavior.

      Do some research here for some undergrads that used the idea learned from here(pdf)

      (Which are link i got from a two minute perusal on google for "traveling salesman ants")

      Please have an idea what you are talking about next time.

      Here's the abstract from the latter source.

      We describe an artificial ant colony capable of solving the traveling salesman problem (TSP). Ants of the artificial colony are able to generate successively shorter feasible tours by using information accumulated in the form of a pheromone trail deposited on the edges of the TSP graph. Computer simulations demonstrate that the artificial ant colony is capable of generating good solutions to both symmetric and asymmetric instances of the TSP. The method is an example, like simulated annealing, neural networks, and evolutionary computation, of the successful use of a natural metaphor to design an optimization algorithm.

      --
      Why, o why must the sky fall when I've learned to fly?
  3. Slippery Slope? by moronga · · Score: 5, Interesting

    If the more popular links are shown first, doesn't it just reinforce their popularity? Once a link becomes popular, is there any way to vote it down?

    1. Re:Slippery Slope? by mikeatzelea · · Score: 2, Interesting
      Reinforcing popularity . . . It works as sort of an amplifier, doesn't it? As such, it is prone to positive feedback (that high-pitched whine you hear from PA systems). People will click on links, just because other people clicked on them.

      Does it add useful information, about a given page, that will be 'heard' above this noise? If there are two links, one of which is brightly lit up, but useless, and the other obscurely positioned, but useful, then which will be the most popular, with or without entangling?

  4. Trusting what you read. by clunis · · Score: 5, Interesting

    Excluding mutually authenticated ssl sessions, how can I trust that the document I'm reading is the document I tried to download? The tangle service is already modifying the page to add its navigation links, so why not change the content too ( e.g. remove content that users might find offensive, replace ads on popular pages with ads that you've sold, change links to documents you host, etc. )? The same really goes for any proxy or cache service, and I'm not accusing these good people of doing this, but how do we protect ourselves from services that would as more of them appear?

  5. Re:Woot by Anonymous Coward · · Score: 1, Interesting
    tangle is similar in some ways to the closed-loop hypertext system Everything2, though tangle works for the web at large.
    And E2 is just the plain ole concept of two-way links.
  6. New information by Catskul · · Score: 4, Interesting


    If this caught on, I can imagine that it might be possible that people would tend to depend on it. It seems that information would become stagnate and new information ignored since nowone would have exited to it initally. Then again, maybe not. Just a thought.

    --

    Im not here now... Im out KILLING pepperoni
  7. name recycling by Anonymous Coward · · Score: 1, Interesting

    Tangle is also the name of a literate programming utility by Donald Knuth. Along with WEB.

  8. I like it the way it is by mao+che+minh · · Score: 4, Interesting
    Call me old fashoined, but I really like the way that it works now. I like browsing the web, page by page, without having my surfing and the surfing of others being influenced by the content's popularity. I enjoy having many different outlets for the searching of information that retrieve information and "rank" it by a variety of ways (and many search engines using different means in which to "rank" it).

    Don't get me wrong though, this is a very creative and useful thing. For example, this would be extremely useful for searching through technical support knowledge bases or for a large company's document archive system. I would just rather they leave my web surfing alone. ;)

  9. What if.. by Anonymous Coward · · Score: 1, Interesting

    A major backbone provider sets up automatic proxy redirection for http traffic, and uses proxies like this to gather links between pages, and with that, create a better search engine than google?

  10. heheh by aliens · · Score: 2, Interesting

    I just added something along those lines to my website. I agree it's a cool idea. Of course mine is way more simplistic ::P

    --
    -- taking over the world, we are.
  11. meaning... by Anonymous Coward · · Score: 1, Interesting

    no slashdotting anymore? is that it?...

  12. Re:Tangleless P2P Web by foniksonik · · Score: 5, Interesting

    Is anyone working on a personal P2P portal? Seems like an extension of what you're talking about. What I see is software which works like a webserver but is local and accessed P2P. Instead of DNS you use the P2P model to direct traffick and search for content, whether it is files or html/web media. All you'd need is a renderer (think gecko) hooked in to parse html, etc. to the peer who is browsing your site. This of course could also serve up blogs or calendars or whatever other types of web services you wanted to offer to your peers.

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
  13. oh boy by Anonymous Coward · · Score: 1, Interesting

    this sounds like another tool for aome Farking asshat advertiser to use...Sorry things are bad enough as it is out here without this BS. I imagine a week befoerr the source has been modified and in use by someone to track and record every piece of infor they can get their grubby hands on...

  14. interesting .. but is it effective? by Dr.+Awktagon · · Score: 5, Interesting

    I'm reminded of the idea of leaving your campus grounds unpaved, and then waiting for the "natural" grooves to appear in the ground where people walk, and then paving over those to make the sidewalks. You've probably seen an example of where there's a sidewalk connecting two points but then there's a worn-out groove nearby that's better, or connects from a more popular location.

    Some people think it's rude or immature for people to create these grooves by not walking on the sidewalk, but I see it as an example of an arrogant designer who thinks he knows the best way simply by studying a piece of paper. It's amazing sometimes, the groove just appears almost magically in an optimal place, given the layout of buildings and traffic patterns.

    This applies to web pages too. But, unlike sidewalks and buildings, you can't see your other destinations when you're sitting on a web page, so how do you know where to go next? This seems like it will just constantly reinforce the previous set of links, whatever they are.

    I didn't fully read the documents (/. strikes again) but what I saw says you move from page to page either by 1) following an existing link or 2) using a search function. #1 is not going to create fresh paths.

    It seems to me, a better idea would be to present a user with all possible links, or a subset of possible links, the first few times they visit. Then as they click through the site, add their arcs to the database.

    After the first few visits, you can stop showing all links, and show them the "most popular" links. If you just show the popular links up front, new paths may not be discovered.

    So perhaps this technique could be seen as a way to remove unpopular links, to trim the fat from a page. Then again, it might not be good to change a page after a person has gotten used to it.

    It's very interesting though. As the web matures, you'll see more of this sort of analysis to move beyond static web pages.

  15. hosts? by NewWaveNet · · Score: 2, Interesting

    So what happens when someone adds a line in their hosts file for gnu.org that points to a local server, adds a link to a modified version of the site with a link of their choice and clicks it?

  16. How utterley useless. by almaw · · Score: 2, Interesting

    People will naturally click on the top-ranked link(s) on a page in the hope that they're useful. If they're not, you've just voted for them, making them even higher ranked.

    Google has a much better method for this - it looks to see how many links there are on the web at large to a page. People don't tend to link to stuff unless they like it. Although it's open to some abuse, it's a much better solution.