The Semantic Web Going Mainstream

← Back to Stories (view on slashdot.org)

The Semantic Web Going Mainstream

Posted by ryuzaki0 on Monday October 29, 2007 @02:53AM from the no-not-the-virus-company dept.

Jamie found a story about a new web tool that is trying to break ground into the semantic web. It's called twine, and it supposedly will intelligently aggregate your data, be it youtube videos, emails, or whatever you accumulate in your travels. Not the first, not the last, but here's hoping something comes out of the ideas someday.

19 of 110 comments (clear)

Min score:

Reason:

Sort:

Sorry, but it's not for me. by AltGrendel · 2007-10-29 03:01 · Score: 4, Insightful

Twine is a website where people can dump information that's important to them, from strings of e-mails to YouTube videos.
I really don't like this idea. One good hack from the Russian MAFIA and the game would be over. All your eggs are belong to us, as it were.

--
The simple truth is that interstellar distances will not fit into the human imagination
- Douglas Adams
1. Re:Sorry, but it's not for me. by Colin+Smith · 2007-10-29 03:32 · Score: 3, Insightful
  
  Mmm. Nah. The Semantic Web is most useful to remove humans from the loop completely. i.e. When Skynet wants to know which batter had the most RBIs in 1997 it will be able to understand from the XML DTD what a batter is and how that relates to RBIs...
  
  So... What's an RBI then when it's at home?
  
  --
  Deleted
2. Re:Sorry, but it's not for me. by CRCulver · 2007-10-29 03:53 · Score: 2, Insightful
  
  Anyone that uses the word 'semantic' with 'web' should be pointed at and laughed at, then perhaps hit in the face with a brick. Keep trying, marketeers, you'll find a new way to game your hits and get another easy payday sooner or later.
  
  The driving force behind semantic web research is Sir Tim Berners-Lee, hardly a marketer just trying to get rich from buzzwords. He's an academic, and it's precisely in academic circles that the semantic web is already a reality. Just see Visualizing the Semantic Web ed. Geroimenko & Chen (Springer-Verlag, 2nd ed. 2005). It shows several projects where the concepts of the semantic web were used to great effect.
  
  If anything, marketers have been staying away from the semantic web, since developing semantic web technologies requires the sort of learning and practising that costs money, when employees could just trot out the same FUBARed markup and table-based layouts that they are already used to.
3. Re:Sorry, but it's not for me. by Tim+C · 2007-10-29 04:51 · Score: 2, Insightful
  
  Or for that matter... Wouldn't it be easier for the Russian Mafia to hack your average unsecured windows computer and blow away your data that way?
  But why would they?
  
  I'm insignificant, a nobody, and no target at all. A site used by hundreds of thousands or even millions, on the other hand - now that's a target. A well-defended target you'd hope, but a target nonetheless.
  
  --
  It's official. Most of you are morons.
Terms and Conditions by Mateo_LeFou · 2007-10-29 03:04 · Score: 2, Insightful

"access and use the Site and electronically copy, (except where prohibited without a license) and print to hard copy portions of the Site Materials for your informational, non-commercial and personal use only"

Can't use their service for commercial purposes; how mainstream can it be?

--
My turnips listen for the soft cry of your love
Re:Relevant by Anonymous Coward · 2007-10-29 03:24 · Score: 2, Insightful

u mean matrix lovin kids:

Here's a futuristic tailored smeantic search example!
Semantic search:#> sem.search s1 = Kung_Fu +online_course +display=practical Layout +embedded=video +moves=kicking +armchair=relaxed_position + muscles.linked(search(0)) + 3d.harness=on +holographic.image_projector=on +environment=O2; s1;
"Fighting the hype problem"? by Red+Flayer · 2007-10-29 03:33 · Score: 4, Insightful

FTA (second page):
It's still too early to know if Twine will be successful with consumers, says Tony Shaw, president of Semantic Universe, an organization committed to raising awareness of semantic technologies in business and consumer settings. Success will not simply depend on making the technology work, but also on managing people's expectations of the technology, he says. "It's about fighting the hype problem."
Hmm. Let's fight the hype problem by publishing more hype. And maybe if we include a statement saying we're fighting hype, people will assume this reformatted press release isn't hype.

Sure, I understand that managing expectations is important, but let's not lose sight of what this article really is.

--
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
1. Re:"Fighting the hype problem"? by YGingras · 2007-10-29 05:20 · Score: 2, Insightful
  
  Let's fight the hype problem by publishing more hype.
  Of course, it builds hype resistance. Historical evidences show that it worked for IPv6 and Duke Nukem Forever.
Clearly some new meaning of semantic Web here by clickclickdrone · 2007-10-29 03:37 · Score: 5, Insightful

Unless I've missed some whole new sub-branch, semantic web to me means marking it up properly to give meaning to the various page elements via correct tags and microformats. This is just an overgrown agregator.

--
I want a list of atrocities done in your name - Recoil
Re:Attack of the Misunderstood Acronyms! by MrMunkey · 2007-10-29 03:44 · Score: 2, Insightful

Even better... On their site they say

Twine is one of the first mainstream applications of the Semantic Web, or what is sometimes referred to as Web 3.0.

http://www.twine.com/about and there's a great section about Web 3.0 here

It's great for a laugh... until you realize that by this time next year we'll probably be on Web 10.0
I call BS by abes · 2007-10-29 03:47 · Score: 2, Insightful

While I am a fan of the "esoteric field of machine learning", as the article mentions, I am also well aware of the countless of disappointments so far (thus no AI..). There have been many designs that can tackle toy problems, but nothing yet that has been able to handle large corpuses of text so far. The big problem being is that to really be able to do proper categorization the program must understand what it's reading. Which, again, requires some type of intelligence.

While methods are available to do categorization based on either static or learned heuristics, they are less than perfect (think about Safe Search in Google images -- it works decently, but definitely not perfectly). In fact, just parsing a single English sentences can be a difficult task for computers (if the sentence doesn't fall into a context free grammar). So the best we can probably hope Twine to do is categorize based off of word frequency (okay, they probably use some higher order stats).

Whenever I read about a new semantic technology, I always think of Wordnet (developed by Miller, who is the same guy responsible for the study showing we can remember 5-7 digits). Wordnet was developed as a database for the hierarchy of all words. Words are defined by their relationship to other words.

While it's a great idea, and useful for some projects, it also far from perfect, as words do not in the end have a static relationship to each other. The semantic web in the end relies on a static relationship between words (either through common usage or through a relationship through words).
I'm already using the Semantic Web by progprog · 2007-10-29 04:01 · Score: 5, Insightful

Let's see if it works on Slashdot.
Goofy project by wytcld · 2007-10-29 04:05 · Score: 4, Insightful

It's well-known in linguistics and philosophy that "You don't get semantics from syntax." It's well-known in computer science that computers are syntactical. It's well-known in recent business history that all startups claiming they'd produce "expert systems" or "artificial intelligence" in which computer systems would, despite these accepted truths, perform semantic feats have miserably failed to live up to their claims.

So why don't we give PR puff pieces like this the same warm reception we give to the latest announcement of a perpetual motion machine? It's the kind of project only plausible to those who know very little of the basic background well-accepted by experts in the pertinent adjacent fields. That one or two big names from the success of the syntactical www either aren't familiar with or don't accept core knowledge from linguistics and philosophy of language is finally no different than Thomas Edison working for years on a machine to talk to ghosts: brilliance in one area most often doesn't translate into other areas in which you have no background - and even more rarely into areas where nobody knows how it would be done.

--
"with their freedom lost all virtue lose" - Milton
1. Re:Goofy project by Lodragandraoidh · 2007-10-29 05:24 · Score: 2, Insightful
  
  Humans don't even get semantics right consistently. In many cases there is no one 'right' meaning for any given collection of symbols. It all resides within the human skull, and is ever changing over time - and is reflected in how languages and symbology morph through the centuries.
  
  There have been various attempts to tame the semantic beast - formalized hierarchies being the most successful in conjunction with the advancement of scientific thought, and more recently less formalized meta-tagging systems. In these systems that seem to work best the human is involved in providing the meaning in terms a computer (or other humans) can understand: lists of keys/pointers to other lists ad infinitum. Of course there is always that undefineable exception that breaks such simple systems (e.g. the Platypus).
  
  Reality is an ever changing and evolving continuum - and quantum physicists would probably take issue with that.
  
  --
  
  Lodragan Draoidh
  The more you explain it, the more I don't understand it. - Mark Twain
2. Re:Goofy project by jpfed · 2007-10-29 05:34 · Score: 3, Insightful
  
  It's well-known in linguistics and philosophy that "You don't get semantics from syntax." That's right- we get semantics from interpretation.
  So why don't we give PR puff pieces like this the same warm reception we give to the latest announcement of a perpetual motion machine? Because the right syntax can give to a computer very helpful clues towards productive interpretations. Data- which is just "syntax"- helps to drive computers to more effectively interpret other, related data all the freaking time. That's not what kills the semantic web idea.
  
  What kills the semantic web idea is that all the millions of individual producers of data don't have any immediate incentive to mark their own data up for the benefit of others.
3. Re:Goofy project by maharg · 2007-10-29 07:10 · Score: 2, Insightful
  
  .. and 640k ought to be enough for anyone ! lol
  
  Artificial Intelligence is a very different field from Semantic Web. The technology for SemWeb is here now, AI is still a ways off, I will admit.
  
  --
  
  $ strings FTP.EXE | grep Copyright
  @(#) Copyright (c) 1983 The Regents of the University of California.
Re:Relevant by Arthur+B. · 2007-10-29 04:06 · Score: 2, Insightful

The search engines are currently still mostly syntaxic. Look for a word, see pages matching that word, in a more or less relevant order... This means you have to play trick games with search engines in order to find what you want...

Imagine you could simply query things like: Find me an appointement with a dentist that takes my insurance, has good ratings and lives near where I live. From your personal information (your calendar, where you live), public information (consummer ratings on the dentists, maps, information from de dentist office, from your insurance etc) a semantic web search engine could provide you with an answer.

All it takes is for the data published on the internet to be *structured*

--
\u262D = \u5350
Re:No! Load More On! by darthflo · 2007-10-29 05:48 · Score: 2, Insightful

Maybe they can integrate it into Web 2.0...
You got it all wrong. Web 2.0 is "You make all the content, they get all the revenue". Web 3.0 is going to be The Semantic Web. Web 4.0 will, like Winamp 4.0, be skipped in favour of Web 5.0 where users provide the content, search engines look at the ads while grabbing the content and returning it, processed and summarized, to said users. This will also perfectly integrate with GWEI and similar projects for other search engines!
So Gmail is better? by techbiz108 · 2007-10-29 07:24 · Score: 2, Insightful

Hold on so you are saying that any hosted service is unsafe then? What about all the people who use hosted email, or hosted collaboration, or hosted file servers? Sure if a hacker gets into anything it's unsafe. Heck even enterprise software that is locally hosted is at risk. Geez, if we're that terrified, let's not even use computers or the Internet at all then. Twine is no more at risk than Gmail, Facebook, Salesforce or any other online service that holds information that is not all public. Get real.