How Google Will Have Achieved The Semantic Web

Heh by Anonymous Coward · 2004-08-01 09:01 · Score: 5, Interesting

Remember back when we all thought that XML was going to achieve the semantic web by making good search engines unnecessary? Now XML has gone nowhere except as a set of popular libraries for cross-language data serialization, and we're starting to talk about just making really smart search engines.

Re:Heh by primordial+ooze · 2004-08-01 09:17 · Score: 5, Insightful

Remember back when we all thought that XML was going to achieve the semantic web by making good search engines unnecessary?

Not really, and XML is still such a recent development that to say "Remember when" is silly if not outright disengenuous. I was at the SGML '86 conference in Boston where the XML initial draft was presented. That's less than ten years ago. Can you name a information technology that reached anything like its full potential less than a decade after its first mention?
Re:Heh by phats+garage · 2004-08-01 09:22 · Score: 5, Funny

I'd have to say "Microsoft Bob" peaked pretty early.
Re:Heh by KefabiMe · 2004-08-01 09:24 · Score: 5, Insightful

I RTFA. Intriguing, but it would be a huge struggle for Google to become like anything in the article. There's too much money in having the right information at the right time.

"Now XML has gone nowhere except as a set of popular libraries for cross-language data serialization..."

XML is still getting more popular and more accepted with each passing month.

The biggest issues are that there are a few monstrous companies out there that want to control the standard of how information is shared, and mutate XML into some proprietary form that their company can control.

XML is a good thing, like most standards. Standards can fall short at times, especially when the uber-companies start trying to fight for control over them. I believe that this fight for control will do more to prevent the easy transfer of data, more than any problem with XML itself.
Re:Heh by Anonymous Coward · 2004-08-01 09:35 · Score: 5, Insightful

Remember back when we all thought that XML was going to achieve the semantic web by making good search engines unnecessary?

Nope. I remember a bunch of people with no clue hyping it up as such, but anybody actually involved with XML in any technical capacity, including the creators, understood that it was simply a standardised syntax for file formats. So-called pundits jumped on each others' bandwagons in touting it as some kind of miracle, but anybody who actually knew what they were talking about wouldn't make claims about XML that you reckon.
Re:Heh by Profane+MuthaFucka · 2004-08-01 09:36 · Score: 5, Funny

Careful, they might implement their XML with just one tag, called 'data', and just stick their regular old Word documents into that as an encoded binary.

--
Fascism trolls keeping me up every night. When I starts a preachin', he HITS ME WITH HIS REICH!
Re:Heh by primordial+ooze · 2004-08-01 11:17 · Score: 4, Interesting

1986 was 18 years ago.

Well, hopefully you saw that I corrected my dates above - the initial draft was floated in 96, not 86, so it has been less than ten years.

Even if it was less then 10 years ago, 10 years is a long time in Internet land. 10 years ago most people had barely even heard of the Internet.

That actually proves my point - ten years ago most people might have barely heard of the Internet, but it had been around for 25 years (first ARPANet nodes were brought online in '69 - one can even make a case for it being 30 years if you go by the 1964 initial public proposal by the RAND Corporation.

Maybe you misspoke though and meant the World Wide Web. Well the first web browser was built in 1990, but the first working hypertext system is arguably Doug Engelbart's NLS back in the early 60s! (for which he also built the first mouse, btw. What a creative brain!)

It can take a long time for technologies to mature.

The whole idea of "Internet time" is a myth. What there is "dot.com business cycle" time, which is faster than normal, perhaps in part because of the technologies involved, but also in large part because of the naïveté (ie gullibility) of both the associated employee and investor populations.

***

In any case, the article we are discussing isn't interesting because of the specific details (XML, Google, Preident Ashcroft ) but because of the larger idea that emerging inet technologies may (continue to) radically change the nature of information exchange and commerce. It makes sense to think about and debate the possible forms that this change could take and what advantages and pitfalls might be waiting for us.
Re:Heh by mcrbids · 2004-08-01 19:39 · Score: 5, Insightful

Within ten years the DNS will have migrated to an XML format.

I've heard some RETARDED statements on /. before, but this near takes the cake. DNS using XML?

Whatever you are smoking, I want some - 'cause it's clearly some REALLY GOOD SHIAT!

Given that:

1) DNS is a protocol, not a data format, and

2) XML is a data format, not a protocol, and

3) DNS is incredibly light and efficient, and

4) DNS has already proven that it scales well to just about any size, and

5) XML offers no particular advantage, since you could serve DVD ISOs over the DNS, and

6) moving to an "XML PROTOCOL" format would require the update of every single DNS server on the face of the earth, many of which are still running Bind 8.x, and some are still running BIND 4.X for god's sake,

I consider this to be HIGHLY UNLIKELY(tm) !!!!!

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.

Google 2012: The Singularity by Anonymous Coward · 2004-08-01 09:03 · Score: 5, Funny

How Google become self-aware and took over the world.

Semantic Web by ZeroExistenZ · 2004-08-01 09:06 · Score: 5, Informative

Source

Semantic Web, proper noun

An attempt to apply the Dewey Decimal system to an orgy.

Or

The Semantic Web is a project underway that intends to create a universal medium for the exchange of information by giving meaning, in a manner understandable by machines, to the content of documents on the web. Currently under the direction of its creator, Tim Berners-Lee of the World Wide Web Consortium, the Semantic Web extends the ability of the World Wide Web through the use of standards, markup languages and related processing tools.

--
I think we can keep recursing like this until someone returns 1

Re:Semantic Web by morcheeba · 2004-08-01 09:37 · Score: 4, Funny

An attempt to apply the Dewey Decimal system to an orgy.

I just filed my orgy under 126 -- see it is useful!!

--
HIV Crosses Species Barrier... into Muppets

First post? by primordial+ooze · 2004-08-01 09:07 · Score: 5, Interesting

Very interesting ideas, but I seriously doubt that Google could (or would) try to squeeze a percentage out of every transaction performed using the hypothesized marketplace manager. That just doesn't seem to fit their modus operandi. More likely they'd give place preference to paying clients, much as they do now with the existing search pages.

But as I said, a provocative read. Metadata truly is the future.

I can see many uses for this semweb stuff by CrackedButter · 2004-08-01 09:10 · Score: 5, Funny

So, you're a small African republic in the midst of a revolution with a megalomaniac leader, an expatriate Russian scientist in your employ, and 6 billion in heroin profits in your bank account, and you need to buy some weapons-grade plutonium.
Who does it for you?
Google Personal Agent
Now there's innovation and balls in one sentence! I take it the War on terror is won in 2009 or these sorts of semweb transaction become the norm. How *could* Amazon and Ebay compete when it comes to selling nuclear weapons?

--
Jonathanjk.com

Slashdot purpose by someguy456 · 2004-08-01 09:11 · Score: 4, Insightful

So I guess since ./ couldn't handle the past, and is failing miserably with the present, it will now resort to fortune-telling?

Editors, could we at least keep the dupes down? :)

--
Robert Bindler
A Computer Science student's views on technology.

But none of this ever happened by Halcyon-X · 2004-08-01 09:13 · Score: 4, Insightful

because all of the patents to do so were tied up between various companies that didn't want to cooperate with each other.

--

.sig: Open Source, Open Mind

Who wrote the article? by CrackedButter · 2004-08-01 09:14 · Score: 4, Funny

Hari Seldon?

--
Jonathanjk.com

old article.... by TheClam · 2004-08-01 09:15 · Score: 5, Informative

Anyone else notice that this is from July 26, 2002?

Finance Transfer Protocol? by Scythr0x0rs · 2004-08-01 09:18 · Score: 4, Funny

Finance Transfer Protocol?

They need to think about this more.
'FTP me $25'
Then you find a 15mb top resolution scan of a couple of green bills in your /pub folder.

BB by jals · 2004-08-01 09:26 · Score: 4, Insightful

I started off reading this and gradually got quite excited by the ideas presented.

About half way through I mistakenly thought I was reading an online copy of 1984.

The benifits of this happening sound fantastic. It just sounds very cool for everyone to be connected like that - which is what scares me even more. Here is an absolutely huge privacy concern; and it has me totally excited about the prospect of it happening.

Sorry to go slightly off topic, but it's things like this that worry me a lot, that a possible 1984 scenario could disguise itself so well that even a person like me - who is verging on (if not already there) being a member of the tin foil hat brigade - excited by the very idea of it.

Well the idea by Anonymous Coward · 2004-08-01 09:34 · Score: 5, Interesting

didn't so much refer to XML the technology as to one of XML's proposed applications. There was a popular theory within the press when XHTML was first introduced that XML would supplant webpages and drag the web back to that primordial point when HTML was intended as a content markup language, not a display language, and even go beyond that. Supposedly we were going to wind up where stylesheets would go beyond just a mapping from XML tags to some set of HTML4 tags, and into a point where content was just a minimal set of XML-tagged text and everything about the way the site displayed was deferred to CSS-like technologies. And when this happened supposedly web browsers would be totally free to reset stuff, and we could toss out amazon.com's presentation of, say, the search results for "Michael Jackson" (as a series of paragraph-delimited links to categories (books, music, etc) to search within in a blocked-off area surrounded by amazon.com's navbars and logos, which then pointed to a series of pages containing little formatted blips of information about various items for sale presented in groups of ten separated by little gray lines in a blocked-off area surrounded by amazon.com's navbars and logos), and instead have it display as a heirarchial file browser or whatever we liked.

Well, I think it's safe to say that idea's been mostly shelved for the time being. This isn't a matter of a lack of "reaching potential", it's a matter of total failure to move in that direction. XML has been incredibly popular as a storage mechanism but has had roughly zero takeup as a communication mechanism. (There have been communication substrates, such as XML-RPC, based off of XML, but that's not the same thing.) I don't know if it's fair to assume a technology come to fruition within 8 years of being proposed, but I think it's fair to assume that unless we see some kind of signs of progress or interest in progress within 8 years, there's no reason to expect further progress within the 8 years after that.

Other points of view... by Beige+Tangerine · 2004-08-01 09:51 · Score: 5, Informative

Not necessarily all good points, but as always, it's hard to argue with "people lie" as an argument against anything:

Strong AI not required for software agents by MarkWatson · 2004-08-01 09:53 · Score: 4, Interesting

Strong AI requires grounding symbols in real world things, events, and processes.

I think that simply defining the "meaning" of words in ontologies is likely good enough for useful web-based software agents. It will take time, but with well defined ontologies, and common use of RDF using standard schemas will make a lot of cool things possible. I think that dealing with ungrounded symbols, but symbols defined and related to other symbols in a structured way, is OK.

One of the classic complaints of AI systems can be summed up with a trivial example:

Define a relation in Prolog:

father(ken, mark).

A human reader assigns their own meaning to "father", "ken", and "mark". To a prolog system, this could just as easily be:

aaa1(aaa2, aaa3).

Somewhere, on the edge of symbol-slamming systems, there has to be some connection with the real world, with our experiences.

For semantic web applications, this "edge connection" can simply be tying into symbols defined in OWL ontologies, RDF Schema, etc.

The problem is getting people to use RDF (I added RDF to my main web site years ago, but it only contains limited information).

Another problem with RDF is that there are several kluges to get it into XHTML, but that will hopefully change soon.

A good toolkit for experimenting with the semantic web is the Swi-Prolog semweb library (http://www.swi-prolog.org/packages/semweb.html/)

-Mark

My prediction for 5 years by Julian+Morrison · 2004-08-01 10:21 · Score: 5, Interesting

...RDF will be in the same category as VRML: a sexy sounding solution having long given up the search for any real problem.

Reasons:

It relies on worldwide standardized nuance-free semantic mappings, which are probably linguistically impossible for anything but the most contrived of examples.

It relies on millions of pig-ignorant dreamweaver jockeys somehow comprehending and correctly operating the above semantic mappings.

It relies on said dreamweaver jockeys bothering to do this at all, let alone correctly.

The real semantic web will involve AI spidering and parsing of human-readable web pages. It will be as inaccurate, but as useful as babelfish. It's the only answer that makes sense -- because that's where all the juicy data is.

Article bit disappointing by mauddib~ · 2004-08-01 10:30 · Score: 5, Interesting

I was actually a bit disappointed by the article. First of all: it is very hard to search in distributed knowledge networks, if not impossible. Some structures, which are a necessity to make explainable in an onthology are possible to describe, but not possible to make deductions on (some of the queries cannot be proved to finish at all). An example are meta-classes (a Chardonai wine can be an instance of the class Wines, in which case a specific bottle of wine can be an instance of Chardonai as well as a normal wine).

Second of all, the article fails to mention anything about the Ontology Web Language (OWL, see this site on W3), which has become an official specificion of W3C since May this year. This language, based on RDF is much more expressive than RDF is, it also contains several 'language levels' based on the amount of complexity and decidability involved.

Last, but not least: the article is still very vague on privacy and thrustworthyness. I would think that public-private key cryptography would not do in these areas: far too many single points of failures when, for example registering. Only one user with a hacked account can derail the whole system!

I'm really interested, by the way, to speak with some people who are deep (at least above their knees) in OWL and RDF. Planning on making a study at intelligent databases and datamining.

--
This is a replacement signature.

Oh? by FlutterVertigo(gmail · 2004-08-01 10:31 · Score: 5, Interesting

Microsoft Bob succeeded, but not in the way you have expected.

Melinda Gates (nee' French) was the Product Manager of Microsoft Bob.

(just don't brag to your friends you've known that forever)

p.s. Microsoft Bob is|was one of the products (along with things such as RedHat) which Virtual PC can run successfully; so it hasn't disappeared completely. I still have a copy sitting here in one of my CD wallets. (Handed out at a Tech Ed or some other conference)

Re:well that was thoroughly frightening by MsGeek · 2004-08-01 10:50 · Score: 4, Interesting

Gematria, or the use of the numeric values of Hebrew letters as a means of interpreting the Torah, began being used around the Greco-Syrian occupation of Judea and Israel. The term is actually a corruption of the Greek term "Geometria" or the science of Geometry. The transliteration of Nero Augustus Caesar into Hebrew letters can be done in two ways: one gives the value of 666, the other 616. I don't know if Caligula Augustus Caesar works in the same way but Nero works both ways.

The book "The Revelation of Jesus Christ to John The Divine" is a very interesting one if looked at not as prophecy (anyone and anything can be worked into the fantastic account) but as historical allegory. It seems to have been written to encourage the persecuted Pauline Christians, who were persecuted with great vigor around the time the book is said to have been written. Just google on Nero and 666 and you can read some very interesting stuff.

--
Knowledge is power. Knowledge shared is power multiplied.

I don't buy it. by migurski · 2004-08-01 12:30 · Score: 4, Interesting

The article failed to mention flying cars, another no-duh prediction that seemed completely obvious, and won't happen either.

A short while ago, Cory Doctorow published an piece entitled Metacrap: Putting the torch to seven straw-men of the meta-utopia, which mentioned two very good reasons why the semantic web won't take off the same way that these articles predict: schemas aren't neutral, and there's more than one way to describe something. These are basic problems that have been hounding AI research for years, dictionary & encyclopedia publishers for centuries, and all other academics for millenia, and they aren't going to go away.

The central problem with universal metadata is that it requires informed work on the part of data creators, and it's a major pain in the ass. The OED took almost a century to create, and the first few decades were essentially wasted figuring out that dilletantes were not adequately capable of properly cataloging use of language. Even with a profit motive, good metadata is a bitch (see EBay comment in the article above).

It's like the senator's (I forget who) comment about pornography: "I can't define it, but I know it when I see it." Often, we don't know what it is we're looking for exactly, and we don't know how to describe what we've got so other people can find it except in very narrow terms. I have a few creative projects which I've released under the creative commons license and dutifully marked up with cc's provided RDF information, but all that code just says what the license is, not what the project is like in a way that's as meaningful as, for example, a music recommendation from a friend who knows your tastes. The porn industry (as usual, on the bleeding edge of information and communications technology) deals with this to some degree by having a very narrow semantic universe to describe: Search Extreme is a stupendously complete metadata set, but even it contains only a few kinds of information.

Re:Duplication of Quote? by jaltoids · 2004-08-01 12:43 · Score: 4, Interesting

All joking aside about the above comment, it raises a question about "Semantics" in general that NO meta -data system no matter how good you make it is going to address...

If Jim has a friend, your talking about an expression FROM the perspective of Jim. This expression is is given CONTEXT by Jim, it can NOT be said that Paul has a friend named Jim... Paul might not know Jim (stalker). I could poke holes in this for days but I dont't rreally have the time.

Why cant the W3C come out with some standards that are USEFUL. How about some tags for address, phone umber, geocode, so I can search on what is "local". How about some working samples that are publicly available, so people can start putting them to use.

-- RANT FOLLOWS --
I'm beginning to view t.b. lee like I view jacob nelson and Steve balmier, just another talking head throwing anything and everything out there and hoping it sticks. (picture all 3 of them on stage, Steve screaming "developers" Jacob saying "usability" and Tim going "XML")

It seems that Tim has forgotten how he got into the position that he is in, by putting something out there that WORKED (html) and letting it evolve from there. We had to live it for a while, let others take it up before we could get to a better place. It is starting to look like the W3C has over stepped it posisiton as the shepard of the web to try to be an innovator, and I don't think that is the ROLE a standards body should be assuming. I would really LIKE to see them (the w3c) stop wasting money on crap that 90% of the world is NEVER going to use, and start trying to make things more accessible to Joe average user. Most of the "standards are good and you should use them" has come from small iconic developers (zeldman anyone), while we get OWL from the w3c? How about making some of those wonderfull standards clear and accessible, how about a solid explanation of semantics for Joe average developer, who could not read through the "doccumentation" to save his life....

-- end rant ---

Funny, Amazon already won that "race"... by agilen · 2004-08-01 13:04 · Score: 4, Interesting

While that article is interesting and all, the author is pretty quick to say how Amazon didn't embrace the semantic web.

Amazon is the best (most useful) application of the theory and technology behind the semantic web that you will find anywhere right now. Granted, I don't *know* exactly how they are doing what they do, and its not a "public" interface in the way that the semantic web is envisioned, but it is a large scale implementation of knowledge management principles.

Did you ever notice that whenever you look at a book (or anything really) on Amazon, it gives you suggestions for similar books, suggestions for books that other people looked at who also bought that book, suggestions for books on topics that you have previously bought books for, etc? The semantic web is at heart a directed graph. Amazon is at heart a directed graph, too. Their graph grows every day with new knowledge based on the actions of people shopping on Amazon, and new conjectures about the relationships between products can be made by simply walking that graph, and computing the transitive closures of the statements (ie John likes the things that Mary likes, and Mary likes Jane's taste in music, so John may like the music that Jane bought).

This technology has incredible power, the ability for a machine to draw conclusions like that. Do I think that it will work the way that article thinks it will? No, not if the masses are left in charge of the metadata. It works very well for Amazon because they can control the quality of the metadata, so erroneous conjectures are not made on bad information. I don't think Google is by any means _not_ paying attention to the semantic web, but I think that Amazon is already there and has been for quite some time.

Slashdot Mirror

How Google Will Have Achieved The Semantic Web

29 of 242 comments (clear)