Google Acquires Metaweb

← Back to Stories (view on slashdot.org)

Posted by Soulskill on Friday July 16, 2010 @08:33AM from the nothing-to-do-with-neal-stephenson dept.

eldavojohn writes "A startup called Metaweb (looks like an ontological, entity-based approach to Web 2.0 tagging) has been acquired by Google. You can find out what they're about from a super marketing fluff video they put together. The neat thing about Metaweb is that the database of entities it has is free. Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?"

51 of 63 comments (clear)

Min score:

Reason:

Sort:

Silly Logic by Monkeedude1212 · 2010-07-16 08:35 · Score: 5, Insightful

Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?"
If Metaweb doesn't work at Google's Scale, then it couldn't compete with them.
1. Re:Silly Logic by Svartalfar · 2010-07-16 09:09 · Score: 1, Insightful
  
  This wouldn't be the first time a large company bought a start up to prevent it from possibly ever becoming a competitor though. Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
2. Re:Silly Logic by jbell730 · 2010-07-16 09:36 · Score: 1
  
  Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
  See also: Microsoft.
3. Re:Silly Logic by Monkeedude1212 · 2010-07-16 09:50 · Score: 3, Funny
  
  I'm not sure how you got from "Machines automatically creating and ranking indexed entities" to "Destroy all humans" but I'm sure it involved an illegal substance.
4. Re:Silly Logic by Jurily · 2010-07-16 09:52 · Score: 3, Funny
  
  I'm not sure how you got from "Machines automatically creating and ranking indexed entities" to "Destroy all humans" but I'm sure it involved an illegal substance.
  It's the AI that was supposed to index 4chan.
5. Re:Silly Logic by nobodylocalhost · 2010-07-16 10:10 · Score: 1
  
  Simple, currently humans are integrated in this system. But we will eventually be phased out in favor of better performing and more efficient machines that don't generate as much repeated junk data (aka social media). It's a natural upgrade process.
  
  --
  Where is the "Ignorant" mod tag?
6. Re:Silly Logic by Monkeedude1212 · 2010-07-16 10:19 · Score: 1
  
  But the system is FOR humans - there'd be no need for a system without humans - I think thats the part of your theory that fails entirely. AI cannot be self serving.
7. Re:Silly Logic by yppiz · 2010-07-16 20:01 · Score: 1
  
  This is a good point. I don't know if this was a factor in Google's acquisition, but Powerset (acquired by Microsoft and now part of Bing) uses Metaweb's Freebase.
8. Re:Silly Logic by RockDoctor · 2010-07-18 05:48 · Score: 1
  
  Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
  If you didn't "see it coming" in five years from now, then you either didn't know of it's existence today, or you made an error of judgment today, or you forgot tomorrow what you knew of today. In which cases you still deserve to die (in a corporate sense) for being incompetent.
  Sounds like this company/ idea has been around a while and Google have decided that their ideas/ technologies/ implementations are worth a good, close look at. Which, to be fair, is what the Evil Empire of Redmond are good at too.
  The interesting question really is whether Google will squash them, incorporate them in their behind-the-scenes work, or make their technologies and ideas publicly usable. Time, as they say, will tell.
  
  --
  Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
Didn't see it coming. by pushing-robot · 2010-07-16 08:37 · Score: 5, Funny

Everyone was thinking Google would take over the Web, and here they skip right past it and acquire the Metaweb.
Well played, Google, well played.

--
How can I believe you when you tell me what I don't want to hear?
1. Re:Didn't see it coming. by Monkeedude1212 · 2010-07-16 08:41 · Score: 1
  
  Before you know it, they'll move on from the web and acquire the mesh!
2. Re:Didn't see it coming. by IAmGarethAdams · 2010-07-16 08:44 · Score: 5, Interesting
  
  I've been using Freebase integrations on a couple of sites, and the possibilities Freebase already offers for rich metadata integration is HUGE.
  For example, a couple of their simple API samples are a list of Police songs from the Synchronicity album, ordered by track length, or Graduates of Stanford born since 1960 who are board members of companies
3. Re:Didn't see it coming. by Spazntwich · 2010-07-16 09:50 · Score: 1
  
  Holy shit that is cool.
  I wonder if investigative reporters will be able to utilize this. Datamining for the little guy.
4. Re:Didn't see it coming. by Joe+Tie. · 2010-07-16 10:15 · Score: 2, Insightful
  
  Reading over all of this, I've been wondering what the hell metaweb is. Your couple sentences explained it better than the pages of text leading up to it by others. Showing, in this case, seems far better to telling in order to properly describe it. And holy shit is that awesome!
  
  --
  Everything will be taken away from you.
5. Re:Didn't see it coming. by Anne_Nonymous · 2010-07-16 10:18 · Score: 2, Funny
  
  Once you start freebasing, you just can't stop.
6. Re:Didn't see it coming. by IAmGarethAdams · 2010-07-16 10:37 · Score: 1
  
  Well, Freebase is just an application of the Metaweb technologies. However, the storage and organisation of data (which is what the core of Metaweb is geared around) is useless without any means of retrieval
7. Re:Didn't see it coming. by Frankie70 · 2010-07-16 15:08 · Score: 1
  
  Everyone was thinking Google would take over the Web, and here they skip right past it and acquire the Metaweb.
  Well played, Google, well played.
  Well, they could have got the web for free - http://www.free-web.org/
8. Re:Didn't see it coming. by yppiz · 2010-07-16 20:12 · Score: 1
  
  Freebase makes their data available as free CC-licensed data dumps. You can import this into any database you want. There's no requirement for Metaweb's technology to use the data. It's just a very convenient way to do so!
I never metaweb by abbynormal+brain · 2010-07-16 08:41 · Score: 2, Funny

i didn't' like.

--
L'esperienza de questa dolce vita (The experience of this sweet life) - Dante Alighieri, The Divine Comedy
Re:"Ontological" is a synonym for failure. by Itninja · 2010-07-16 08:43 · Score: 1

I always thought "Ontological" was a synonym for a useless philosophy degree.

--
I judt got a nre Kinesis keybiartf so please excusr ant egregiou typos.
Rehab by masterwit · 2010-07-16 08:45 · Score: 4, Funny

Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?
Wrong and wrong, you see Google is freebasing now:

The web isn’t merely words[, or water-soluble,] it’s information about things in the real world, and understanding the relationships between real-world entities...
Sometimes you have to give it a good ole "smoke-test" to see the possibilities...Google should be careful though, the path they have chosen is a slippery slope!

--
We should start a new Slashdot and return control to the geeks. It actually wouldn't be that hard to get some users to
For a web 2.0 company by TyFoN · 2010-07-16 08:47 · Score: 1

They sure have an ugly web page.
1. Re:For a web 2.0 company by Tumbleweed · 2010-07-16 09:49 · Score: 1
  
  For a web 2.0 company ... They sure have an ugly web page.
  Okay, two jokes come to mind right away:
  1) That's why _Google_ bought them!
  2) You already said 'web 2.0'; you don't need to say 'ugly' when you've said that.
Palantir by Squib · 2010-07-16 08:48 · Score: 1

Looks like this may be a way to make a play for competition in homeland security and business support, like Palantir has done plus medical data tracking, and other possible extrapolations
I'm fairly sure it's not going to be used for just generating websites.

--
First winter rain-
even the monkey
seems to want a raincoat.
-Basho
Expanding reach by ceraphis · 2010-07-16 09:00 · Score: 2, Interesting

Slowly but surely google continues to acquire startups and expand their business. Not that I mind it that much in Google's case but isn't this the type of thing that Microsoft or AT&T eventually got hammered for?

Legitimately wondering if Microsoft and AT&T did it much more dastardly or if there's no significant comparison whatsoever.
1. Re:Expanding reach by thoughtsatthemoment · 2010-07-16 09:45 · Score: 2, Interesting
  
  In this case, Google is trying to enhance their core business that is search. The way we search on the internet is still quite primitive and it's also some kind of brute force. I bet all search engine providers are working on making their engines more intelligent and the result will ultimately decide which one will be the last one standing.
I hope they keep working on Gridworks by xenocide2 · 2010-07-16 09:03 · Score: 1

One of the challenges with generating and using data sets is cleaning them up. Data entry errors, OCR failures, conflicts between multiple sources, etc. make it a pain to search and summarize data. Gridworks helps me hunt down bad records and normalize fields. If it keeps improving, people might start using it before publishing their crap data.

--
I Browse at +4 Flamebait
Open Source Sysadmin
Re:"Ontological" is a synonym for failure. by Fnkmaster · 2010-07-16 09:04 · Score: 1

The problem is that nobody wants to express information through RDF tuples and ontologies. Instead, they express information in human-readable text, with structural and visual markup. Search technology has come very far in terms of figuring out what information we actually want, with things like personalization, disambiguation (see DuckDuckGo for example), shopping/product search, and so on. All this stuff can be teased out of traditional web content with far less effort than trying to get every company and individual to express information through formal ontologies, etc.
So yeah, basically, the goals and use cases of the semantic web fall into two broad categories 1) stuff that data mining or search can do now and 2) stuff that requires hard AI or tons of human labor and thus won't be happening any time soon. This is why "ontologies" have become synonymous with fail.
Re:"Ontological" is a synonym for failure. by infinitelink · 2010-07-16 09:05 · Score: 2, Informative

"Ontological" is an essential adjective for describing different aspects of knowledge (science); ontology for ordering it.

--
Intelligent idiots are we. | Evil men do not understand justice.
Re:"Ontological" is a synonym for failure. by LarryRiedel · 2010-07-16 09:12 · Score: 4, Insightful

stuff that requires hard AI or tons of human labor and thus won't be happening any time soon.

Wikipedia.
Freebase by Haffner · 2010-07-16 09:17 · Score: 1

Was anyone else amused by this? (RTFA)

--
"Going to war without the French is like going deer hunting without your accordion." ~General Norman Schwarzkopf
Something Alta Vista had Google does not... by wowbagger · 2010-07-16 09:32 · Score: 5, Interesting
In a way, I miss Alta Vista, in that they had a few things that Google does not:
- NEAR operator (require the phases occur close in the page, which helped to eliminate the "pile of unrelated stuff" pages)
- proper Boolean operators in the search, with arbitrary complexity (e.g. "((pre-emergent OR preemergent) AND herbicide AND liquid) AND NOT gluten")
- and the thing that makes this post on-topic: Alta Vista had a search mode where-in you could refine your search by it presenting a set of additional search terms that helped qualify the meaning of what you searched for.
Say you searched for "wine", and activated that mode. It would present you with some possible extra terms you could search on, such as "white", "red", "tannic", "windows", "microsoft", "emulator".
Were you to be searching for the fermented beverage, you could select "red", "white", "tannic" and so on.
Were you searching for the ABI adapter package, you could select "windows", "Microsoft", and "emulator" (which yes, Wine is NOT...)
I'd love to see Google add that sort of refinement, ideally "learning" what sorts of terms go with what (Wine + tannic = beverage, wine + OLE = software).
--
www.eFax.com are spammers
1. Re:Something Alta Vista had Google does not... by morcego · 2010-07-16 09:42 · Score: 1
  
  I wish they would just allow us to use regular expressions and be done with it ...
  
  --
  morcego
2. Re:Something Alta Vista had Google does not... by IAmGarethAdams · 2010-07-16 09:57 · Score: 1
  
  Well, that might work if indexes were stored as full text representations of a string
3. Re:Something Alta Vista had Google does not... by gilleain · 2010-07-16 10:00 · Score: 3, Interesting
  
  I wish they would just allow us to use regular expressions and be done with it ...
  There's a good reason why not - because of regex DDOS with people inputting "N(o|oo)" to match "Nooooooo....ooooo!" (or similar).
4. Re:Something Alta Vista had Google does not... by omar.sahal · 2010-07-16 10:14 · Score: 1
  
  Alta Vista had a search mode where-in you could refine your search by it presenting a set of additional search terms that helped qualify the meaning of what you searched for.
  This is in Google! I typed wine and if gave me a whole bunch of choices, like wine tasting and so on. Turn on JavaScript and it should work
5. Re:Something Alta Vista had Google does not... by Joe+Snipe · 2010-07-16 10:33 · Score: 2, Informative
  
  http://www.google.com/advanced_search?hl=en
  
  --
  Sometimes, life itself is sarcasm...
6. Re:Something Alta Vista had Google does not... by wowbagger · 2010-07-16 10:39 · Score: 2, Insightful
  
  It's not quite the same. The Alta Vista approach grouped the tags - it would have grouped "tasting" with "red" and "white", while grouping "OLE" and "DirectX" in a separate grouping. Moreover, it was smart enough to use that grouping to allow you to select the whole group.
  Thus, Alta Vista was better able to detect that sometime "wine" means a beverage, and sometimes software, and that the two concepts are different.
  Google still has trouble understanding that the fermented liquid and the software aren't the same thing - it just throws a bunch of other search terms at you.
  
  --
  www.eFax.com are spammers
7. Re:Something Alta Vista had Google does not... by martin-boundary · 2010-07-16 12:28 · Score: 1
  
  That's easy to fix by putting in a hard limit on the size of the returned text (yes, that breaks the principle of having a general regex). The real difficulty is efficiently building and maintaining an index that allows actual regex searching, such as (variations on) a suffix array. Even for a small document collection, that sort of thing eats up a lot of resources.
8. Re:Something Alta Vista had Google does not... by Jimmy+King · 2010-07-16 13:58 · Score: 1
  
  Ah hah! Thank you! Every couple of months I find myself trying to remember which search engine used to offer the proper, complex boolean search functionality. Occasionally I even make the rounds through all of the old search engines (the ones that haven't just become aggregators of other search engines, anyway) and give it a try in case I can stumble upon it. I guess I'm out of luck on that.
9. Re:Something Alta Vista had Google does not... by Hurricane78 · 2010-07-18 07:28 · Score: 1
  
  If it is known, it can be prevented. If it can be prevented, your argument is invalid.
  
  --
  Any sufficiently advanced intelligence is indistinguishable from stupidity.
From the early days to acquisition! by yppiz · 2010-07-16 09:44 · Score: 1, Interesting

I was on the founding team at Metaweb when we spun out of Applied Minds. I can answer some questions here, but first I wanted to congratulate the team that brought this company all the way to acquisition.
So, from the beginning we knew that semantic this and ontology that would be a non-starter for most contributors from Planet Earth. While Freebase is a complex system under the hood, the user interface makes contributing data to an existing type (schema) pretty easy. You can add content from a browser window and never know that all of your entries are typed by the system. You can upload a spreadsheet of data and not have to do anything more than say which column is linked to what field in Freebase.
My startup, 24 Hr. Diner, uses Freebase to demo our artist to artist recommendation engine, Jukebox. We have recommendations for 100k artists, and for each of them, we can look up their genre info and photo on Freebase without having to maintain all of that data ourselves.
And if anyone on Slashdot is working for a co. that could use an excellent recommendation engine that handles music, videos, and general web content, ping me!
1. Re:From the early days to acquisition! by beakerMeep · 2010-07-17 02:41 · Score: 1
  
  Pinging ganymede.cs.brandeis.edu [129.64.2.21] with 32 bytes of data: Reply from 129.64.2.21: bytes=32 time=24ms TTL=47 Reply from 129.64.2.21: bytes=32 time=24ms TTL=47 Reply from 129.64.2.21: bytes=32 time=26ms TTL=47 Reply from 129.64.2.21: bytes=32 time=25ms TTL=47 Ping statistics for 129.64.2.21: Packets: Sent = 4, Received = 4, Lost = 0 (0% loss), Approximate round trip times in milli-seconds: Minimum = 24ms, Maximum = 26ms, Average = 24ms
  /Sorry, dont actually need but this is slashdot I had to ping you
  
  --
  meep
2. Re:From the early days to acquisition! by yppiz · 2010-07-17 20:43 · Score: 1
  
  Pong!
Re:"Ontological" is a synonym for failure. by thoughtsatthemoment · 2010-07-16 10:11 · Score: 1

Well, a thousand years ago computer science would also have been a synonym for a useless philosophy degree.
Re:"Ontological" is a synonym for failure. by mandelbr0t · 2010-07-16 10:34 · Score: 1

This is why "ontologies" have become synonymous with fail.
So you're saying that Google bought a failure to save the rest of the world from it? It's the "tons of human labor" part that becomes the issue; it's bad enough trying to teach a human about semantics, let alone a pedantic automaton. Wake me up when an AI can disambiguate without me spending 45 minutes explaining the basics of English language.

--
"Please describe the scientific nature of the 'whammy'" - Agent Scully
Re:"Ontological" is a synonym for failure. by yppiz · 2010-07-16 20:10 · Score: 1

Early on, we knew we'd have to make a UI so that users could have as close to a free-text experience as possible while still contributing structured data. Freebase lets you create a topic that is generic, and then co-type it with multiple specific types later. It allows ontology geeks to do their thing, and regular users to just work where they are comfortable. It's a tough balance to strike, but Metaweb's Freebase was populated by a small team of data wranglers using a mix of automated methods and coordinated manual cleanup and entry, along with power users who were especially interested in particular data domains.
At one point I was really interested in submarines. I created a type describing the key characterists of subs and then spent a few days finding all the generic topics in Freebase on subs (many from Wikipedia) and filling them in. Others, either at Metaweb or outside, have done similar efforts on other domains.
Few contributors ever say or even have to consider ontologies. If they want to dig in, it's there, but almost never presented in a way that requires a PhD and a pipe.
Alternative Headline by bestadvocate · 2010-07-16 20:55 · Score: 1

Look forward to Freebasing with Google!

--
my sig
Google has it. by NeoXon · 2010-07-17 04:27 · Score: 1

All your (free)base are belong to us.
Re:"Ontological" is a synonym for failure. by Fnkmaster · 2010-07-17 04:32 · Score: 1

... which is exactly what DuckDuckGo uses as its data source to handle disambiguation. But Wikipedia is structured for humans and features a large volume of knowledge in human language form with some basic markup. It's not a bunch of information encoded in RDF tuples. Thus my point. Trying to get everybody on the web to re-encode the vast body of knowledge out there in RDF, explicitly referencing ontologies is a setup for failure. Sure, you might use some sort of tuple format to internally store information that you parse out of the human-language web, but that's different from what the "semantic web" set out to be initially.
Re:"Ontological" is a synonym for failure. by Hurricane78 · 2010-07-18 07:12 · Score: 1

Aaahhh Wikipedia... the idea of collecting “facts” by determining how many idiots did not disagree.
Or in other words: Argumentum ad populum hard at work.

--
Any sufficiently advanced intelligence is indistinguishable from stupidity.