Challenging the Ideas Behind the Semantic Web
mytrip writes to tell us that after a recent presentation to the American Association for Artificial Intelligence (AAAI), Tim Berners-Lee was challenged by Google exec Peter Norvig, who cited some of the many problems behind the Semantic Web. From the article: "'What I get a lot is: "Why are you against the Semantic Web?" I am not against the Semantic Web. But from Google's point of view, there are a few things you need to overcome, incompetence being the first,' Norvig said. Norvig clarified that it was not Berners-Lee or his group that he was referring to as incompetent, but the general user."
Re:Googlebombing (Score:5, Informative)
I believe you are referring to PageRank, which is one of many algorithms used by Google to determine search relevance. This article [seobook.com] discusses their use of Latent Semantic Indexing [wikipedia.org], which is a somewhat crude but effective form of semantic inference that is widely used in the field of NLP.
Sem Web, meet Chicken & Egg (Score:4, Informative)
True, the web had a similar problem. However, creating a webpage is a lot more interesting than structuring data: you see the results directly, and however terrible they might be, you do see a result. Structuring data takes a lot more work, and the direct benefit just isn't there.
Sem-Web-like standards such as RSS, XML and SOAP have become mainstream, but primarily because they fill a gap. The adoption of RDF or OWL simply doesn't solve anything. Yet. It would be cool to let agents loose on the semantic web and have them come back with a summary of a certain subject drawn from a multitude of sources, but as long as it's easier to Google I don't think it would generate any interest outside academia.
Feel free to prove me wrong though.
Re:Always bet on the million monkeys (Score:3, Informative)
Re:Googlebombing (Score:2, Informative)
>
> Google does not extract any semantics from content. It merely analyses the linking between
> websites and connects that with keywords. No semantics here.
Google does extract semantics from content in a few particular domains: addresses and business info for Google Maps, show times and additional information on movie searches, dates and appointments from Gmail to Google Calendar.
The semantic web has already started. For now we have it only in a few sufficiently simple domains but, I agree, this should be the right way to go.
Re:It's really, really difficult... (Score:3, Informative)
RDF's core idea is simple. Give everything a URI. Express relationships as a set of three URIs, (subject, property, value). So you might have (#me, #friend, #bob) expressing the idea that Bob is a friend of mine. Or you might have (#photo, #contains, #me), expressing the idea that I'm in a photo.
RDF is little more than a mechanism for expressing relationships. It doesn't give software the ability to understand those relationships; you need to build that on top. RDF just helps you solve the relationship problem in a generic way. So, for example, even if you have (#me, #friend, #bob), that's still meaningless until you write software that knows the #friend, #spouse, #employer, #whatever URIs are relationships between people. For instance, you could build a social network like Friendster, only decentralised - and people have, with FOAF [jibbering.com], because they've agreed that particular URIs express particular relationships.
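To make that concrete, here is a minimal sketch, assuming plain Python tuples stand in for RDF triples and the short fragment strings stand in for full URIs. PERSON_RELATIONS and people_related_to are made-up names for illustration, not part of RDF or FOAF; they represent the application-level knowledge that has to be built on top.

```python
# Each triple is (subject, property, value); the "#..." strings stand in for full URIs.
triples = {
    ("#me", "#friend", "#bob"),
    ("#photo", "#contains", "#me"),
}

# Hypothetical application knowledge: which property URIs relate people to people.
# RDF itself does not supply this; the software reading the triples has to.
PERSON_RELATIONS = {"#friend", "#spouse", "#employer"}


def people_related_to(subject, triples):
    """Return the values linked to `subject` by a person-to-person property."""
    return sorted(
        value
        for (s, prop, value) in triples
        if s == subject and prop in PERSON_RELATIONS
    )


print(people_related_to("#me", triples))     # ['#bob']
print(people_related_to("#photo", triples))  # []
```

The triples alone say nothing about what #friend means. Only the agreed-upon set of property URIs lets the function treat (#me, #friend, #bob) as a link between two people and ignore (#photo, #contains, #me).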
Tutorial on the Semantic Web (Score:2, Informative)
Pay attention to slide #22, which shows how data from different sources can be merged together. This is one of the key differences between XML and RDF: to merge XML data from a number of different schemas, one would need to create an application that processes data in these schemas and generates merged data (possibly inventing a new schema to represent the merged information).
In RDF that happens "magically": in order to merge heterogeneous data you don't need to do *anything*. Just put all the information in an RDF store and it merges. If the data to be merged change, no modifications to the store are necessary. It is like a bag that can hold anything.
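A rough sketch of why the merge is so cheap, assuming each source is modelled as a Python set of (subject, property, value) tuples and that both sources use the same URI for the same thing. The two source names and statements_about are hypothetical, and a real RDF store does more than this (blank nodes, for instance, need care when graphs are combined).

```python
# Two independent sources, each a set of (subject, property, value) triples.
address_book = {
    ("#bob", "#name", "Bob"),
    ("#me", "#friend", "#bob"),
}
photo_site = {
    ("#photo", "#contains", "#bob"),
    ("#me", "#friend", "#bob"),  # the same statement appears in both sources
}

# Merging is set union: no schema mapping, and duplicate statements collapse.
store = address_book | photo_site


def statements_about(resource, store):
    """Return every triple that mentions `resource` as subject or value."""
    return sorted(t for t in store if resource in (t[0], t[2]))


print(len(store))  # 3
for triple in statements_about("#bob", store):
    print(triple)
```

The "magic" depends on the shared URI: because both sources call Bob #bob, a query over the merged store finds his name, his friendship and his photo without any code that knows about either source's layout.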
Re:Damn (Score:2, Informative)