March 25, 2008
Semantic Web Patterns: A Guide to Semantic Technologies - ReadWriteWeb
Alex Iskold wrote a nice article that provides an overview and categorization of semantic web approaches, technologies, and companies.
Here are a few key points from the article, interspersed with some of my own perspectives.
- The Semantic Web is now capturing broad attention, and has been called the number one trend in 2008 (by Richard MacManus, founder of ReadWriteWeb).
- Yahoo! recently announced that their search engine is going to support RDF and microformats. This will provide incentive for publishers to use semantic markup in their content. This echoes a point I made in my semantic web keynote talk last year (see below), that search engines would create incentives to drive the semantic web faster than people may have expected.
- Several companies are now offering web services to support or automate semantic markup. These include the Semantify web service from Dapper, the Open Calais web service from Reuters/ClearForest, and the Semantic Hacker API from TextWise.
- There are top-down and bottom-up approaches to the Semantic Web. Bottom-up approaches require people to enter semantic markup. This can be in strong semantic web formats using standards like RDF, or in lightweight markup formats, like Microformats.
- Search is potentially a killer app of semantic technologies. The author argues that semantic technologies alone are not enough to deliver better search, but when used in combination with the other search techniques they might be better. I agree that the combination is best. But I disagree with the statement that
Google's algorithm, which is based on statistical analysis, deals just fine with semantic entities like people, cities, and companies.
I think there is a significant gap today between what we are used to with search engines and what is possible with stronger semantic approaches, and this will become clearer over the next year.- Contextual technologies use semantic markup within the page and combine that with external content and services. Thus a user does not have to search in order to benefit from the semantics. Examples include Snap, Yahoo Shortcuts, and SmartLinks. Such technologies are making their way into the browser, where they will have wider appeal and accelerate the trend toward the semantic web.
- Semantic databases focus on building and utilizing structure semantic information (as opposed to marking up unstructured content). Twine, by Radar Networks, and Freebase, by Metaweb, are two examples. (I am personally familiar with Freebase as we are integrating this within our offerings at Powerset.) Over time, we will see increasing synergies between the semantic technologies based on structured and unstructured data.
- Contextual technologies use semantic markup within the page and combine that with external content and services. Thus a user does not have to search in order to benefit from the semantics. Examples include Snap, Yahoo Shortcuts, and SmartLinks. Such technologies are making their way into the browser, where they will have wider appeal and accelerate the trend toward the semantic web.
I highly recommend this article to people interested in semantic technologies and search. For my own perspective on the relationship between natural language, search, and the semantic web, you can see the video and presentation of my Keynote Talk at the 2007 International Semantic Web Conference, entitled Natural Language and the Semantic Web.
Posted by barney at 9:04 PM | Comments (0) | TrackBack (0)
February 28, 2008
Cyan Banister in Nerd Girl video
My friend Cyan is featured in this music video. It's awesome!
I like the starting lyric: "Nerd girl, I don't deserve you. I don't get the references you allude to."
Posted by barney at 11:30 PM | Comments (0) | TrackBack (0)
February 26, 2008
In 5 years we will search more with voice than typing
David Vogelpohl wrote an article, Will Microsoft Resurrect Natural Language Search, citing a recent AP article about Bill Gates and voice-based search. Here are some quotes from the AP article:
People will increasingly interact with computers using speech or touch screens rather than keyboards, Microsoft Corp. Chairman Bill Gates said.“It’s one of the big bets we’re making,” he said during the final stop of a farewell tour before he withdraws from the company’s daily operations in July.
In five years, Microsoft expects more Internet searches to be done through speech than through typing on a keyboard, Gates told about 1,200 students and faculty members Thursday at Carnegie Mellon University.
David conjectures, as do I, that when people speak their searches they are more likely to use natural language than to use keywordese, and that this could change the game in search.
I personally can envision Microsoft trying to integrate speech based data entry as closely as possible with our normal style of speaking. Perhaps the phrase “Where can I buy a hd tv?” would be more natural for searchers when you take away the limitations of the keyboard.Wide spread speech based data entry will almost certainly impact the way Microsoft and subsequently all other search engines deal with search queries.
It's interesting to see Bill Gates predicting this to happen within 5 years. In the blink of an eye, an entire industry is going to change dramatically.
While on the topic of predictions about voice and language, here's one of my predictions that I have been meaning to write up:
Within 8 years from now (2016), every category of consumer electronics will have some linguistic interface as a standard feature.
By "linguistic interface", I mean voice interactions or text-based interaction that is linguage-based. Not that these devices won't still have nonlinguistic interfaces too (e.g. there will still be buttons, most likely). And by "every category", I mean you will not find a category of consumer electronics that does not have some product in that category with that feature.
For example, users will expect to be able to talk to cameras, tvs, stereos, ipods, phones, watches, microwave ovens, refrigerators, cars, etc. There will still be some cameras that aren't language-enabled, but every category will have some products that are.
As my friends Cliff Nass and Scott Brave write in their book, Voice Activated, when people interact with devices using voice, it also invokes the rest of their social apparatus. You can't hear a voice without ascribing some kind of personality, gender, race, social status, etc to the source of the voice. So in addition to expecting linguistic capability, we're also going to start expecting personality within the next decade.
I'll stop here before I get carried away to the singularity...
Posted by barney at 8:30 PM | Comments (0) | TrackBack (0)