March 12, 2006

In-car conversational voice interfaces: Speak With Me and VoiceBox

I’ve been meaning to write about this for a while. Here are two comments about innovations in speech processing for embedded (eg. in-car) applications.

Posted by barney on March 12, 2006 at 1:28 pm | No Comments

March 10, 2006

Visual Image Search

I just came across Tiltomo, a new visual search engine (thanks to a posting on Silicon Beat). Most visual search methods are based on color histograms, so they are good for finding similar color schemes to a reference image (eg roses, sunsets). This one also lets you constrain the search by the “theme” of a reference image (without having to specify the theme yourself). The homepage has a nice demo using images from Flickr. I did a search for the tag “spider” and got lots of great spider images. Then I selected one of them, with a close-up of a spider in a web, and got many more similar ones to that. For other searches, I found it easy to get collections of photos of people’s eyes, stuffed animals wearing stripey costumes, and of course, swimming suits.
I did find myself wondering about how much human effort was involved in preparing the data for use with the service, and how that would scale up. The system seemed almost too good at knowing which photos were of people vs. animals, men vs. women, and using this information to find related photos.
While I’m writing about visual search, check out AHOP2, which lets you do find websites based on aesthetic style. It’s a good way to look for furniture that might go well in a modern house, for example.
Down the line, I know of several companies that will be bringing visual object recognition into search. That’s a topic for a later post.

Posted by barney on March 10, 2006 at 6:19 pm | 1 Comment

March 10, 2006

Calendars and Natural Language

My friend Matt Hurst on his DataMining blog wrote about Spongecell, a new Ajax-based calendar that lets you enter events using natural language. Here are some examples Matt tried, with his comments:

I tried out the site. My initial impression of the look and feel was good, including the appearance of the calendar itself and the bubble tips for new users. I found about 75% of my NL inputs were handled correctly on the system, but as Matt says it is likely that users will learn which cases work well and then get high performance using those patterns.
It’s perhaps the best kept secret at Microsoft, but did you know that Microsoft Outlook already supports some natural language entry of calendar events as well? Open up an appointment (new or existing) and in the day slot for the time, enter “fourth monday of April”. Outlook converts that into the correct date. I use this feature all the time.
TechCrunch has a writeup about SpongeCell and many other players in the Calendar2.0 space. That page features 73 comments, many by other calendar companies, so to some extent this captures the current state of play on this topic. One missing related company from the list is TimeBridge (product coming soon), a Mayfield portfolio company for which I’m advisor, along with my friend Mark Drummond (founder of one of the Calendar1.0 companies, called TimeDance, for those who remember).
After writing the first draft of this article, I saw another post on TechCrunch about Google’s upcoming calendar. The information was leaked by a beta tester, and includes detailed screenshots. Key elements from that posting:

Update: I got pointed to another cool calendar2.0: 30 Boxes. It also supports natural language entry, and it looks like it makes it really easy to share events with friends and family.

Posted by barney on March 10, 2006 at 3:36 pm | 1 Comment