Technology Review - Published By MIT
Log in to My.TechnologyReview.com | Register
Advertisement
[1] 2 Next »

Friday, April 21, 2006

The Search for Voice Activation

Google's voice interface patent gives life to rumors that voice-actived mobile search will soon be a reality.

By Kate Greene

smaller text tool iconmedium text tool iconlarger text tool icon

Google is mum on how -- if at all -- it plans to use its recently granted patent for a voice-enabled search engine -- despite the fact that it has also hired several speech-recognition researchers.

Originally filed in February 2001, the patent was granted for "Voice interface for a search engine," in a move that likely signals some level of development at Google on voice technology for searching the Web using handhelds. Further fueling speculation is Google's poaching of several speech-recognition specialists -- the kind of move that often signals that a new product is afoot.

"They've put together a very strong group of people who are experts in speech-recognition technology," says Nelson Morgan, director of the International Computer Science Institute, which is affiliated with the University of California at Berkeley. He explains that Google picked up a number of engineers from Nuance, "the big gun in speech-recognition technology," which recently lost researchers after it was purchased by Scansoft, a speech-technology company (the combined company is called Nuance).

Investing in voice-search technology is a prescient move by Google, since the mobile Web appears to be at a tipping point. Globally, 28 percent of all mobile-phone users now surf the Web (up three percent over last year), according to Ipsos Insight, a Paris-based market research firm. In the United States, three out of every four homes now have mobile phones, and increasingly those phones are being used for more than just talking. In 2005, over half (52 percent) of all mobile-phone users had either sent or received a text message and 37 percent had sent or received an e-mail message.

Despite this growth, though, searching the Internet on handhelds can be maddening because of tiny keyboards and unfriendly designs, which require, at best, a stylus or, at worst, a series of clicks through numerous menu screens. Speech recognition software may provide a better way. And a handful of companies are exploring the technology. San Diego's V-Enable offers voice search services to help users find ring-tone listings, and Promptu of Menlo Park, CA plans to offer similar services this summer.

At their most fundamental, speech recognition systems are composed of three major functions. First, words are captured and translated into a digital signal. Then a speech-recognition algorithm compares those signals to words and phrases from a pre-set dictionary, of, say, ring-tone options or movie listings. Finally, the software offers the most likely match for the spoken phrase.

[1] 2 Next »

Comments

  • Super Concept
    Guest (Suchit) on 04/21/2006 at 12:00 AM
    Posts:
    1
    Certainly this could get big in terms of "Local Search" and maps based applications. Where people can query by voice about a particular thing they are looking for somewhere and they get back detailed instructions, description or whatever the information with power of Google search engine.

    http://www.Suchit-Tiwari.Org
    Rate this comment: 12345
    • Accent
      Guest (Bibi Kaur) on 04/21/2006 at 12:00 AM
      Posts:
      1
      Both NOISE POLLUTION [New] LAWS and/or Lack of laws and Accents will be in the way
      However, I keenly await the day when I can use this life saving technology. Go Google!
      Rate this comment: 12345
  • Hurdles
    Guest (Sastri) on 04/21/2006 at 12:00 AM
    Posts:
    1
    As anyone who has used voice activated phone system knows - this technology looks good but will be hard to implement. It has a frustration factor that will be hard to overcome. No one I know has the patience to parse the voice options to get to a result. Doing a search on a computer is different because you see a screenful of results from which you can pick quickly what you want or move on.
    Rate this comment: 12345
  • Download Voice Tools from Voice Web Solutions
    Guest (Brian Ty Graham) on 04/21/2006 at 12:00 AM
    Posts:
    1
    Speech activate web pages at VoiceWebSolutions.net
    Rate this comment: 12345
  • Voice Activation
    Guest (Bewulf) on 04/28/2006 at 12:00 AM
    Posts:
    1
    Human Speach is a series of tones at given frequency and amplitudes.  Certain residual background noises emulate what sounds like conversation from a distance.  It is suprising how close sounding to speach noise sources can become.  As with anything it is how one acclimates to the tonal quality that brings understanding.  Think of a bus station loudspeaker system, no one can understand what is broadcast when first exposed.  After a bit of time, however, an individual will atune to the to the altered frequency and modulation and begin to discriminate words.  Look further at how humans adapt to learn understand what machines must do to achieve learning.
    Rate this comment: 12345
Advertisement

Current Issue

Technology Review July/August 2008
The Business of Social Networks
The future of the Web is social. But can social-networking sites ever make money?
•  Subscribe
Save 41%
•  Table of Contents
•  MIT News

Magazine Services

Career Resources

MIT Technology Insider

Stories and breaking news from inside MIT about the latest research, innovations, and startups--in a convenient monthly e-newsletter. Subscribe today
Advertisement

More Technology News from Forbes

Advertisement
Advertisement
Advertisement
TECHNOLOGY RESOURCES
Advertisement
MIT Massachusetts Institute of Technology