IBM Watson Cloud Gains Eyes, Ears, And A Voice - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Software // Enterprise Applications
11:36 AM
Connect Directly

IBM Watson Cloud Gains Eyes, Ears, And A Voice

IBM Watson developer cloud adds speech-to-text, text-to-speech, visual recognition, and decision services. Will businesses build their own Jeopardy apps?

10 Cloud Migration Mistakes To Avoid
10 Cloud Migration Mistakes To Avoid
(Click image for larger view and slideshow.)

IBM Watson tantalized the world when it beat two grandmaster champions at the game of Jeopardy in 2011, but commercial applications spun off the technology since have lacked the same anthropomorphic sex appeal. On Thursday, IBM announced new Watson Developer Cloud services that promise more of Jeopardy Watson's human-like power to hear, speak, see, and make decisions.

The Watson Developer Cloud already offered eight services that could be described as human-like or even superhuman, such as the ability to identify the language of written input; the ability to answer written questions, drawing on deep knowledge repositories; and the ability to learn user preferences. With five new services, IBM said in a statement that it's "allowing people from diverse industries and disciplines to easily tap into the power of cognitive computing."

[ Want more on this topic? Read IBM Watson: 29 Signs Of Progress. ]

The five new services include:

Speech-to-Text. This "low-latency" service converts speech into text to power voice-controlled mobile applications, transcription services, and, along with others services, speech-to-speech translation. Speech-to-Text transcriptions of speech are sent back to the client and retroactively corrected as the system gains more speech input and context. More speech is heard, helping the system learn.

Text-to-Speech. Who wants to read responses? Most of the delight in using services such as Siri, Cortana, and Amazon Alexa is having a "conversation" with a computer. Watson's new Text-to-Speech service lets developers choose from among three English and Spanish voices, including the American voice used by Watson in the 2011 Jeopardy match.

Visual Recognition. This service analyzes images or video frames and interprets what's happening in the scene. The Visual Recognition service includes prebuilt classifiers, more than 2,000 trained labels, and taxonomies for different domains. A sports taxonomy, for example, recognizes more than 150 sports and can tag images or footage with a confidence level as to whether it's an example of soccer or baseball. Use-cases include organizing large collections of imagery and understanding consumers' shopping preferences based on the images they're viewing.

Concept Insights. When it comes to search, keywords are limited. Concept Insights lets users provide documents. The service then searches for related documents based on a graph of concepts established in Wikipedia. The service provides explicit links to content that directly mentions related concepts and implicit links, which might be relevant content that doesn't directly mention concepts in the user's document. Use-cases include improving search queries and locating expertise in large organizations.

Tradeoff Analytics. This service uses Pareto filtering techniques to weigh multiple, possibly conflicting decision alternatives based on multiple criteria. Tradeoff Analytics makes best-possible choices considering decision goals, and the benefits and drawbacks of various alternatives. Use-cases include enabling retailers or manufacturers to determine their product mix. A consumer service could help them compare products or services. IBM has used this service in Watson applications to help physicians select optimal treatment options.

These five new services are available immediately, along with eight other services, on the Watson Developer Cloud. Since its launch in October 2013, the Watson Developer Cloud has attracted more than 5,000 partners that have built some 6,000 apps to date, according to IBM.

Attend Interop Las Vegas, the leading independent technology conference and expo series designed to inspire, inform, and connect the world's IT community. In 2015, look for all new programs, networking opportunities, and classes that will help you set your organization’s IT action plan. It happens April 27 to May 1. Register with Discount Code MPOIWK for $200 off Total Access & Conference Passes.

Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ... View Full Bio

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
Newest First  |  Oldest First  |  Threaded View
User Rank: Ninja
2/23/2015 | 1:55:01 PM
Small business boon
These new applications sound like they could be a boon to small businesses. I love the idea of talking through my thoughts and getting a transcript without having to type it in. Or to be able to 'hear' the written presentation so I can figure out where I'm likely to stumble or where things don't quite go together. These two applications alone could change how many people work. Which of the application would you be most likely to use?
User Rank: Apprentice
2/9/2015 | 11:01:01 PM
AI -- Not All It's Cracked Up to Be
Great byline on the "consciousness fallacy" of AI in the Washington Post recently. Author: IBM AI expert David Sullivan. Punctures what he calls the "consciousness fallacy" of cognitive computing aka AI2, and takes the position that while AI has come far it will never be "human like." Ergo I suspect he'd disagree with your characterization. As Sullivan notes, the Google Car may pick the best route, but  it won't argue if you decide to go another way. I'd post the url to the article for you, but your cruel bots won't allow it. Am left to console you with "Relax, the machines will not awaken and take over."
User Rank: Author
2/9/2015 | 2:29:39 PM
Re: .150 sports, yes, but can Watson tell baseball from cricket?
LOL @Charlie, even I can tell the difference between cricket and baseball bats, so I'd think Watson can manage it. It probably also can understand the rules of the games better than I can.  I do wonder if does get confused about how "football" is used to mean different sports on different sides of the Atlantic, though.
Susan Fourtané
Susan Fourtané,
User Rank: Author
2/8/2015 | 3:23:44 AM
The next big thing
Fabio Rosati, CEO of Elance, has said that Watson is going to be the next big thing after the Internet, and I agree. Watson's powered-apps in the cloud are going to change the way we do things and this will happen across many industries and individuals alike. 


Charlie Babcock
Charlie Babcock,
User Rank: Author
2/6/2015 | 5:11:42 PM
.150 sports, yes, but can Watson tell baseball from cricket?
Watson Visual Recognition recognizes 150 sports and can tell soccer from baseball. But if it sees someone swing a bat, can it tell baseball from cricket? Watch out, it's a sticky wicket out there, Watson.

How GIS Data Can Help Fix Vaccine Distribution
Jessica Davis, Senior Editor, Enterprise Apps,  2/17/2021
Graph-Based AI Enters the Enterprise Mainstream
James Kobielus, Tech Analyst, Consultant and Author,  2/16/2021
11 Ways DevOps Is Evolving
Lisa Morgan, Freelance Writer,  2/18/2021
White Papers
Register for InformationWeek Newsletters
Current Issue
2021 Top Enterprise IT Trends
We've identified the key trends that are poised to impact the IT landscape in 2021. Find out why they're important and how they will affect you.
Flash Poll