Microsoft Tries To Make Vox Popular - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
News

Microsoft Tries To Make Vox Popular

Voice recognition is a languishing technology that Microsoft apparently sees as a ripe opportunity.

Microsoft is expanding a software effort that, if successful, could have even some of the most ardent Microsoft bashers singing its praises. The company is moving on voice-recognition apps designed to, among other things, lead callers through those thickets of customer-service phone menus.

At a speech-technology conference in New York on Wednesday, Microsoft introduced a second beta version of its .Net Speech Software development kit, a plug-in to its Visual Studio .Net development-tools suite that aims to get more programmers writing speech apps that use Microsoft tools and servers. The company also released a "technical preview," of its .Net Speech Platform, a set of application-building components. The software is due next year.

Microsoft's efforts could expand markets for wireless phone airtime, speech-recognition software, and PDAs--an area in which Microsoft is picking up market share. Microsoft said Oct. 22 it is buying Vicinity Corp., which makes software that delivers maps and directions to cell phones and PDAs, for $96 million in cash.

"Speech recognition has never really taken off the way people thought it would," says Brian Strachman, an analyst at market-research firm In-Stat. "We need a broad base of applications, and Microsoft has a direct line to all the developers out there."

Microsoft's Office suite includes a dictation-taking capability, and the company published an API for building speech-capable Windows apps. But the company's latest effort focuses on selling call-center software. Airlines and banks often maintain separate sets of application software logic for their Web sites and call centers, which is wasteful, says Kai-Fu Lee, a Microsoft VP who managed the company's Beijing research lab in the late '90s. "The Web is where the real-time, accurate data resides." Microsoft hopes the widespread use of its tools will encourage more companies to write apps that allow access to the same data by phone or over the Web.

"The call center has always been the first, best market for speech recognition," analyst Strachman says, since software can provide a quick payback in saved labor. Telephone customer-service, or "interactive voice response," apps have been characterized by dedicated hardware and application-specific protocols for retrieving data. VoiceXML, a markup language backed by AT&T, IBM, Lucent Technologies, and Motorola, has been a way to reuse logic across voice applications.

Microsoft's approach, called Speech Application Language Tags, adds support for "multimodal" apps that can deliver text and graphics to computer and cell-phone screens, in addition to voice feedback. Microsoft and its partners in the Salt Forum, which include Cisco Systems and Intel, submitted Salt 1 to the World Wide Web Consortium, an industry-standards body, in August. There aren't any multimodal applications on the market today, but they could debut next year as more developers adopt Salt. The new beta version of the .Net Speech development kit--a collection of pre-built components that automate common programming tasks--includes the ability to build a usable server-side application and complies with the version of Salt submitted to the W3C. The .Net Speech Platform includes a Salt-enabled version of the Internet Explorer Web browser, software that communicates with call-center answering and routing hardware, and a speech-recognition engine.

Microsoft plans to give away the development kit but hasn't decided how it will package its speech platform. In addition to license revenue, there's also value in seeding the market with speech software, Lee says. "Speech will be part of the PC experience," Lee says. "But it won't come from the PC, because the keyboard and mouse work so well. We want to decrease the users' fear of the technology."

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
State of the Cloud
State of the Cloud
Cloud has drastically changed how IT organizations consume and deploy services in the digital age. This research report will delve into public, private and hybrid cloud adoption trends, with a special focus on infrastructure as a service and its role in the enterprise. Find out the challenges organizations are experiencing, and the technologies and strategies they are using to manage and mitigate those challenges today.
Commentary
Enterprise Guide to Edge Computing
Cathleen Gagne, Managing Editor, InformationWeek,  10/15/2019
News
Rethinking IT: Tech Investments that Drive Business Growth
Jessica Davis, Senior Editor, Enterprise Apps,  10/3/2019
Slideshows
IT Careers: 12 Job Skills in Demand for 2020
Cynthia Harvey, Freelance Journalist, InformationWeek,  10/1/2019
Register for InformationWeek Newsletters
Video
Current Issue
Getting Started With Emerging Technologies
Looking to help your enterprise IT team ease the stress of putting new/emerging technologies such as AI, machine learning and IoT to work for their organizations? There are a few ways to get off on the right foot. In this report we share some expert advice on how to approach some of these seemingly daunting tech challenges.
White Papers
Slideshows
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Sponsored Video
Flash Poll