3 Reasons Voice Will Finally Come To The Web

Siri is teaching us to talk to and not just type on our devices. But will we be comfortable recording all our conversations to make voice a searchable app?

E. Kelly Fitzsimmons, Co-founder, HarQen

June 26, 2013

4 Min Read

Voice is dead. Or at least the digerati think so. It takes some real digging in Silicon Valley to find the voiceheads, the true believers that voice will have its second coming as a Web application.

Today, most people think of Apple's Siri when you say voice app, but what if you could control all your apps with voice, and also search through spoken conversations and find content as easily as you do in email? At the very fringes of consumer and enterprise social interaction, this vision is already here. This emergent paradigm, known as hypervoice, promises to be a major boon for productivity. The real question is whether it will tip and become the next big shift in the Web.

It's kind of crazy that telephony and the Web are still so separate. Voice on the Web is only about the transport of voice, not voice as rich media content. Voice today is like Web 1.0 when Web content simply mimicked brochures. It's so boring, it hurts.

[ Not a Siri fan? See 7 Slick Siri Alternative Apps. ]

But what if voice was interactive like hypertext? What if we could search, share and find highlights from our conversations -- just like we do with text? Voice could go from a fringe player to a radical new social object with the potential to alter the way we communicate online.

These ideas may seem wild, but they are certainly not new. The voiceheads are quick to pull up their shirts to compare scars. With so many false starts, why is now the time for voice to become a member in good standing of the Web community? Here are three reasons.

1. Productivity #SOS

Today, voice solves only a space problem -- connecting two people across long distances in real time. But that model doesn't line up with how we work today. We work asynchronously, out of our email inboxes and social media activity streams. Live calls are increasingly disruptive to our workflow. Throwing in the pain of connecting across multiple time zones makes the need for a better way to work more pressing.

Text alone can't save us from this time-stretched, overloaded information stream. We need new tools, badly. Emerging hypervoice apps, where we can go back over our voice conversations and quickly find bits of information we need, will be like giving us perfect recall. Imagine augmented memory without an implant.

2. Viva La WebRTC!

The World Wide Web Consortium (W3C) drafted WebRTC as an API definition to enable browser-to-browser applications for voice calling, video chat and peer-to-peer file sharing without plug-ins. Today it's not trivial to put voice on the Web and make the pieces play nicely together, so it's hard to underestimate the impact that WebRTC (Web Real-Time Communication) will have on the development of future voice applications. Although the standard is still gathering form and adherents (e.g., Microsoft and Apple have not joined the party yet), WebRTC promises to make it whip simple for developers to integrate voice and video into their applications. By lowering technology barriers, new applications are likely to emerge quickly and seemingly out of left field. WebRTC will unleash the developers!

Global CIO Global CIOs: A Site Just For You Visit InformationWeek's Global CIO -- our online community and information resource for CIOs operating in the global economy.

3. Say It: "Behavioral Changes"

People are starting to get comfortable talking to, not just through, their devices. We saw this nascent behavior shift start with Siri, and now it is likely to expand with Google's hotwording. These behavioral shifts are a critical step forward, as we have to get comfortable with voice as an interface. We need to move away from using our mobile devices as a typewriter.

It's critical, and yet ... Behavioral shifts are the hardest friction point to overcome. Social convention and etiquette change far slower than our technology advancements.

And while we are talking about barriers, one of the most pressing to overcome for hypervoice will be the acceptance of recording our voice conversations. As an early adopter, I have about two years of recorded conversations. I had assumed that people would be more off-put at the prospect of being "on the record." What has really surprised me is how little anyone seems to care. By regularly recording my conversations in a format that was simply searchable and shareable not only by me, but by them as well, my colleagues saw it as a boon for their own productivity, too.

So the real question is: Are we ready to trade our privacy for productivity? We have done it before, countless times. But in some ways, voice feels special. It feels like part of our personhood. And to this last point, time will only tell.

Read more about:

20132013

About the Author(s)

E. Kelly Fitzsimmons

Co-founder, HarQen

E. Kelly Fitzsimmons is a well-known serial entrepreneur who has founded, led, and sold several technology startups. Currently, she is the co-founder and director of HarQen, named one of Gartner’s 2013 Cool Vendors in Unified Communications and Network Systems and Services, and co-founder of the Hypervoice Consortium. In 2011, she was awarded the Silvertip PwC Entrepreneurship Award honoring the fastest-growing company within the Angel Capital Association portfolio. In 2013, Speech Technology magazine honored her with the Luminary Award. She completed her undergraduate studies at the University of Rochester and holds a master’s degree from Harvard University. 

Never Miss a Beat: Get a snapshot of the issues affecting the IT industry straight to your inbox.

You May Also Like


More Insights