blog how to, blog trick, blog tips, tutorial blog, blog hack

HEADLINE NEWS

Popular Posts

Showing posts with label entrepreneurs at Google. Show all posts
Showing posts with label entrepreneurs at Google. Show all posts

Tuesday, February 22, 2011

Speech technology at Google: teaching machines to talk and listen

This is the latest post in our series profiling entrepreneurial Googlers working on products across the company and around the world. Here, you’ll get a behind-the-scenes look at how one Googler built an entire R&D team around voice technology that has gone on to power products like YouTube transcriptions and Voice Search. - Ed.

When I first interviewed at Google during the summer of 2004, mobile was just making its way onto the company’s radar. My passion was speech technology, the field in which I’d already worked for 20 years. After 10 years of speech research at SRI, followed by 10 years helping build Nuance Communications, the company I co-founded in 1994, I was ready for a new challenge. I felt that mobile was an area ripe for innovation, with a need for speech technology, and destined to be a key platform for delivery of services.

During my interview, I shared my desire to pursue the mobile space and mentioned that if Google didn’t have any big plans for mobile, then I probably wouldn’t be a good fit for the company. Well, I got the job, and I started soon after, without a team or even a defined role. In classic Google fashion, I was encouraged to explore the company, learn about what various teams were working on and figure out what was needed.

After a few months, I presented an idea to senior management to build a telephone-based spoken interface to local search. Although there was a diversity of opinion at the meeting about what applications made the most sense for Google, all agreed that I should start to build a team focused on speech technology. With help from a couple of Google colleagues who also had speech backgrounds, I began recruiting, and within a few months people were busily building our own speech recognition system.

Six years later, I’m excited by how far we’ve come and, in turn, how our long-term goals have expanded. When I started, I had to sell other teams on the value of speech technology to Google's mission. Now, I’m constantly approached by other teams with ideas and needs for speech. The biggest challenge is scaling our effort to meet the opportunities. We've advanced from GOOG-411, our first speech-driven service, to Voice Search, Voice Input, Voice Actions, a Voice API for Android developers, automatic captioning of YouTube videos, automatic transcription of voicemail for Google Voice and speech-to-speech translation, amongst others. In the past year alone, we’ve ported our technology to more than 20 languages.



Speech technology requires an enormous amount of data to feed our statistical models and lots of computing power to train our systems—and Google is the ideal place to pursue such technical approaches. With large amounts of data, computing power and an infrastructure focused on supporting large-scale services, we’re encouraged to launch quickly and iterate based on real-time feedback.

I’ve been exploring speech technology for nearly three decades, yet I see huge potential for further innovation. We envision a comprehensive interface for voice and text communication that defies all barriers of modality and language and makes information truly universally accessible. And it’s here at Google that I think we have the best chance to make this future a reality.

Update 9:39 PM: Changed title of post to clarify that speech technology is not only used on mobile phones but also for transcription tasks like YouTube captioning and voicemail transcription. -Ed.

Wednesday, February 9, 2011

Dialed up: the rapid launch and growth of Click-to-Call

This post is the first in our series profiling entrepreneurial Googlers working on products across the company and around the world. Here, you’ll get an in-depth look at how one of our most successful mobile advertising features was launched by one and a half engineers in a matter of months. -Ed.

I’ll always remember my first cell phone—a big, black brick that was really only good for making calls. While technology certainly has advanced since then, I still appreciate the speed of connecting to people and businesses instantly over the phone, something that I found harder and harder to do when I searched, for example, for the number of a restaurant to make a reservation.

So in June of 2009, a few engineers and I pooled our 20% time and worked to develop a prototype of what would eventually become Click-to-Call for smartphones, an ad unit that makes it easier for people to connect to a business by phone, rather than through a website.


Building the feature was the easy part; essentially, we developed an ad extension that allows advertisers to include a phone or location in their campaigns. However, it was launching it to advertisers that posed the biggest challenge.

With new products like Click-to-Call, we often choose to launch in beta, and incrementally roll out the features to a small subset of users, usually beginning at a 1% test and increasing from there. With Click-to-Call, we’d developed a mobile feature that we wanted to launch as soon as possible, but since mobile advertising was much smaller at that point—then with only about one-sixth as many search queries we get today—we calculated that it would take nearly three years to roll out to 10% and around 10 years to actually launch it. At that rate, the feature would likely become antiquated long before it ever officially launched.

So I decided not to follow the usual process and took a risk, choosing to launch to 50% of Google’s mobile advertisers within the first week. In my view, there was simply no other way to collect enough feedback in a short period of time so that we could quickly iterate based on feedback. Thousands of advertisers—an unprecedented amount for a brand-new feature—were on board to try it out, and with a few engineers and some pretty massive spreadsheets, we started to see real results. Within a month, we had the magical ingredient—momentum—and from there we were collecting enough feedback to be on track to bring the feature to all advertisers in a matter of months.


This is one of the reasons I work at Google. Google gives me freedom to experiment, ownership of my ideas, and amazing resources and support. We built Click-to-Call in June 2009, began testing it in July, and had it up and running for all advertisers in January 2010. One year later, Click-to-Call ads on both search and the Google Display Network are generating millions of calls every month on mobile phones and driving strong performance for advertisers.

If you’re interested in exploring some of the most significant trends in mobile, you can watch our Think Mobile livestream this Thursday, February 10 at 1:05pm EST, where we’ll discuss why it’s “not too late for businesses to still be early” in this space.

 
My Ping in TotalPing.com