Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(159)

Issue 3341020: Speech input: Do environment estimation and detect the no-speech case. (Closed)

Created:
10 years, 3 months ago by Satish
Modified:
9 years, 7 months ago
Reviewers:
joth
CC:
chromium-reviews, ben+cc_chromium.org, Paweł Hajdan Jr.
Visibility:
Public.

Description

Give time for environment estimation and detect the no-speech case. Environment estimation helps establish a baseline against which the endpointer compares to find beginning and end of speech. We now do this estimation for a few hundred milliseconds before showing the speech UI asking user to speak. Using this updated endpointer we also detect if no speech was heard for many seconds and if so inform the delegate as an error. In a future CL we'll allow the user to easily retry recognition in these error cases. BUG=53598 TEST=unit_tests --gtest_filter=SpeechRecognizerTests.* Committed: http://src.chromium.org/viewvc/chrome?view=rev&revision=58930

Patch Set 1 : . #

Total comments: 6
Unified diffs Side-by-side diffs Delta from patch set Stats (+198 lines, -46 lines) Patch
M chrome/browser/speech/endpointer/endpointer.h View 1 chunk +10 lines, -0 lines 0 comments Download
M chrome/browser/speech/endpointer/energy_endpointer.h View 1 chunk +4 lines, -0 lines 0 comments Download
M chrome/browser/speech/speech_input_manager.cc View 4 chunks +21 lines, -5 lines 1 comment Download
M chrome/browser/speech/speech_recognizer.h View 4 chunks +25 lines, -3 lines 0 comments Download
M chrome/browser/speech/speech_recognizer.cc View 8 chunks +45 lines, -16 lines 0 comments Download
M chrome/browser/speech/speech_recognizer_unittest.cc View 8 chunks +93 lines, -22 lines 5 comments Download

Messages

Total messages: 4 (0 generated)
Satish
10 years, 3 months ago (2010-09-09 07:17:35 UTC) #1
joth
LGTM couple small suggestions... http://codereview.chromium.org/3341020/diff/2001/3003 File chrome/browser/speech/speech_input_manager.cc (right): http://codereview.chromium.org/3341020/diff/2001/3003#newcode192 chrome/browser/speech/speech_input_manager.cc:192: const SpeechInputRequest& request = requests_.find(caller_id)->second; ...
10 years, 3 months ago (2010-09-09 10:51:37 UTC) #2
Satish
Addressed all comments with one reply below, will check in if you are ok with ...
10 years, 3 months ago (2010-09-09 11:30:19 UTC) #3
joth
10 years, 3 months ago (2010-09-09 13:31:00 UTC) #4
LGTM.
could comment that it's approximating a 60Hz saw-tooth waveform, or just
leave as is..


On 9 September 2010 12:30, <satish@chromium.org> wrote:

> Addressed all comments with one reply below, will check in if you are ok
> with
> it.
>
>
>
> http://codereview.chromium.org/3341020/diff/2001/3006
> File chrome/browser/speech/speech_recognizer_unittest.cc (right):
>
> http://codereview.chromium.org/3341020/diff/2001/3006#newcode243
> chrome/browser/speech/speech_recognizer_unittest.cc:243:
> audio_packet_.get()[i] = static_cast<uint8>(i);
> On 2010/09/09 10:51:37, joth wrote:
>
>> would it be more representative to put a sine wave in here or
>>
> something?
>
>
> http://src.chromium.org/viewvc/chrome/trunk/src/media/audio/simple_sources.cc
>
>> has an example.
>>
>
> I thought of that earlier but a sine wave isn't exactly representing
> speech either. So it would be swapping one artificial sequence for
> another, so I went with this simpler for loop.
>
>
> http://codereview.chromium.org/3341020/show
>

Powered by Google App Engine
This is Rietveld 408576698