2013-03-03 100 views
2

我試圖做一個自定義對話框,而語音識別,而不是使用官方的對話框。我得到了這一部分,但是當我確定顯示聲音的幅度時,爲了使它更加花哨,就像Google Now搜索欄那樣(它的麥克風周圍的圓圈如果發出更響亮的聲音):如何用語音識別器獲取音頻幅度?

googlenow http://img600.imageshack.us/img600/3459/gnow.png

然後,我開始代碼是如何獲得聲音的幅度,最後我用AudioRecord類得到它。

問題是當我嘗試混合兩者(SpeechRecognizer和AudioRecord),因爲好像他們不能夠共享麥克風或類似的東西...

在logcat的我有這樣的錯誤:

03-03 21:16:07.461: E/ListenerAdapter(23359): onError 
03-03 21:16:07.461: E/ListenerAdapter(23359): com.google.android.speech.embedded.Greco3RecognitionEngine$EmbeddedRecognizerUnavailableException: Embedded recognizer unavailable 
03-03 21:16:07.461: E/ListenerAdapter(23359): at com.google.android.speech.embedded.Greco3RecognitionEngine.startRecognition(Greco3RecognitionEngine.java:108) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.lang.reflect.Method.invokeNative(Native Method) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.lang.reflect.Method.invoke(Method.java:511) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at com.google.android.searchcommon.utils.ThreadChanger$1$1.run(ThreadChanger.java:77) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:390) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.FutureTask.run(FutureTask.java:234) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:153) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:267) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1080) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:573) 
03-03 21:16:07.461: E/ListenerAdapter(23359): at com.google.android.searchcommon.utils.ConcurrentUtils$2$1.run(ConcurrentUtils.java:112) 

和其他一些時候,我有這樣的:

03-03 21:47:13.344: E/ListenerAdapter(23359): onError 
03-03 21:47:13.344: E/ListenerAdapter(23359): com.google.android.speech.exception.AudioRecognizeException: Audio error 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.embedded.Greco3Recognizer.read(Greco3Recognizer.java:107) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at dalvik.system.NativeStart.run(Native Method) 
03-03 21:47:13.344: E/ListenerAdapter(23359): Caused by: java.io.IOException: couldn't start recording, state is:1 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.MicrophoneInputStream.ensureStartedLocked(MicrophoneInputStream.java:119) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.MicrophoneInputStream.read(MicrophoneInputStream.java:159) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.common.io.ByteStreams.read(ByteStreams.java:806) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.Tee.readFromDelegate(Tee.java:374) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.Tee.readLeader(Tee.java:267) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.Tee$TeeLeaderInputStream.read(Tee.java:464) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at java.io.InputStream.read(InputStream.java:163) 
03-03 21:47:13.344: E/ListenerAdapter(23359): at com.google.android.speech.audio.AudioSource$CaptureThread.run(AudioSource.java:193) 

這也是我如何啓動這兩個:

//previously in constructor 
speechrec = SpeechRecognizer.createSpeechRecognizer(getActivity()); 
speechrec.setRecognitionListener(this); 
// 

public void launchListening() 
{  
    if (speechrec.isRecognitionAvailable(getActivity())) 
    { 
     Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); 
     intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,RecognizerIntent.LANGUAGE_MODEL_FREE_FORM); 
     speechrec.startListening(intent);  
    } 

    bufferSize = AudioRecord.getMinBufferSize(sampleRate, AudioFormat.CHANNEL_CONFIGURATION_MONO, AudioFormat.ENCODING_PCM_16BIT);// * bufferSizeFactor; 
    audio = new AudioRecord(MediaRecorder.AudioSource.MIC, sampleRate, AudioFormat.CHANNEL_CONFIGURATION_MONO, AudioFormat.ENCODING_PCM_16BIT, bufferSize); 
    audio.startRecording(); 

    captureThread = new Thread(new Runnable() 
    { 
     public void run() 
     { 
      //calculate amplitude here 
     } 
    }); 
    captureThread.start(); 
} 

關於如何創建語音識別自定義對話框的任何想法,我可以根據噪音顯示振幅,就像Google一樣?

回答

3

執行此操作的方法是向SpeechRecognizer註冊偵聽器,並將onRmsChanged的輸出可視化。但請注意:

There is no guarantee that this method will be called.

因此,您正在使用的語音識別器需要支持此方法。請注意,返回值SpeechRecognizer.createSpeechRecognizer(getActivity())取決於用戶設備的配置。

(而SpeechRecognizer被記錄,反之亦然不能啓動AudioRecord

+0

我知道,問題是,它不是guaranted被稱爲...(其實我想和它不是叫)。我正在考慮用AudioRecord捕獲音頻,然後以某種方式將其發送給SpeechRecognizer,但似乎不可行。 – nsL 2013-03-04 09:46:53

+1

是的,同時使用'AudioRecord' +'SpeechRecognizer'不適用於當前版本的Android。關於RMS回調和Google的語音識別器,它停止與Jelly Bean一起工​​作。這是我在測試Babble時的印象(https://github.com/Kaljurand/babble)。我不知道這是谷歌識別器中的一個錯誤還是他們通過API提供更少的政治決定。 – Kaarel 2013-03-04 10:19:52