User's Manual

DATASHEET FC6000 Confidential Information 10/53

2.5 Speaker independent Voice recognition

2.5.1 Voice Recognition principles

2.5.1.1 Description

VoCon 3200 V3.3 is NUANCE training-less speaker independent speech recognition

algorithm.

• Phonemes recognition: words are recognized without previous training

• Words models for a better precision, especially for digits recognition

• Continuous voice recognition: no need for blanks between words

• New words learning (Voice tags), speaker dependent speech recognition (100 Voice

tags, 2kbytes by Voice tag)

• Noise robustness and accuracy in an automotive environment: engine, click-button

etc…

• Highly accurate recognition

• VoCon Music Pre-Processor. This feature allows the user to select music to play by

voice commands.

2.5.1.2 Operation

During a voice recognition process, "Feature Extraction" module decomposes the signal. The

module "Search" looks for the equivalent text using the modules "G2P" and "Grammar to

compile". These two modules are using the libraries "Acoustic Model", "Lexicon" and

"Grammar".

Module G2P ensures equivalence between the graphemes and the phonemes.

For each language is associated an acoustic model ("Acoustic Model"), a grammar and a

lexicon ("Grammar" and "Lexicon").

System feedback is realized by a screen display and/or sounds (synthesized voice, chime,

pre-recorded voice prompts…).

Operation is ended by a final action (phone number dialing, radio station tuning…).