Datasheet

VS1053B OGG VORBIS ENCODER VSMPG

3.2.3 Compressing Speech

Speech requires different coding parameters than music. First, speech can usually

be coded in mono, so all the following examples convert audio to mono. Also, to get

smaller ﬁles, speech bandwidth can usually be suppressed without adversely affecting

intelligibility.

The following table shows the command needed to convert a speech ﬁle called ﬁle.wav,

a typical bit-rate for that quality, how many hours of music could be stored to a 1 gibibyte

memory card, and a short description of the mode.

Speech compression

Command Typ. Hours/ Description

kbit/s 1 GiB

oggenc downmix resample 4000 -q -1 ﬁle.wav 6 400 Very low bandwidth

oggenc downmix resample 6000 -q -1 ﬁle.wav 8 300 Near mobile phone quality

oggenc downmix resample 8000 -q -1 ﬁle.wav 9 265 Slightly better than mobile phone

oggenc downmix resample 8000 -q 0 ﬁle.wav 13 184 Clean 3 kHz bandwidth sound

oggenc downmix resample 11025 -q -1 ﬁle.wav 14 170 Toy application quality

oggenc downmix resample 11025 -q 0 ﬁle.wav 17 140 Cleaner toy application

oggenc downmix resample 11025 -q 1 ﬁle.wav 20 119 Clean sound, 5 kHz bandwidth

oggenc downmix resample 16000 -q 1 ﬁle.wav 24 99 Wideband (7 kHz) sound

oggenc downmix resample 22050 -q 1 ﬁle.wav 30 80 10 kHz bandwidth

oggenc downmix resample 44100 -q 0 ﬁle.wav 36 66 Full bandwidth (15+ kHz)

oggenc downmix resample 44100 -q 1 ﬁle.wav 42 57 Cleaner sound

oggenc downmix resample 44100 -q 2 ﬁle.wav 48 50 Close to transparent

oggenc downmix resample 44100 -q 5 ﬁle.wav 80 30 CD quality (but mono)

The “ downmix” option converts sound to mono and the “-q” control works as in Chap-

ter 3.2.2. Unfortunately, many graphical front-ends are missing the “ resample” option,

which is a very powerful parameter for speech compression. If your program misses

this slider, you either have to ﬁll the missing values in a command option window or

compress your ﬁles from oggenc’s command line mode.

Rev. 1.70c 2012-11-16

Page 29(39)