After the digitalization of any signal we get a sequence
of samples that represent the signal
with more or less fidelity.
is encoded using PCM (Pulse Code Modulation), in which every sample is
represented with the same number of bits. For example, in a CD we have a
data-rate of
Always remember that compression not necessary reduces the number of bits at
the output. Sometimes, an expansion could be generated.
2 Sources of redundancy
In general, the signal
has different types of redundancy:
Statistical redundancy. It can be removed using probabilistic
models of the signal producing lossless codecs. The codecs are also
known as text codecs.
Spatial/temporal redundancy. It can be removed using
spatial/temporal models of the signal and produces also lossless
codecs.
Psychological redundancy. Some information that signal carry
can not be perceived by humans. This kind of pseudo-redundancy is
removed normally by means of quantization, producing lossy codecs.
3 Symbols, runs, strings, code-words and code-streams
In the context of statistical coding, each sample of
is called a symbol.
Depending on the type of statistical relationship between symbols, we will
speak also about strings when we process more than one symbol and about
runs when all the symbols are the same in a string.
In any case, the output of the encoder is a sequence of code-words that
all together generates a code-stream.