Digital Signal Processing CGSC433

0.0(0)

Studied by 0 people

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/126

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

127 Terms

New cards

representation of time and amplitude can be

continuous or digital (discrete)

New cards

continuous

continuous line with numbers having a theoretically infinite number of decimal places

New cards

digital (discrete)

sequence of separate points; number of decimal places is always limited- no solid line

New cards

digital devices such as computers can

only store a finite amount of information

New cards

no computer can store the exact value of pi because

it has infinitely many decimal places. Computers can only store this number to a certain number of decimal places

New cards

two important aspects of conversion

how precisely we measure time-sampling and how precisely we measure amplitude-quantization

New cards

sampling is

how frequently we take measurements of the signal in time

New cards

sampling interval is quoted as

a frequency

New cards

sampling rate (in Hz)

is the number of sampling points (intervals) per second

New cards

10,000 samples/sec= 10,000 Hz

10 kHz

New cards

the higher the sampling rate

the more accurate the digital approximation

New cards

not enough sampling points

not going to resemble the original wave

New cards

sampling: high resolution

New cards

sampling: moderate resolution

New cards

sampling: low resolution

New cards

analog devices

can store continuous air pressure variations into continuous electrical signals- computers can't do this

New cards

examples of analog devices

tape recorders, vinyl records

New cards

on computers signals get stored as

digits-they are digital devices

New cards

all acoustic signals in computers are

discrete

New cards

analog-digital conversion

1. limit the number of places after the decimal point on the time axis=sampling

2. limit the number of places after the decimal point on the amplitude axis=quantization

New cards

sampling rate

# of times per second that we measure the continuous wave in producing a discrete representation of signal

New cards

the signal must be sampled

often enough so that all important information is captured

New cards

to capture a 100 Hz periodic wave you need

at least 2 samples per cycle--> 200 samples per second

New cards

nyquist frequency

highest frequency component that can be captured with a given sampling rate

New cards

the nyquist frequency is

1/2 the sampling rate

New cards

two sampling rates. A. a low sampling rate that distorts the original sound B. a higher sampling rate that closely approximates the original sound wave- 4 cycles

New cards

using .5 ms is

very beneficial

New cards

if we take samples less frequently

it is not capturing the information well

New cards

if continuous signal contains frequency- nyquist frequency

the sampled waveform will have a completely different frequency from that in the original continuous signal-misrepresentation-aliasing

New cards

misrepresentation is called

aliasing

New cards

to avoid aliasing we can

increase the sampling rate and filter out high frequencies

New cards

traditional 20 kHz sampling rate

for speech: any component with a frequency >10kHz will not be captured, BUT it will introduce alias components into the discrete signal

New cards

anti-aliasing

it is always necessary to use low-pass filters to block out high frequencies

New cards

how to find the cutoff frequency of an anti-aliasing filter

it is half the sampling rate- 16,000Hz/2=8,000 Hz- same thing as nyquist frequency it is HALF

New cards

what rate should we sample speech at?

it depends on what we are going to use the recordings for!

New cards

the highest frequency that young ears can perceive is

20kHz, so to ensure that all perceptible frequencies are represented, we must sample at 2x20kHZ=40kHz

New cards

most of the information relevant to distinguishing speech sounds is

below 10kHz, so high quality speech sound is still obtained at 20kHz sampling rate

New cards

for vowels most of the relevant information is below

5kHz, so we can get away with a sampling rate at about 10kHz for analyzing just vowels

New cards

energy in fricatives is

higher requiring a sampling rate around 16-20kHz

New cards

phones including cell phones have a sampling rate of

8 kHz

New cards

quantization refers to

how finely we chop up the amplitude scale

New cards

the continuous amplitude scale is divided into

a finite number of evenly spaced amplitude values

New cards

the higher the quantization rate

the more accurate the digital approximation

New cards

digital numbers

computer world, limited choices, discrete values

New cards

computers handle integers (1,2) better than

decimals (0.01, 0.02 etc.)

New cards

acoustic waveforms are stored as

sequences of integers

New cards

size of integer

determined by the number of bits (binary digits) used

New cards

the larger the number of bits

the greater the amplitude resolution

New cards

speech encoding

8, 12, 16 bit quantization

New cards

quantization rate is quoted in

bits

New cards

a bit can either be

0 or 1

New cards

with 1 bit we can only represent

two numbers, 0 or 1

New cards

with two bits we have

four possibilities: 00, 01, 10, 11

New cards

generally using n bits we can represent

2^n levels of amplitude

New cards

8 bits, 2^8 encodable numbers

256 possible distinctions (amplitude values)

New cards

12 bits, 2^12 encodable numbers

4,096 possible distinctions (amplitude values)

New cards

16 bits, 2^16 encodable numbers

65,536 possible distinctions (amplitude values)

New cards

2 bit resolution with

4 levels of quantization

New cards

3 bit resolution with

8 levels of quantization

New cards

the act of quantization introduces

some error into the signal, which is called quantization noise

New cards

quantization noise

error in the signal

New cards

quantization noise is

the difference between the actual amplitude of the analog signal and the amplitude of the digital representation

New cards

subtract red from blue and get the graph below

New cards

too much noise

affects the way the sound is

New cards

the relative loudness of quantization noise is called

signal-to-noise ratio

New cards

smaller ratios mean

the noise has a bigger effect

New cards

larger ratios mean

the noise has a smaller effect

New cards

signal-to-noise ratio is typically expressed as

a ratio from number of possible amplitude steps to 1; for 16 bit quantization: 65,536:1

New cards

noise has a smaller effect for

65,536:1 than for 4,096:1 (12 bit)

New cards

the ratio is the best possible in principle given the

level of quantization

New cards

if the amplitudes in the actual signal do not make use of the full range of values

the actual signal-to-noise ratio may be smaller

New cards

how many amplitude steps can be represented if we are using 10 bits?

2^10=1024

New cards

what is the maximum signal to noise ratio of 1024 amplitude steps/bits?

1024:1 (number of amplitude steps/bits to 1)

New cards

when recording

you are supposed to keep the signal amplitude as high as possible- you are supposed to keep the bar in the green zone without going into the red zone

New cards

red zone

clipping

New cards

keeping within the green zone

keeps the actual signal-to-noise ratio high (so the quantization noise has a smaller effect) because you are using the full amplitude range

New cards

techniques for investigating digital signals include

digital filters, autocorrelation, RMS amplitude, Fast Fourier transform, linear predictive coding, and spectrograms

New cards

digital filters

removes low or high frequency components from the signal

New cards

autocorrelation

tracks pitch changes over time

New cards

RMS amplitude

measures acoustic intensity (loudness)

New cards

Fast Fourier Transform (FFT)

decomposes complex waves into their single component parts

New cards

linear predictive coding

allows examination of broad spectral peaks

New cards

spectrograms

shows spectral changes over time

New cards

we can construct a low pass filter by calculating the

moving average- this eliminates the high frequency bumps- the signal is transformed to preserve lower frequency components

New cards

longer windows will create

smoother signals, but with worse time resolution- not being able to see changes over time very well because they have been smoothed over

New cards

when we speak our F0 is always

changing at least slightly

New cards

there are many methods to tracking fundamental frequency over time, one of which is

autocorrelation

New cards

tracking fundamental frequency- the idea is to pick some interval of the speech signal, make a copy of it and then

1. shift the copy of itself over by 1 sample and see how well the copied interval correlates with the actual wave

2. then shift it by 2 samples and check the correlation

3. then shift it by 3 etc.

4. after some predetermined number of shifting, stop and choose the best correlation

<p>1. shift the copy of itself over by 1 sample and see how well the copied interval correlates with the actual wave</p><p>2. then shift it by 2 samples and check the correlation</p><p>3. then shift it by 3 etc.</p><p>4. after some predetermined number of shifting, stop and choose the best correlation</p>

New cards

when shifting the interval by 50 and 90 samples

the lag doesn't correlate well with the original

New cards

shifting by 129 samples gives

the best correlation, the lag duration is our best guess of the period of the wave

New cards

since the sampling was done at 16,000 samples/s (16 kHz) we calculate the F0 by

129/16000 gives us the length of the lag duration in seconds0 the period so its inverse 16000/129 is the frequency=124.031 Hz

New cards

in praat the possible frequency range is

75-500 Hz these frequencies determine the minimum and maximum possible lag durations

New cards

potential problems with autocorrelation

pitch doubling and pitch halving

New cards

pitch doubling

occurs when one cycle of the waveform has two halves that look roughly the same and the autocorrelation method mistakes them as separate cycles

New cards

when pitch doubling happens

it tracks the pitch as double the actual value

New cards

pitch halving

occurs when alternating pitch periods are more similar than successive periods and the autocorrelation method mistakes two adjacent pitch periods as part of the same cycle

New cards

when pitch halving occurs

it tracks the pitch as half the actual value

New cards

Root Mean Square (RMS) amplitude is

a measure of the energy in a complex wave

New cards

RMS amplitude calculates

the average amplitude over time

100

New cards

RMS amplitude more closely correlates to

perceived loudness than raw amplitude does