Digital Signal Processing CGSC433

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/126

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

127 Terms

1
New cards

representation of time and amplitude can be

continuous or digital (discrete)

2
New cards

continuous

continuous line with numbers having a theoretically infinite number of decimal places

<p>continuous line with numbers having a theoretically infinite number of decimal places</p>
3
New cards

digital (discrete)

sequence of separate points; number of decimal places is always limited- no solid line

<p>sequence of separate points; number of decimal places is always limited- no solid line</p>
4
New cards

digital devices such as computers can

only store a finite amount of information

5
New cards

no computer can store the exact value of pi because

it has infinitely many decimal places. Computers can only store this number to a certain number of decimal places

6
New cards

two important aspects of conversion

how precisely we measure time-sampling and how precisely we measure amplitude-quantization

7
New cards

sampling is

how frequently we take measurements of the signal in time

8
New cards

sampling interval is quoted as

a frequency

9
New cards

sampling rate (in Hz)

is the number of sampling points (intervals) per second

10
New cards

10,000 samples/sec= 10,000 Hz

10 kHz

11
New cards

the higher the sampling rate

the more accurate the digital approximation

12
New cards

not enough sampling points

not going to resemble the original wave

<p>not going to resemble the original wave</p>
13
New cards

sampling: high resolution

knowt flashcard image
14
New cards

sampling: moderate resolution

knowt flashcard image
15
New cards

sampling: low resolution

knowt flashcard image
16
New cards

analog devices

can store continuous air pressure variations into continuous electrical signals- computers can't do this

17
New cards

examples of analog devices

tape recorders, vinyl records

18
New cards

on computers signals get stored as

digits-they are digital devices

19
New cards

all acoustic signals in computers are

discrete

20
New cards

analog-digital conversion

1. limit the number of places after the decimal point on the time axis=sampling

2. limit the number of places after the decimal point on the amplitude axis=quantization

21
New cards

sampling rate

# of times per second that we measure the continuous wave in producing a discrete representation of signal

22
New cards

the signal must be sampled

often enough so that all important information is captured

23
New cards

to capture a 100 Hz periodic wave you need

at least 2 samples per cycle--> 200 samples per second

24
New cards

nyquist frequency

highest frequency component that can be captured with a given sampling rate

25
New cards

the nyquist frequency is

1/2 the sampling rate

26
New cards

two sampling rates. A. a low sampling rate that distorts the original sound B. a higher sampling rate that closely approximates the original sound wave- 4 cycles

knowt flashcard image
27
New cards

using .5 ms is

very beneficial

28
New cards

if we take samples less frequently

it is not capturing the information well

29
New cards

if continuous signal contains frequency- nyquist frequency

the sampled waveform will have a completely different frequency from that in the original continuous signal-misrepresentation-aliasing

30
New cards

misrepresentation is called

aliasing

31
New cards

to avoid aliasing we can

increase the sampling rate and filter out high frequencies

32
New cards

traditional 20 kHz sampling rate

for speech: any component with a frequency >10kHz will not be captured, BUT it will introduce alias components into the discrete signal

33
New cards

anti-aliasing

it is always necessary to use low-pass filters to block out high frequencies

34
New cards

how to find the cutoff frequency of an anti-aliasing filter

it is half the sampling rate- 16,000Hz/2=8,000 Hz- same thing as nyquist frequency it is HALF

35
New cards

what rate should we sample speech at?

it depends on what we are going to use the recordings for!

36
New cards

the highest frequency that young ears can perceive is

20kHz, so to ensure that all perceptible frequencies are represented, we must sample at 2x20kHZ=40kHz

37
New cards

most of the information relevant to distinguishing speech sounds is

below 10kHz, so high quality speech sound is still obtained at 20kHz sampling rate

38
New cards

for vowels most of the relevant information is below

5kHz, so we can get away with a sampling rate at about 10kHz for analyzing just vowels

39
New cards

energy in fricatives is

higher requiring a sampling rate around 16-20kHz

40
New cards

phones including cell phones have a sampling rate of

8 kHz

41
New cards

quantization refers to

how finely we chop up the amplitude scale

42
New cards

the continuous amplitude scale is divided into

a finite number of evenly spaced amplitude values

43
New cards

the higher the quantization rate

the more accurate the digital approximation

<p>the more accurate the digital approximation</p>
44
New cards

digital numbers

computer world, limited choices, discrete values

45
New cards

computers handle integers (1,2) better than

decimals (0.01, 0.02 etc.)

46
New cards

acoustic waveforms are stored as

sequences of integers

47
New cards

size of integer

determined by the number of bits (binary digits) used

48
New cards

the larger the number of bits

the greater the amplitude resolution

49
New cards

speech encoding

8, 12, 16 bit quantization

50
New cards

quantization rate is quoted in

bits

51
New cards

a bit can either be

0 or 1

52
New cards

with 1 bit we can only represent

two numbers, 0 or 1

53
New cards

with two bits we have

four possibilities: 00, 01, 10, 11

54
New cards

generally using n bits we can represent

2^n levels of amplitude

<p>2^n levels of amplitude</p>
55
New cards

8 bits, 2^8 encodable numbers

256 possible distinctions (amplitude values)

56
New cards

12 bits, 2^12 encodable numbers

4,096 possible distinctions (amplitude values)

57
New cards

16 bits, 2^16 encodable numbers

65,536 possible distinctions (amplitude values)

58
New cards

2 bit resolution with

4 levels of quantization

<p>4 levels of quantization</p>
59
New cards

3 bit resolution with

8 levels of quantization

<p>8 levels of quantization</p>
60
New cards

the act of quantization introduces

some error into the signal, which is called quantization noise

61
New cards

quantization noise

error in the signal

62
New cards

quantization noise is

the difference between the actual amplitude of the analog signal and the amplitude of the digital representation

63
New cards

subtract red from blue and get the graph below

knowt flashcard image
64
New cards

too much noise

affects the way the sound is

65
New cards

the relative loudness of quantization noise is called

signal-to-noise ratio

66
New cards

smaller ratios mean

the noise has a bigger effect

67
New cards

larger ratios mean

the noise has a smaller effect

68
New cards

signal-to-noise ratio is typically expressed as

a ratio from number of possible amplitude steps to 1; for 16 bit quantization: 65,536:1

69
New cards

noise has a smaller effect for

65,536:1 than for 4,096:1 (12 bit)

70
New cards

the ratio is the best possible in principle given the

level of quantization

71
New cards

if the amplitudes in the actual signal do not make use of the full range of values

the actual signal-to-noise ratio may be smaller

72
New cards

how many amplitude steps can be represented if we are using 10 bits?

2^10=1024

73
New cards

what is the maximum signal to noise ratio of 1024 amplitude steps/bits?

1024:1 (number of amplitude steps/bits to 1)

74
New cards

when recording

you are supposed to keep the signal amplitude as high as possible- you are supposed to keep the bar in the green zone without going into the red zone

75
New cards

red zone

clipping

76
New cards

keeping within the green zone

keeps the actual signal-to-noise ratio high (so the quantization noise has a smaller effect) because you are using the full amplitude range

77
New cards

techniques for investigating digital signals include

digital filters, autocorrelation, RMS amplitude, Fast Fourier transform, linear predictive coding, and spectrograms

78
New cards

digital filters

removes low or high frequency components from the signal

79
New cards

autocorrelation

tracks pitch changes over time

80
New cards

RMS amplitude

measures acoustic intensity (loudness)

81
New cards

Fast Fourier Transform (FFT)

decomposes complex waves into their single component parts

82
New cards

linear predictive coding

allows examination of broad spectral peaks

83
New cards

spectrograms

shows spectral changes over time

84
New cards

we can construct a low pass filter by calculating the

moving average- this eliminates the high frequency bumps- the signal is transformed to preserve lower frequency components

85
New cards

longer windows will create

smoother signals, but with worse time resolution- not being able to see changes over time very well because they have been smoothed over

<p>smoother signals, but with worse time resolution- not being able to see changes over time very well because they have been smoothed over</p>
86
New cards

when we speak our F0 is always

changing at least slightly

87
New cards

there are many methods to tracking fundamental frequency over time, one of which is

autocorrelation

88
New cards

tracking fundamental frequency- the idea is to pick some interval of the speech signal, make a copy of it and then

1. shift the copy of itself over by 1 sample and see how well the copied interval correlates with the actual wave

2. then shift it by 2 samples and check the correlation

3. then shift it by 3 etc.

4. after some predetermined number of shifting, stop and choose the best correlation

<p>1. shift the copy of itself over by 1 sample and see how well the copied interval correlates with the actual wave</p><p>2. then shift it by 2 samples and check the correlation</p><p>3. then shift it by 3 etc.</p><p>4. after some predetermined number of shifting, stop and choose the best correlation</p>
89
New cards

when shifting the interval by 50 and 90 samples

the lag doesn't correlate well with the original

<p>the lag doesn't correlate well with the original</p>
90
New cards

shifting by 129 samples gives

the best correlation, the lag duration is our best guess of the period of the wave

<p>the best correlation, the lag duration is our best guess of the period of the wave</p>
91
New cards

since the sampling was done at 16,000 samples/s (16 kHz) we calculate the F0 by

129/16000 gives us the length of the lag duration in seconds0 the period so its inverse 16000/129 is the frequency=124.031 Hz

92
New cards

in praat the possible frequency range is

75-500 Hz these frequencies determine the minimum and maximum possible lag durations

93
New cards

potential problems with autocorrelation

pitch doubling and pitch halving

94
New cards

pitch doubling

occurs when one cycle of the waveform has two halves that look roughly the same and the autocorrelation method mistakes them as separate cycles

95
New cards

when pitch doubling happens

it tracks the pitch as double the actual value

<p>it tracks the pitch as double the actual value</p>
96
New cards

pitch halving

occurs when alternating pitch periods are more similar than successive periods and the autocorrelation method mistakes two adjacent pitch periods as part of the same cycle

97
New cards

when pitch halving occurs

it tracks the pitch as half the actual value

<p>it tracks the pitch as half the actual value</p>
98
New cards

Root Mean Square (RMS) amplitude is

a measure of the energy in a complex wave

<p>a measure of the energy in a complex wave</p>
99
New cards

RMS amplitude calculates

the average amplitude over time

100
New cards

RMS amplitude more closely correlates to

perceived loudness than raw amplitude does