Lecture Language:
  • English
Hours (Lecture):
  • Mo 12:15-13:45 H5
    Th 08:15-09:45 H15
    Fr 10/05/2019 16:15-17:45 H15
    Fr 31/05/2019 16:15-17:45 H15
Hours (Exercise):
  • Tu 16:15-17:45 (06.021)
    Fr 08:15-09:45 (06.021)
Credit Points:
  • 5 ECTS
Time Lecture:
  • 3
Time Exercise:
  • 1

Schedule
Week  Monday, 12:15 - 13:45  Tuesday, 16:15 - 17:45       Thursday, 08:15 - 09:45  Friday, 08:15 - 09:45        Friday, 16:15 - 17:45
1     22/04 -                23/04 -                      25/04 Lecture 1 (H15)    26/04 -                      -
2     29/04 Lecture 2 (H5)   30/04 -                      02/05 Lecture 3 (H15)    03/05 -                      -
3     06/05 Lecture 4 (H5)   07/05 -                      09/05 Lecture 5 (H15)    10/05 -                      Lecture 6 (H15)
4     13/05 -                14/05 Supplement 1 (06.021)  16/05 -                  17/05 Supplement 1 (06.021)  -
5     20/05 -                21/05 -                      23/05 Lecture 7 (H15)    24/05 -                      -
6     27/05 Lecture 8 (H5)   28/05 -                      30/05 -                  31/05 -                      Lecture 9 (H15)
7     03/06 Lecture 10 (H5)  04/06 Supplement 2 (06.021)  06/06 Lecture 11 (H15)   07/06 Supplement 2 (06.021)  -
8     10/06 -                11/06 -                      13/06 Lecture 12 (H15)   14/06 -                      -
9     17/06 Lecture 13 (H5)  18/06 -                      20/06 -                  21/06 -                      -
10    24/06 Lecture 14 (H5)  25/06 Supplement 3 (06.021)  27/06 Lecture 15 (H15)   28/06 Supplement 3 (06.021)  -
11    01/07 Lecture 16 (H5)  02/07 Supplement 4 (06.021)  04/07 Lecture 17 (H15)   05/07 Supplement 4 (06.021)  -
12    08/07 Lecture 18 (H5)  09/07 Supplement 5 (06.021)  11/07 Lecture 19 (H15)   12/07 Supplement 5 (06.021)  -
13    15/07 Lecture 20 (H5)  16/07 Supplement 6 (06.021)  18/07 Lecture 21 (H15)   19/07 Supplement 6 (06.021)  -
14    22/07 Lecture 22 (H5)  23/07 Supplement 7 (06.021)  25/07 Lecture 23 (H15)   26/07 Supplement 7 (06.021)  -
 
Contents

The course concentrates on algorithms for speech and audio signal processing with applications in telecommunications and multimedia, in particular:

  • Human Speech Production and Hearing: source-filter model, filterbank model of the cochlea, masking effects

  • Signal Representation for Speech and Audio: estimation and representation of short-term and long-term statistics in the time, frequency, and cepstral domains; typical examples and visualisations

  • Source Coding for Speech and Audio: criteria; scalar and vector quantization and coding; linear prediction; pitch prediction; waveform, parametric, and hybrid coding; standards (ITU, GSM, ISO-MPEG)

  • Fundamental Concepts of Automatic Speech Recognition: feature extraction, dynamic time warping, hidden Markov models

  • Fundamental Concepts of Speech Synthesis: text-to-speech systems, model-based and data-driven synthesis, PSOLA synthesis

  • Signal Enhancement for Acquisition and Reproduction: noise reduction, echo cancellation, and dereverberation using single-channel and multichannel algorithms

Literature
Recommendations for each topic are given during the lectures.