Digital Audio (Audionumérique)

    In brief

  • Code : N9EN16A

Objectives

- Understand the properties of the audio signal (speech and music)

- Know how to process and model the audio signal

Description

- Introduction to the speech signal: human speech production and perception. Practical exercises.
- Acquisition of the audio signal on a computer.
- Parameterization of the speech signal: mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP). Practical application in the lab.
- Modeling of the speech signal: hidden Markov models (HMM), Gaussian mixture models (GMM), and deep neural networks (DNN). Lab session: implementation of a DNN-based keyword recognition application.
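
As an illustration of the MFCC parameterization listed above, here is a minimal sketch in standard-library Python. It is not the course's lab code: the function names, the filter count (20), the number of coefficients (13), and the 8 kHz test tone are all assumptions chosen for the example. It uses a common textbook pipeline (Hamming window, power spectrum, triangular mel filterbank, logarithm, DCT-II) and a naive DFT for readability rather than speed.

```python
import math

def hz_to_mel(f):
    # A widely used mel-scale formula (assumed here; other variants exist)
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of a Hamming-windowed frame, naive DFT."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spectrum.append((re * re + im * im) / n)
    return spectrum

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge frequencies, mapped to DFT bin indices
    edges = [mel_to_hz(mel_max * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges]
    bank = []
    for j in range(1, n_filters + 1):
        left, centre, right = bins[j - 1], bins[j], bins[j + 1]
        filt = [0.0] * n_bins
        for k in range(left, centre):      # rising slope
            filt[k] = (k - left) / (centre - left)
        for k in range(centre, right):     # falling slope
            filt[k] = (right - k) / (right - centre)
        bank.append(filt)
    return bank

def dct2(values, n_coeffs):
    """Unnormalized DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
            for k in range(n_coeffs)]

def mfcc(frame, sample_rate, n_filters=20, n_coeffs=13):
    """MFCC vector for a single frame of samples."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # Floor the filter energies to avoid log(0) on silent bands
    log_energies = [math.log(max(sum(w * p for w, p in zip(filt, spec)), 1e-10))
                    for filt in bank]
    return dct2(log_energies, n_coeffs)

# Usage: one 256-sample frame of a 440 Hz tone sampled at 8 kHz (hypothetical input)
sample_rate = 8000
frame = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(256)]
coeffs = mfcc(frame, sample_rate)
print(len(coeffs))
```

In practice a recognizer would apply this to overlapping frames of a recording and feed the resulting sequence of vectors to the acoustic model (GMM/HMM or DNN).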

Bibliography

- Calliope & Fant (1989). La parole et son traitement automatique. Masson, Paris.
- Mariani (July 2002). Analyse, synthèse et codage de la parole. Hermès, Lavoisier.
- Haton, Cerisara, Fohr, Laprie & Smaïli (2006). Reconnaissance automatique de la parole : du signal à son interprétation. Dunod, Paris.
- Hinton et al. (Nov. 2012). Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups. IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97.
- Google Colab environment: https://colab.research.google.com

Pre-requisites

Bayesian modeling

Contact(s)

FARINAS JEROME

Contact

The National Institute of Electrical engineering, Electronics, Computer science, Fluid mechanics & Telecommunications and Networks

2, rue Charles Camichel - BP 7122
31071 Toulouse Cedex 7, France

+33 (0)5 34 32 20 00
