Chargement...
 

Natural Language Speech and Audio Processing

Domaine
Natural Language Speech and Audio Processing
Domain - extra
corpus linguistics, machine learning, signal processing
Année
2010
Starting
automn 2010
État
Open
Sujet
Accent simultation and modelisation using automatic speech processing
Thesis advisor
ADDA-DECKER Martine
Co-advisors
Philippe Boula de Mareüil, LIMSI-CNRS. accent characterisation, speech synthesis
Laboratory
EXT
Collaborations

Abstract
Accent is generally defined as a set of some typical pronunciation traits. These traits, when perceived, contribute to classify the speech either as non standard, or as coming from a specific (regional, sociological) variety. It has been shown that accent classification is a very difficult task, both for humans as well as for automatic classifiers. Many facets of what makes an accent remain still to be uncovered and the proposed subject aims at contributing to this aim. What about simulated accents? What if speakers exagerate pronunciation traits to mimic some other speaker or some non-standard variety? Speakers may be more or less gifted to play with accents, mimic various accents and even different voices. This raises important issues with respect to fundamental questions concerning the nature of accents and the characterisation of human voices. On an application side, automatic detection of accent simulation will contribute to security applications such as impostor detection.
Context
Impostor detection is a very active area in the field of automatic speaker recognition. Applied methods include GMMs (gaussian mixture models), HMMs (Hidden Markov Models), SVMs (Support Vector Machines). Acoustic features typically correspond to MFCCs (mel frequency cepstrum coefficients) and MFCC derivatives. Automatic accent identification is a relatively recent issue within the field of automatic language recognition. However the question of accent impostors has, to the best of our knowledge, not been addressed yet.
Objectives
Improve our knowledge/understanding of accent (perception/pruduction) using automatic speech processing methods.

Work program
Data collection of accented speakers and accent impostors. Definition of acoustic/prosodic/pronounciation features. Speech synthesis of controlled accent stimuli. Automatic classification and perceptual experiments involving natural accent impostor stimuli as well as synthetic accent-graded stimuli.

Extra information
Prerequisite
Détails
Télécharger SujetThese2010_LIMSI_Boula_Adda.pdf
Expected funding
Institutional funding
Status of funding
Expected
Candidates
Utilisateur
martine.adda-decker
Créé
Vendredi 26 février 2010 18:05:37 CET
dernière modif.
Vendredi 26 février 2010 18:05:37 CET

Fichiers joints

 filenamecrééhitsfilesize 
SujetThese2010_LIMSI_Boula_Adda.pdf 26 Feb 2010 18:05253440.17 Kb


Ecole Doctorale Informatique Paris-Sud


Directrice
Nicole Bidoit
Assistante
Stéphanie Druetta
Conseiller aux thèses
Dominique Gouyou-Beauchamps

ED 427 - Université Paris-Sud
UFR Sciences Orsay
Bat 650 - aile nord - 417
Tel : 01 69 15 63 19
Fax : 01 69 15 63 87
courriel: ed-info à lri.fr