Natural Language Speech and Audio Processing

Domaine: Natural Language Speech and Audio Processing
Domain - extra: corpus linguistics, machine learning, signal processing
Année: 2010
Starting: automn 2010
État: Open
Sujet: Accent simultation and modelisation using automatic speech processing
Thesis advisor: ADDA-DECKER Martine
Co-advisors: Philippe Boula de Mareüil, LIMSI-CNRS. accent characterisation, speech synthesis
Laboratory: EXT
Collaborations
Abstract: Accent is generally defined as a set of some typical pronunciation traits. These traits, when perceived, contribute to classify the speech either as non standard, or as coming from a specific (regional, sociological) variety. It has been shown that accent classification is a very difficult task, both for humans as well as for automatic classifiers. Many facets of what makes an accent remain still to be uncovered and the proposed subject aims at contributing to this aim. What about simulated accents? What if speakers exagerate pronunciation traits to mimic some other speaker or some non-standard variety? Speakers may be more or less gifted to play with accents, mimic various accents and even different voices. This raises important issues with respect to fundamental questions concerning the nature of accents and the characterisation of human voices. On an application side, automatic detection of accent simulation will contribute to security applications such as impostor detection.
Context: Impostor detection is a very active area in the field of automatic speaker recognition. Applied methods include GMMs (gaussian mixture models), HMMs (Hidden Markov Models), SVMs (Support Vector Machines). Acoustic features typically correspond to MFCCs (mel frequency cepstrum coefficients) and MFCC derivatives. Automatic accent identification is a relatively recent issue within the field of automatic language recognition. However the question of accent impostors has, to the best of our knowledge, not been addressed yet.
Objectives: Improve our knowledge/understanding of accent (perception/pruduction) using automatic speech processing methods.
Work program: Data collection of accented speakers and accent impostors. Definition of acoustic/prosodic/pronounciation features. Speech synthesis of controlled accent stimuli. Automatic classification and perceptual experiments involving natural accent impostor stimuli as well as synthetic accent-graded stimuli.
Extra information
Prerequisite
Détails: SujetThese2010_LIMSI_Boula_Adda.pdf
Expected funding: Institutional funding
Status of funding: Expected
Candidates
Utilisateur: martine.adda-decker
Créé: Vendredi 26 février 2010 18:05:37 CET
dernière modif.: Vendredi 26 février 2010 18:05:37 CET

Fichiers joints

	filename	créé	hits	filesize
	SujetThese2010_LIMSI_Boula_Adda.pdf	26 Feb 2010 18:05	2534	40.17 Kb

Connexion

Ecole Doctorale Informatique Paris-Sud

Directrice
Nicole Bidoit
Assistante
Stéphanie Druetta
Conseiller aux thèses
Dominique Gouyou-Beauchamps

ED 427 - Université Paris-Sud
UFR Sciences Orsay
Bat 650 - aile nord - 417
Tel : 01 69 15 63 19
Fax : 01 69 15 63 87
courriel: ed-info à lri.fr