ePapers Repository

Modeling accents for automatic speech recognition

Najafian, Maryam (2013) Modeling accents for automatic speech recognition. In: University of Birmingham Graduate School Research Poster Conference 2013 , 12th June 2013, University of Birmingham. (Unpublished)

[img]
Preview
PDF
UGSposter2013Najafian.pdf
622Kb

Abstract

In British English the term 'accent' refers to systematic variations in pronunciation, often associated with particular geographic regions. Accent is one of the most frequently cited causes of variability in speech. The problem of accents is becoming more important with the advancement of computerised services and Automatic Speech Recognition (ASR) systems. Speech recognition technology is used in a wide range of applications and services such as health care, education, automated call centres, authentication and information services. In many of these applications accents and foreign languages posed a problem for speech recognition developers because, if the people using the system could not be understood, they might become frustrated and stop using the system. For example, a recent news story reported that an automated phone system deployed by Birmingham City Council could not cope with ‘Brummie’ accents [1]. This research investigates how we can exploit the knowledge of accents to obtain both rapid and robust ASR systems for British accented utterances using only 30 seconds of speech.

When we hear another person’s speech for the first time, we quickly establish a 'profile' of that person based on his or her speech, in terms of factors such as gender, age, accent, and social group. It is possible that we use this characterization to adapt very quickly to that person’s speech. Current ASR systems typically differentiate between genders, but otherwise tend to ignore important factors including accents. Additionally, conventional adaptation techniques for ASR require a substantial amount of training material from each individual to be able to adapt the system in order to have a more user specific system. Now the question is whether the concept of ‘regional accent’ is useful to overcome these two problems.

The first section describes the techniques used to visualise the space where accent recognition is performed, otherwise known as the ‘accent recognition’ space, and show the extent to which the emergent structure is consistent with subjective notions of accent. The two techniques used are Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA).

We then interpret speech recognition results on accented speech from 14 different British regions, analysed in terms of their structure in the ‘accent recognition’ space.

Following this we present results on accent adaptation using conventional adaptation techniques such as MAP (Maximum A Posteriori Adaptation) and MLLR (Maximum Likelihood Linear Regression). We use these accent adaptation techniques along with the knowledge from accent recognition techniques, such as the ACCDIST accent recognition measure, to provide an accent robust, rapid speaker adaptation system.

1. “Brummie accents baffle automated phone system ... at Birmingham City Council”, Daily Mail, 5th November 2012.

Type of Work:Conference or Workshop Item (Poster)
School/Faculty:Colleges (2008 onwards) > College of Engineering & Physical Sciences
Department:School of Electronic, Electrical and Computer Engineering
Additional Information:

Research Supervisor: Prof Martin Russell

Date:June 2013
Series/Collection Name:Prizewinners from the Graduate School Research Poster Conference 2013
Subjects:T Technology > T Technology (General)
Related URLs:
URLURL Type
https://intranet.birmingham.ac.uk/as/studentservices/graduateschool/news/public/rpc2013winners.aspxOrganisation
Copyright Status:This poster is copyright of the author and/or third parties. The intellectual property rights in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this poster must be in accordance with that legislation and must be properly acknowledged.
Copyright Holders:The Author
ID Code:1736

Export Reference As : ASCII + BibTeX + Dublin Core + EndNote + HTML + METS + MODS + OpenURL Object + Reference Manager + Refer + RefWorks
Share this item :
QR Code for this page

Repository Staff Only: item control page