A Bayesian Approach to Estimation of Speaker Normalization Parameters

Ram, Dhananjay; Kundu, Debasis; Hegde, Rajesh M.

Computer Science > Sound

arXiv:1610.05948 (cs)

[Submitted on 19 Oct 2016]

Title:A Bayesian Approach to Estimation of Speaker Normalization Parameters

Authors:Dhananjay Ram, Debasis Kundu, Rajesh M. Hegde

View PDF

Abstract:In this work, a Bayesian approach to speaker normalization is proposed to compensate for the degradation in performance of a speaker independent speech recognition system. The speaker normalization method proposed herein uses the technique of vocal tract length normalization (VTLN). The VTLN parameters are estimated using a novel Bayesian approach which utilizes the Gibbs sampler, a special type of Markov Chain Monte Carlo method. Additionally the hyperparameters are estimated using maximum likelihood approach. This model is used assuming that human vocal tract can be modeled as a tube of uniform cross section. It captures the variation in length of the vocal tract of different speakers more effectively, than the linear model used in literature. The work has also investigated different methods like minimization of Mean Square Error (MSE) and Mean Absolute Error (MAE) for the estimation of VTLN parameters. Both single pass and two pass approaches are then used to build a VTLN based speech recognizer. Experimental results on recognition of vowels and Hindi phrases from a medium vocabulary indicate that the Bayesian method improves the performance by a considerable margin.

Comments:	23 Pages, 9 Figures
Subjects:	Sound (cs.SD); Computation and Language (cs.CL); Applications (stat.AP)
Cite as:	arXiv:1610.05948 [cs.SD]
	(or arXiv:1610.05948v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1610.05948

Submission history

From: Dhananjay Ram [view email]
[v1] Wed, 19 Oct 2016 10:16:46 UTC (778 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2016-10

Change to browse by:

cs
cs.CL
stat
stat.AP

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dhananjay Ram
Debasis Kundu
Rajesh M. Hegde

export BibTeX citation

Computer Science > Sound

Title:A Bayesian Approach to Estimation of Speaker Normalization Parameters

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:A Bayesian Approach to Estimation of Speaker Normalization Parameters

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators