Bayesian Networks and Discrete Observations for Robust Speech Recognition
Pp. 133-140 (8)
Antonio Miguel, Alfonso Ortega and Eduardo Lleida
Traditionally, in speech recognition, the hidden Markov model state emission probability
distributions are usually associated to continuous random variables, by using Gaussian
mixtures. Thus, the inter-feature dependence is not accurately modeled by the covariance matrix,
since it only considers pairs of variables. The mixture is the part of the model which usually
captures this information, but this is done in a loose and inefficient way. Graphical models provide
a precise and simple mechanism to model the dependencies among two or more variables.
We propose the use of discrete random variables as observations and graphical models to extract
the internal dependence structure in the feature vectors. A method to estimate a graphical model
with a constrained number of dependencies is shown in this chapter, which is a special kind of
Bayesian network. Experimental results show that this method can be considered robust as compared to standard baseline systems.
Graphical Models, Bayesian networks, Maximum Likelihood, Expectation maximization
Communications Technology Group (GTC), Aragon Institute for Engineering Research (I3A), University of Zaragoza, Spain