Query word retrieval from continuous speech using GMM posteriorgrams

P.R. Reddy; K. Rout; Sri Rama Murty Kodukula

doi:10.1109/SPCOM.2014.6984011

Profiles Research Units Publications

Conferences

Query word retrieval from continuous speech using GMM posteriorgrams

P.R. Reddy, K. Rout,

Published in Institute of Electrical and Electronics Engineers Inc.

2014

DOI: 10.1109/SPCOM.2014.6984011

Abstract

The objective of this work is to study the issues involved in building an automatic query word retrieval system for broadcast news in an unsupervised framework, i.e., without using any labelled speech data. In the absence of labelled data, sequence of feature-vectors extracted from the query word have to be matched with those extracted from the test utterance. This is a non-trivial task, as typical feature-vectors like Mel-frequency cepstral coefficients (MFCC) carry both speech-specific and speaker-specific information. In this work, we have employed Gaussian mixture models (GMM) to extract speaker-independent features from the speech signal. Gaussian mixture model, trained on a large amount of speech data, is used to derive posterior features for each frame of speech signal. The sequence of posterior features are matched using dynamic time-warping algorithm to detect the presence of query word in the test utterance. The performance of the proposed method is evaluated on Telugu broadcast news database. It is observed that the posterior features extracted from GMM are better suited for query word retrieval compared to the MFCC features. © 2014 IEEE.

About the journal

Journal	Data powered by Typeset2014 International Conference on Signal Processing and Communications, SPCOM 2014
Publisher	Data powered by TypesetInstitute of Electrical and Electronics Engineers Inc.

Authors (1)

Sri Rama Murty Kodukula
- Department of Electrical Engineering

ACADEMICS

FACILITIES

CAMPUS LIFE

COUNCILS

QUICK LINKS