Self-supervised phonotactic representations for language identification

G. Ramesh; C.S. Kumar; Sri Rama Murty Kodukula

doi:10.21437/Interspeech.2021-1310

Profiles Research Units Publications

Conferences

Self-supervised phonotactic representations for language identification

G. Ramesh, C.S. Kumar,

Published in International Speech Communication Association

2021

DOI: 10.21437/Interspeech.2021-1310

Volume: 2

Pages: 861 - 865

Abstract

Phonotactic constraints characterize the sequence of permissible phoneme structures in a language and hence form an important cue for language identification (LID) task. As phonotactic constraints span across multiple phonemes, the short-term spectral analysis (20-30 ms) alone is not sufficient to capture them. The speech signal has to be analyzed over longer contexts (100s of milliseconds) in order to extract features representing the phonotactic constraints. The supervised senone classifiers, aimed at modeling triphone context, have been used for extracting language-specific features for the LID task. However, it is difficult to get large amounts of manually labeled data to train the supervised models. In this work, we explore a selfsupervised approach to extract long-term contextual features for the LID task. We have used wav2vec architecture to extract contextualized representations from multiple frames of the speech signal. The contextualized representations extracted from the pre-trained wav2vec model are used for the LID task. The performance of the proposed features is evaluated on a dataset containing 7 Indian languages. The proposed self-supervised embeddings achieved 23% absolute improvement over the acoustic features and 3% absolute improvement over their supervised counterparts. Copyright © 2021 ISCA.

About the journal

Journal	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher	International Speech Communication Association
ISSN	2308457X

Authors (1)

Sri Rama Murty Kodukula
- Department of Electrical Engineering

ACADEMICS

FACILITIES

CAMPUS LIFE

COUNCILS

QUICK LINKS