[Conference] INTERSPEECH 2024, Kos Island, Greece, September 2024

Authors:

Nikhil Jakhar, Sudhanshu Srivastava, Arun Baby

Abstract:

This work presents an integrated framework that combines language identification with multilingual automatic speech recognition (ASR), built on the Whisper architecture. For Indic languages, the proposed approach achieves an absolute 19.1% reduction in Word Error Rate (WER) and improves language identification by 6% as measured by Diarization Error Rate (DER).

Cite:

@inproceedings{jakhar24_interspeech,
  title = {{A Unified Approach to Multilingual Automatic Speech Recognition with Improved Language Identification for Indic Languages}},
  author = {Nikhil Jakhar and Sudhanshu Srivastava and Arun Baby},
  year = {2024},
  booktitle = {{Interspeech 2024}},
  pages = {3949--3953},
  doi = {10.21437/Interspeech.2024-2043}
}
