Loading Now

Summary of Xls-r Deep Learning Model For Multilingual Asr on Low- Resource Languages: Indonesian, Javanese, and Sundanese, by Panji Arisaputra et al.


XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese

by Panji Arisaputra, Alif Tri Handoyo, Amalia Zahra

First submitted to arxiv on: 12 Jan 2024

Categories

  • Main: Computation and Language (cs.CL)
  • Secondary: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The study develops and evaluates an Automatic Speech Recognition (ASR) system, specifically designed for Indonesian, Javanese, and Sundanese languages. Using the XLS-R 300m model, the researchers aimed to improve ASR performance in converting spoken language into written text. The methodology employed includes testing procedures, datasets used, and training techniques. The results show competitive Word Error Rate (WER) measurements, with some compromise in performance for Javanese and Sundanese languages. Integrating a 5-gram KenLM language model significantly reduces WER and enhances ASR accuracy. This research contributes to advancing ASR technology by addressing linguistic diversity and improving performance across various languages.
Low GrooveSquid.com (original content) Low Difficulty Summary
This study makes an automatic speech recognition (ASR) system better for Indonesian, Javanese, and Sundanese languages. The researchers use a special model called XLS-R 300m to improve how well the system works. They test it on different languages and find that it does pretty well, but not perfectly. Adding another tool called KenLM makes it even better! This research helps make ASR systems more useful for people who speak these languages.

Keywords

* Artificial intelligence  * Language model