Speech Recognition-Driven Language Learning: The Case of Tiny Talks for Nepali Using Wav2Vec2

Authors

  • Archana Mahat Dept. of Computer Engineering, Kathmandu Engineering College, Nepal
  • Nush Ojha Dept. of Computer Engineering, Kathmandu Engineering College, Nepal
  • Aarati Acharya Dept. of Computer Engineering, Kathmandu Engineering College, Nepal
  • Anjali Sapkota Dept. of Computer Engineering, Kathmandu Engineering College
  • Sudeep Shakya Head of Department, Dept. of Computer Engineering, Kathmandu Engineering College, Nepal

DOI:

https://doi.org/10.3126/kjse.v10i1.93873

Keywords:

Nepali language learning, automatic speech recognition (ASR), speech accuracy, early language acquisition, language preservation

Abstract

The TinyTalks project presents a mobile based system designed to support Nepali language learning among children aged 4 to 8. The application addresses the challenge of language preservation by offering an engaging and interactive learning environment tailored to young learners. TinyTalks integrates automatic speech recognition using a fine-tuned Wav2Vec2 model to evaluate children’s pronunciation in real time. Nepali speech samples collected from multiple speakers were used to train the model through supervised learning, resulting in a validation accuracy of about 75 percent. The final system provides interactive lessons, pronunciation feedback and simple quizzes that support early language acquisition. The study demonstrates the feasibility of combining mobile technology and speech recognition to assist foundational Nepali language learning for young children.

Downloads

Download data is not yet available.
Abstract
2
PDF
1

Downloads

Published

2026-05-05

How to Cite

Mahat, A., Ojha, N., Acharya, A., Sapkota, A., & Shakya, S. (2026). Speech Recognition-Driven Language Learning: The Case of Tiny Talks for Nepali Using Wav2Vec2. KEC Journal of Science and Engineering, 10(1), 164–168. https://doi.org/10.3126/kjse.v10i1.93873

Issue

Section

Articles