TechBot: A Computer Science Focused QA System

Authors

  • Alisha Rauniyar Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Ankit Shrestha Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Ayush Tamang Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Alaka Rai Department of Computer and Electronics Engineering, Kantipur Engineering College, Dhapakhel, Lalitpur, Nepal
  • Ajay Mani Paudel Executive Director, Natural Language Processing Hub, Tangal, Kathmandu, Nepal

DOI:

https://doi.org/10.3126/injet.v2i2.78612

Keywords:

Chatbot, Web Scraping, Transformers, Language transformation, Natural Language Processing, Computer Science

Abstract

This paper presents Tech Bot: A Computer Science Focused QA System designed for the computer science domain. Since there is a rapid increase in the field of computer science, a system that can provide accurate, context-aware responses specific to computer science is desperately needed. TechBot addresses this need by combining the advanced capabilities of Natural Language Processing techniques and Deep Learning Models. It includes a strong preprocessing pipeline, a query comprehension module that is capable of decoding technical terms and an answer retrieval framework optimized for computer science content. It performs better in the computer science domain than in general-purpose systems. Using the Hugging Face Dataset Library with extra web scraping, 20,076 question-answer pairs were collected. Secondly, the QA pairs were split into training and validation sets. Two state-of-the-art transformer-based sequence-to-sequence models, T5 and BART, which are designed for natural language generation, were employed. After fine-tuning on the domain-specific dataset, BART outperformed T5 based on the ROUGE score evaluated on the validation set. Initially, the BART model with 140 million parameters was trained for 21 epochs, achieving a ROUGE score of 0.2867. Finally, an average ROUGE score of 0.2890 was obtained on the test dataset.

Downloads

Download data is not yet available.
Abstract
230
PDF
138

Downloads

Published

2025-05-19

How to Cite

Rauniyar, A., Shrestha, A., Tamang, A., Rai, A., & Paudel, A. M. (2025). TechBot: A Computer Science Focused QA System. International Journal on Engineering Technology, 2(2), 145–154. https://doi.org/10.3126/injet.v2i2.78612

Issue

Section

Articles