A Comprehensive Review of Reinforcement Learning

Authors

  • Ramesh Ghimire Midwest University, Graduate School of Engineering
  • Devendra Kumar Labh Karna Midwest University, Graduate School of Engineering

DOI:

https://doi.org/10.3126/mujoei.v1i1.91094

Keywords:

Reinforcement Learning, Deep reinforcement learning, Q learning, Offline reinforcement learning

Abstract

Reinforcement Learning (RL) is a paradigm of machine learning in which an agent learns optimal behavior through trial-and-error interactions with an environment guided by feedback in the form of rewards. This survey provides a comprehensive overview of Reinforcement Learning (RL), beginning with fundamental concepts (Markov decision processes, policies, value functions, and reward signals) and progressing to the prominent categories of RL algorithms: value-based, policy-based, actor-critic architectures, and model-based. We categorize and explain these families and highlight representative algorithms (from Q-learning and Deep Q-Networks to policy gradient methods like REINFORCE, proximal policy optimization, and more). We then summarize recent advancements, including the deep RL revolution that combines neural networks with RL to solve high-dimensional problems, the emergence of offline RL for learning from fixed datasets, progress in multi-agent RL for complex, competitive, and cooperative systems, and hierarchical RL for temporal abstraction. Key real-world applications are reviewed in domains such as robotics (where RL enables autonomous control and manipulation), game playing (where RL has achieved human- and superhuman-level performance in video games and board games), finance (for algorithmic trading and decision-making under uncertainty), and healthcare (for treatment planning and medical decision support).

Downloads

Download data is not yet available.
Abstract
0
PDF
0

Author Biographies

Ramesh Ghimire, Midwest University, Graduate School of Engineering

Instructor

Devendra Kumar Labh Karna, Midwest University, Graduate School of Engineering

Asst. Prof.

Downloads

Published

2025-12-01

How to Cite

Ghimire, R., & Karna, D. K. L. (2025). A Comprehensive Review of Reinforcement Learning. Mid-West University Journal of Engineering & Innovation, 1(1), 1–13. https://doi.org/10.3126/mujoei.v1i1.91094

Issue

Section

Original Articles