Advances in AI-Driven Cybersecurity: Tackling Prompt Injection Attacks through Adversarial Learning

Saroj Ghimire; Suman Thapaliya

doi:10.3126/joeis.v4i1.81604

Authors

Saroj Ghimire Lincoln University College, Malaysia
Suman Thapaliya Texas college of management and IT

DOI:

https://doi.org/10.3126/joeis.v4i1.81604

Keywords:

AI-driven cybersecurity, prompt injection, adversarial machine learning, large language models, secure AI systems

Abstract

The rapid adoption of large language models (LLMs) such as ChatGPT, Bard, and Claude has transformed human-computer interaction across various industries. However, this advancement introduces novel cybersecurity threats—particularly prompt injection attacks—that exploit the models’ instruction-following abilities to produce malicious or unintended outputs. Existing cybersecurity frameworks lack the capacity to counter these emerging threats, demanding AI- centric defense strategies.

This study explores the role of adversarial machine learning in mitigating prompt injection vulnerabilities. We conduct a comprehensive analysis of how adversarial prompts are crafted to circumvent content filters, hijack model behavior, and extract sensitive data. Building upon recent advances in adversarial learning, we propose a robust defense framework that combines adversarial training with input sanitization techniques to detect and neutralize harmful prompts.

The framework is evaluated on leading LLM platforms using both benchmark datasets and real- world scenarios. Results show enhanced resistance to both direct and indirect prompt injection attacks, with minimal compromise to model performance and responsiveness.

By embedding adversarial robustness into the deployment lifecycle of LLMs, our work advances the development of secure and trustworthy AI systems. These findings emphasize the need for evolving AI-native security protocols aligned with the dynamic nature of generative models, ensuring safe and responsible AI deployment.

Downloads

Download data is not yet available.

Abstract

104

pdf

64

Author Biographies

Saroj Ghimire, Lincoln University College, Malaysia

Lincoln University College, Malaysia

Suman Thapaliya, Texas college of management and IT

Texas college of management and IT

Advances in AI-Driven Cybersecurity: Tackling Prompt Injection Attacks through Adversarial Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biographies

Saroj Ghimire, Lincoln University College, Malaysia

Suman Thapaliya, Texas college of management and IT

Downloads

Published

How to Cite

Issue

Section

License

Information