DecepNet: A Hybrid NLP Framework for Robust Phishing Detection Under Adversarial Conditions

Gargi Choudhury; Shivam Choudhury

doi:10.47363/JAICC/ICADCCS2026/2026(5)12

Authors

Gargi Choudhury, Independent Researchers, India
Shivam Choudhury, Independent Researchers, India

Keywords:

Phishing Detection, Natural Language Processing (NLP), Adversarial Machine Learning, Deception Modeling, Transformer Models, Cybersecurity, Email Security, Hybrid Machine Learning

Abstract

Phishing attacks continue to evolve with increasing sophistication, particularly with the emergence of AI-generated content that closely mimics legitimate communication. Traditional phishing detection systems, which rely on static text classification models, often struggle to generalize across diverse datasets and fail under subtle adversarial modifications. In this work, we propose DecepNet, a hybrid phishing detection framework that integrates natural language processing with structural and behavioral feature analysis to enhance detection performance and robustness.

The proposed system combines TF-IDF-based representations and transformer-based semantic modeling with a learned deception function designed to capture persuasive intent through linguistic, contextual, and behavioral signals such as urgency, authority cues, and threat-reward framing. The model is trained on a multi-source dataset built from publicly available corpora, including the Apache SpamAssassin public corpus and the Enron-Spam dataset, supplemented with phishing samples derived from URL-based sources and synthetically generated adversarial data.
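As a rough illustration of the kind of signals named above, the following minimal sketch counts urgency, authority, and threat-reward cues in an email body. It is not the authors' deception function, which is described as learned; the lexicons, the `CUE_LEXICONS` name, and the `cue_features` helper are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical cue lexicons (assumption): the abstract does not publish
# the actual feature definitions used by DecepNet.
CUE_LEXICONS = {
    "urgency": ["immediately", "urgent", "within 24 hours", "act now"],
    "authority": ["security team", "administrator", "it department", "bank"],
    "threat_reward": ["suspended", "locked", "prize", "refund"],
}


def cue_features(text):
    """Count case-insensitive, whole-phrase lexicon hits per cue category."""
    lowered = text.lower()
    features = {}
    for category, phrases in CUE_LEXICONS.items():
        features[category] = sum(
            len(re.findall(r"\b" + re.escape(phrase) + r"\b", lowered))
            for phrase in phrases
        )
    return features


email = (
    "URGENT: your account will be suspended. "
    "Contact the security team within 24 hours."
)
print(cue_features(email))
# {'urgency': 2, 'authority': 1, 'threat_reward': 1}
```

In a hybrid setup, counts like these would typically be concatenated with TF-IDF or transformer embeddings before classification; how DecepNet actually fuses them is not stated in the abstract.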

To evaluate robustness, we introduce an adversarial test setup that applies paraphrasing, tone modification, and obfuscation. Experimental results show that conventional machine learning models degrade noticeably under these perturbations, while the proposed framework remains more stable and generalizes better across datasets. Comparison with baseline models, including Logistic Regression, Random Forest, and transformer-based classifiers, highlights the benefit of integrating deception-aware features.
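To make the obfuscation case concrete, here is a minimal sketch, under stated assumptions, of one possible perturbation: swapping Latin letters for look-alike Cyrillic characters. The `HOMOGLYPHS` map, the `obfuscate` function, and the naive `keyword_detector` baseline are hypothetical and are not taken from the paper's test setup; they only show why surface-level text matching can break under such edits.

```python
# Hypothetical look-alike map (Latin -> Cyrillic); an assumption for
# illustration, not the perturbation set used in the paper.
HOMOGLYPHS = {
    "a": "\u0430",
    "e": "\u0435",
    "o": "\u043e",
    "c": "\u0441",
    "p": "\u0440",
}


def obfuscate(text):
    """Replace selected lowercase Latin letters with similar Cyrillic ones."""
    return "".join(HOMOGLYPHS.get(ch, ch) for ch in text)


def keyword_detector(text, keywords=("password", "verify", "account")):
    """Naive baseline: flag a message if any keyword appears verbatim."""
    lowered = text.lower()
    return any(keyword in lowered for keyword in keywords)


original = "Please verify your account password."
perturbed = obfuscate(original)

print(keyword_detector(original))   # True
print(keyword_detector(perturbed))  # False
```

The perturbed message looks nearly identical to a human reader yet no longer contains any of the ASCII keywords, which is the kind of gap a robustness evaluation is meant to expose.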

This study emphasizes the importance of combining semantic understanding with behavioral modeling in phishing detection and provides a practical, adaptable framework for addressing evolving social engineering threats in real-world environments.

Journal of Artificial Intelligence & Cloud Computing
