Advanced Machine Learning Models for Detecting and Classifying Financial Fraud in Big Data-Driven

Authors

  • Achuthananda Reddy Polu Senior SDE, Cloudhub IT Solutions, USA Author
  • Bhumeka Narra Sr Software Developer, Statefarm, USA Author
  • Dheeraj Varun Kumar Reddy Buddula Software Engineer, Elevance Health Inc, USA Author
  • Hari Hara Sudheer Patchipulusu Senior Software Engineer, Walmart, USA Author
  • Navya Vattikonda Business Intelligence Engineer, International Medical Group Inc, USA Author
  • Anuj Kumar Gupta Oracle ERP Senior Business Analyst ,Genesis Alkali, USA Author

DOI:

https://doi.org/10.47363/JAICC/2024(3)448

Keywords:

Financial Fraud detection, Credit Card Fraud, big data, Machine Learning K-Nearest Neighbor (KNN)

Abstract

The banking sector faces a major challenge in identifying credit card fraud, especially as online transactions increase. This study employs the Kaggle Credit Card Fraud Detection dataset to present a machine learning (ML)-based method to credit card fraud detection. The collection contains de-identified transaction information from European cardholders. With 284,807 transactions, only 492 included fraud, suggesting a significant disparity in class.Therefore, data balancing techniques were used to improve model training. Among the data pretreatment procedures were label encoding for categorical conversion and standardization to normalize feature scales. Using Euclidean distance, for the purpose of identifying the majority-belonging k-nearest neighbor class, A classifier called K-Nearest Neighbors (KNN) was created. The model was assessed using ROC-AUC, F1-score, accuracy, precision, recall,and K-fold cross-validation, among other important performance measures. The KNN model outperformed benchmark models like MLP and Naïve Bayes,obtaining 98.56% accuracy and an AUC of 96.07, according to experimental data, demonstrating great classification efficacy. The promise of KNN in creating reliable and accurate fraud detection systems for cybersecurity applications is confirmed by these findings.

Author Biographies

  • Achuthananda Reddy Polu, Senior SDE, Cloudhub IT Solutions, USA

    Achuthananda Reddy Polu, Senior SDE, Cloudhub IT Solutions, USA

  • Bhumeka Narra, Sr Software Developer, Statefarm, USA


    Sr Software Developer, Statefarm, USA

  • Dheeraj Varun Kumar Reddy Buddula, Software Engineer, Elevance Health Inc, USA

    Software Engineer, Elevance Health Inc, USA

  • Hari Hara Sudheer Patchipulusu, Senior Software Engineer, Walmart, USA

    Senior Software Engineer, Walmart, USA

  • Navya Vattikonda, Business Intelligence Engineer, International Medical Group Inc, USA


    Business Intelligence Engineer, International Medical Group Inc, USA 

  • Anuj Kumar Gupta, Oracle ERP Senior Business Analyst ,Genesis Alkali, USA

    Oracle ERP Senior Business Analyst ,Genesis Alkali, USA

Downloads

Published

2024-06-10

How to Cite

Advanced Machine Learning Models for Detecting and Classifying Financial Fraud in Big Data-Driven. (2024). Journal of Artificial Intelligence & Cloud Computing, 3(6), 1-7. https://doi.org/10.47363/JAICC/2024(3)448

Similar Articles

31-40 of 401

You may also start an advanced similarity search for this article.