ARGai 2.0: A Feature Engineering Enabled Deep Network Model for Antibiotic Resistance Gene and Strain Identification in E. coli

Authors

DOI:

https://doi.org/10.3991/ijoe.v21i01.52521

Keywords:

Antimicrobial resistance, Antibiotic resistant genes, Deep learning, Transfer learning, SMOTE, Urine tract infections

Abstract


Escherichia coli (E. coli) is a group of bacteria that cause infections in the gastrointestinal (GI) tract and urinary tract (UTIs). The rise in antimicrobial resistance (AMR) due to antibioticresistant genes (ARGs) linked to E. coli strains that cause UTIs poses a significant threat. Identifying ARGs and resistant strains is crucial for effective treatment. To identify ARGs and classify resistant strains in E. coli utilizing gene expression (GE) data with advanced computational techniques such as feature engineering and transfer learning (TL). In TL, knowledge acquired by the baseline model is transferred to the target domain (BI-LSTMGRU). ARGai 2.0 utilizes the Synthetic Minority Over-sampling Technique (SMOTE) for oversampling the GE dataset to evaluate the effectiveness of the proposed TL framework. Our proposed ARGai 2.0 model achieved a higher classification accuracy of 14% compared to 1D CNN and 11% compared to BI-LSTM-GRU individually. The analysis revealed that genes associated with the nitrate reductase operon (narU, narV, narW, narY, and narZ) exhibit high connectivity and interaction scores, indicating their central role in nitrate metabolism. This aligns with the high enrichment FDR of 3.07E-10 and fold enrichment of 229.28 for pathways related to nitrate reductase complex and nitrite transmembrane transporter activity. ARGai 2.0 successfully detected the significant genes responsible for antibiotic resistance and classified the resistant strains. The gene network analysis highlights the central role of nitrate metabolism genes, while peripheral genes like ansP and yncG are involved in more specialized functions.

Author Biographies

Arpita Priyadarshini, Utkal University, Bhubaneswar, Odisha, India

Arpita Priyadarshini is a Master's (M. Tech) student at Department of Statistics, Utkal University, India. She holds a B. Tech degree with specialization in Information Technology from Silicon Institute of Technology, India. Her areas of interest in research are computational modeling and gene expression profiling, as well as the use of statistical and machine learning approaches in data analysis. She is skilled in statistical modeling, feature selection, R, Python, and sophisticated machine learning algorithms. She can be contacted at email: pri.arpita@gmail.com.

Sweta Padma Routray, Siksha ‘O’ Anusandhan (Deemed to be) University, Bhubaneswar, Odisha, India

Sweta Padma Routray completed her B.Sc. and M.Sc. degrees in Bioinformatics at Buxi Jagabandhu Bidyadhar Autonomous College, India, in 2017 and 2019, respectively. Currently, she is pursuing her Ph.D. in Biotechnology at the Center of Biotechnology, Siksha O Anusandhan Deemed to be University. Her primary research interests lie in the fields of bioinformatics, microbial genomics, and transcriptomics.

Santanu Kumar Sahoo, Siksha ‘O’ Anusandhan (Deemed to be) University, Bhubaneswar, Odisha, India

Santanu Kumar Sahoo is currently working as Associate Professor, department of Electronics and Communication Engineering, FET-ITER, Siksha ‘O’ Anusandhan University. He received his B Tech degree in electronics and communication engineering from Utkal University, Odisha, India in 2004 and Doctoral degree in communication system engineering from Siksha O Anusandhan University, Odisha, India in 2018. His areas of interest are biomedical signal and image processing. He can be contacted at email: santanusahoo@soa.ac.in.

Tripti Swarnkar, National Institute of Technology, Raipur, Chhattisgarh, India

Tripti Swarnkar received the Ph.D. degree in Computer Science & Engineering from IIT Kharagpur WB India. She is currently a professor department of Computer Application, National Institute of Technology, Raipur, India. Previously she is worked as Professor and Head, department of Computer Application, Faculty of Engineering & Technology Siksha ‘O’ Anusandhan Deemed to be University. She has more than 2 decades of teaching experience in the field of Computer Science & Engineering. She has worked in different capacity in the field of academics, as well as an administration. She is currently guiding five Ph.D. and six have already being awarded. Dr. Swarnkar's principal research interest is Machine learning, Omics data analysis and medical image analysis. Her aspiration is to work at the interface of these different fields or Multidisciplinary Environment. She is IEEE senior member, IEEE EMBS member, IEEE GRSS society member. She has organized many conferences as well as workshops successfully, recently chaired the Women in imaging session of IEEE ICAIHC-2025 and I&P Track of IEEE Returning Mothers Conference-2024. She has also worked as a Principal Investigator of Multidisciplinary Project on "Validation of Artificial Intelligence (AI) based models in screening and diagnosis of diseases in routine clinical practices", sponsored by Intel India. She can be contacted at email: tswarnkar.mca@nitrr.ac.in.

Downloads

Published

2025-01-16

How to Cite

Nayak, D. S. K., Priyadarshini, A., Routray, S. P., Sahoo, S. K., & Swarnkar, T. (2025). ARGai 2.0: A Feature Engineering Enabled Deep Network Model for Antibiotic Resistance Gene and Strain Identification in E. coli. International Journal of Online and Biomedical Engineering (iJOE), 21(01), pp. 76–96. https://doi.org/10.3991/ijoe.v21i01.52521

Issue

Section

Papers