Construction of a New Data Set of Pleural Fluid Cytological Images for Research

Authors

  • Frida López-Córdova Universidad Nacional Mayor de San Marcos, Lima, Peru https://orcid.org/0000-0002-0494-6800
  • Hugo Vega-Huerta Universidad Nacional Mayor de San Marcos, Lima, Peru https://orcid.org/0000-0002-4268-5808
  • Gisella Luisa Elena Maquen-Niño Universidad Nacional Pedro Ruíz Gallo, Lambayeque, Peru https://orcid.org/0000-0002-9224-5456
  • Jaime Cáceres-Pizarro Hospital Nacional Cayetano Heredia, Lima, Peru
  • Ivan Adrianzén-Olano Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Peru https://orcid.org/0000-0002-1910-2854
  • Oscar Benito-Pacheco Universidad Nacional Mayor de San Marcos, Lima, Peru

DOI:

https://doi.org/10.3991/ijoe.v21i07.54323

Keywords:

Cytology, Medical Imaging, Cytological examination, Pleural fluid, Deep learning

Abstract


The limited availability of standardized datasets has hindered the implementation of artificial intelligence (AI) models in serous fluid cytology, particularly in pleural fluid analysis. In this paper, we present the construction of a dataset of pleural fluid cytology images. The objective is to generate a dataset of pleural fluid cytologic images validated by two pathologists and classified into five categories for cell diagnosis, which will be used to train AI models. As a methodology, the images represent pleural fluid cytology samples that have been prepared through medical procedures and transferred to slides, providing valuable information when evaluated under the microscope by medical specialists through cytological examination. We documented the entire process for building the pleural fluid cytological image dataset, from image capture, labeling, preprocessing, standardization, and uploading to public platforms. As a result, we obtained a pleural fluid cytology dataset based on the International System (TIS) criteria for reporting serous fluids, classifying samples into AUS, MAL, ND, NFM, and SFM. This dataset is intended to support medical research, deep learning applications in medical image analysis, and improved diagnostic methodologies.

Author Biographies

Frida López-Córdova, Universidad Nacional Mayor de San Marcos, Lima, Peru

Professor at UNMSM

Hugo Vega-Huerta, Universidad Nacional Mayor de San Marcos, Lima, Peru

Is a principal professor at UNMSM and a researcher specializing in Computer Science, Artificial Intelligence, business management. Hold a doctorate in Systems Engineering at UNFV, Master in Administration at UNMSM. Academic Vice Dean at the Faculty of Systems Engineering - UNMSM (2016-2020), Director of the Software Engineering Program at URP (2011-2014). Responsible for the YACHAY Research Group - UNMSM. In 2012 he obtained the recognition "Scientific Merit Award" by the UNMSM for the number of articles published and the Research Papers.

Downloads

Published

2025-06-03

How to Cite

López-Córdova, F., Vega-Huerta, H., Maquen-Niño, G. L. E., Cáceres-Pizarro, J., Adrianzén-Olano, I., & Benito-Pacheco, O. (2025). Construction of a New Data Set of Pleural Fluid Cytological Images for Research. International Journal of Online and Biomedical Engineering (iJOE), 21(07), pp. 138–151. https://doi.org/10.3991/ijoe.v21i07.54323

Issue

Section

Papers