Feature Selection Methods in Big Medical Databases: A Comprehensive Survey
Keywords:
Medical databases, Features selection, Machine learning, Evaluation criteria, Medical informaticsAbstract
Medical science is rapidly evolving alongside the continuous growth of medical data recording systems, resulting in massive and diverse datasets. While these data offer valuable opportunities, their high volume and complexity create significant challenges in processing, particularly in tasks such as classification and clustering. Feature selection has emerged as a critical strategy to overcome these issues by improving efficiency, accuracy, interpretability, and scalability. This article presents a comprehensive survey of feature selection methods with a dedicated focus on medical data, providing a structured categorization into filter, wrapper, and embedded approaches. It further reviews evaluation criteria, highlights strengths and limitations of each category, and discusses their relevance through examples from real-world medical applications. By combining theoretical perspectives with practical insights, this work contributes a clear roadmap for researchers in healthcare informatics, emphasizing that effective feature selection can substantially enhance medical data analysis and support future advancements in the field.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Mohsen Karimi, Zahra Karimi, Mahsa Khosravi, Zeinab Delaram, Mostafa Habibi Dehsheikhim, Somayeh Arab Najafabadi, Mohammadreza Alizadeh Aliabadi, Nakisa Tavakoli

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

