Human Action Recognition: A Comprehensive Survey of Multimodal Advances, Challenges, and Emerging Directions

Authors

  • Umar Suleiman Bichi Department of Computer Science & Engineering, Faculty of Engineering and Technology, Vivekananda Global University, Jaipur, India
  • Sunusi Bala Abdullahi Department of Information System Engineering, Faculty of Computer and Information System Engineering, Sakarya University, Turkey https://orcid.org/0000-0003-1898-7352

Keywords:

Human Action recognition, Multimodal Fusion, Sensor-Based Activity Recognition, Privacy-Preserving Models, Transformer Architectures, Graph Neural Networks

Abstract

Human Action Recognition (HAR) has become a pivotal field within computer vision and machine learning, with transformative applications in surveillance, healthcare, human-computer interaction, and sports analytics. Despite notable advances, a persistent gap remains between benchmark-driven performance and real-world deployment, particularly regarding cross-subject generalization, fine-grained action understanding, computational scalability, and privacy preservation. This survey provides a systematic and critical review of HAR research published between 2022 and 2025, analyzing 30 peer-reviewed articles from the IEEE Xplore digital library. We trace the progression from unimodal frameworks to multimodal fusion architectures, highlighting innovations across skeleton-, sensor-, and vision-based modalities. Key architectural trends include transformer-based models, graph neural networks, and self-supervised learning, alongside domain-specific adaptations in healthcare and sports. We also examine methodological shifts toward lightweight, privacy-aware, and generalizable systems. By synthesizing these developments, the review outlines emerging research directions and highlights priorities such as robust evaluation protocols, ethical safeguards, and deployment-ready HAR solutions, thereby guiding future work in the field.

Downloads

Published

2025-10-28

How to Cite

Bichi, U. S., & Abdullahi, S. B. (2025). Human Action Recognition: A Comprehensive Survey of Multimodal Advances, Challenges, and Emerging Directions. International Journal of Theoretical & Applied Computational Intelligence, 305–322. Retrieved from https://ijtaci.com/index.php/ojs/article/view/36

Issue

Section

Articles