Human Action Recognition: A Comprehensive Survey of Multimodal Advances, Challenges, and Emerging Directions

Authors

  • Umar Suleiman Bichi Department of Computer Science & Engineering, Faculty of Engineering and Technology, Vivekananda Global University, Jaipur, India
  • Sunusi Bala Abdullahi Department of Information System Engineering, Faculty of Computer and Information System Engineering, Sakarya University, Turkey https://orcid.org/0000-0003-1898-7352

DOI:

https://doi.org/10.65278/IJTACI.2025.36

Keywords:

Human Action recognition, Multimodal Fusion, Sensor-Based Activity Recognition, Privacy-Preserving Models, Transformer Architectures, Graph Neural Networks

Abstract

Human Action Recognition (HAR) has become a pivotal field within computer vision and machine learning, with transformative applications in surveillance, healthcare, human-computer interaction, and sports analytics. Despite notable advances, a persistent gap remains between benchmark-driven performance and real-world deployment, particularly regarding cross-subject generalization, fine-grained action understanding, computational scalability, and privacy preservation. This survey provides a systematic and critical review of HAR research published between 2022 and 2025, analyzing 30 peer-reviewed articles from the IEEE Xplore digital library. We trace the progression from unimodal frameworks to multimodal fusion architectures, highlighting innovations across skeleton-, sensor-, and vision-based modalities. Key architectural trends include transformer-based models, graph neural networks, and self-supervised learning, alongside domain-specific adaptations in healthcare and sports. We also examine methodological shifts toward lightweight, privacy-aware, and generalizable systems. By synthesizing these developments, the review outlines emerging research directions and highlights priorities such as robust evaluation protocols, ethical safeguards, and deployment-ready HAR solutions, thereby guiding future work in the field.

Downloads

Download data is not yet available.

Downloads

Published

2025-10-28

How to Cite

Bichi, U. S., & Abdullahi, S. B. (2025). Human Action Recognition: A Comprehensive Survey of Multimodal Advances, Challenges, and Emerging Directions. International Journal of Theoretical & Applied Computational Intelligence, vol. 2025, 305–322. https://doi.org/10.65278/IJTACI.2025.36

Issue

Section

Articles

Similar Articles

<< < 1 2 3 > >> 

You may also start an advanced similarity search for this article.