TY - GEN
T1 - Comparative Analysis of YOLOv5/v8/v9 for Object Detection, Tracking, and Human Action Recognition in Combat Sports
AU - Quinn, Evan
AU - Corcoran, Niall
N1 - Publisher Copyright:
© Proceedings of the 4th International Conference on AI Research, ICAIR 2024.
PY - 2024
Y1 - 2024
N2 - YOLO models are widely used object detectors in computer vision (CV). This study investigates the relative performance of YOLOv5, YOLOv8, and YOLOv9 for object detection, tracking, and human action recognition in combat sports. The models were evaluated using curated datasets encompassing various combat scenarios, athlete movements, and equipment configurations. Pre-processing protocols and augmentation techniques were applied to improve model accuracy and generalizability, including automated orientation correction, image dimension standardisation, contrast enhancement, and methods such as zoom, rotation, shear, and grayscale conversion. The key findings provide insight into the comparative performance of the models across various evaluation metrics, such as precision, recall, and mean average precision. Each model's ability to detect, track, and recognise human actions in dynamic combat sports environments is evaluated. Computational efficiency and real-time performance were assessed as these are important indicators for practical applications in coaching, training, and competitive scoring systems. The findings suggest that YOLOv8 offers the best balance of precision and recall, making it particularly suitable for real-time applications in combat sports analytics. This study contributes to advancing CV technologies in combat sports analytics, with potential implications for improving athletic training methods, facilitating personalised coaching interventions, and enhancing objectivity and consistency in competitive scoring processes in combat sports.
AB - YOLO models are widely used object detectors in computer vision (CV). This study investigates the relative performance of YOLOv5, YOLOv8, and YOLOv9 for object detection, tracking, and human action recognition in combat sports. The models were evaluated using curated datasets encompassing various combat scenarios, athlete movements, and equipment configurations. Pre-processing protocols and augmentation techniques were applied to improve model accuracy and generalizability, including automated orientation correction, image dimension standardisation, contrast enhancement, and methods such as zoom, rotation, shear, and grayscale conversion. The key findings provide insight into the comparative performance of the models across various evaluation metrics, such as precision, recall, and mean average precision. Each model's ability to detect, track, and recognise human actions in dynamic combat sports environments is evaluated. Computational efficiency and real-time performance were assessed as these are important indicators for practical applications in coaching, training, and competitive scoring systems. The findings suggest that YOLOv8 offers the best balance of precision and recall, making it particularly suitable for real-time applications in combat sports analytics. This study contributes to advancing CV technologies in combat sports analytics, with potential implications for improving athletic training methods, facilitating personalised coaching interventions, and enhancing objectivity and consistency in competitive scoring processes in combat sports.
KW - Combat sports
KW - Computer vision
KW - Object detection and tracking
KW - YOLOv5
KW - YOLOv8
KW - YOLOv9
UR - http://www.scopus.com/inward/record.url?scp=85215690219&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85215690219
T3 - Proceedings of the 4th International Conference on AI Research, ICAIR 2024
SP - 364
EP - 373
BT - Proceedings of the 4th International Conference on AI Research, ICAIR 2024
A2 - Goncalves, Carlos
A2 - Rouco, Jose Carlos Dias
PB - Academic Conferences International Limited
T2 - 4th International Conference on AI Research, ICAIR 2024
Y2 - 5 December 2024 through 6 December 2024
ER -