You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A Human Action Recognition (HAR) model combining 3D CNN and LSTM networks to accurately recognize actions in videos using spatial-temporal feature extraction. Trained on UCF-50 and outperforming existing architectures.
Efficient video action recognition using hybrid techniques: combining ORB, SIFT, and deep models like VideoMAE and (2+1)D Conv to reduce data size while maintaining performance.