Skeleton-based deep pose feature learning for action quality assessment on figure skating videos

Huiying Li, Qing Lei*, Hongbo Zhang, Jixiang Du, Shangce Gao

*この論文の責任著者

研究成果: ジャーナルへの寄稿学術論文査読

12 被引用数 (Scopus)

抄録

Most of the existing Action Quality Assessment (AQA) methods for scoring sports videos have deeply researched how to evaluate the single action or several sequential-defined actions that performed in short-term sport videos, such as diving, vault, etc. They attempted to extract features directly from RGB videos through 3D ConvNets, which makes the features mixed with ambiguous scene information. To investigate the effectiveness of deep pose feature learning on automatically evaluating the complicated activities in long-duration sports videos, such as figure skating and artistic gymnastic, we propose a skeleton-based deep pose feature learning method to address this problem. For pose feature extraction, a spatial–temporal pose extraction module (STPE) is built to capture the subtle changes of human body movements and obtain the detail representations for skeletal data in space and time dimensions. For temporal information representation, an inter-action temporal relation extraction module (ATRE) is implemented by recurrent neural network to model the dynamic temporal structure of skeletal subsequences. We evaluate the proposed method on figure skating activity of MIT-skate and FIS-V datasets. The experimental results show that the proposed method is more effective than RGB video-based deep feature learning methods, including SENet and C3D. Significant performance progress has been achieved for the Spearman Rank Correlation (SRC) on MIT-Skate dataset. On FIS-V dataset, for the Total Element Score (TES) and the Program Component Score (PCS), better SRC and MSE have been achieved between the predicted scores against the judge's ones when compared with SENet and C3D feature methods.

本文言語英語
論文番号103625
ジャーナルJournal of Visual Communication and Image Representation
89
DOI
出版ステータス出版済み - 2022/11

ASJC Scopus 主題領域

  • 信号処理
  • メディア記述
  • コンピュータ ビジョンおよびパターン認識
  • 電子工学および電気工学

フィンガープリント

「Skeleton-based deep pose feature learning for action quality assessment on figure skating videos」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル