[action] PoseConv3D

Ref: [action] Action Recognition by Skeleton

Paper on Trend

深兰科技|一文了解基于ST-GCN的人体动作识别与生成【过去】
PoseC3D: 基于人体姿态的动作识别新范式【现在】
[CVPR22 Oral] PoseConv3D: Processing Skeleton Data with 3D-CNN【现在】

ST-GCN是香港中文大学提出一种时空图卷积网络，可以用它进行人类行为识别。这种算法基于人类关节位置的时间序列表示而对动态骨骼建模，并将图卷积扩展为时空图卷积网络而捕捉这种时空的变化关系。

不同于传统的基于人体 3 维骨架的 GCN 方法，PoseC3D 仅使用 2 维人体骨架热图堆叠作为输入，就能达到更好的识别效果。

大部分骨骼动作识别的工作采用 GCN 来提取骨骼的特征。尽管被广泛使用，但 GCN 方法依然在鲁棒性、兼容性和可扩展性上存在一定缺陷。

Let's: https://youtu.be/IWKY5PyF0LU?t=955

Heatmap Volumn

多个人，输入？如何处理？

17个关节，32 frames。

W的热图，其中K是关节数，H和W是帧的高度和宽度

基于提取好的 2D 姿态，我们需要堆叠

动作识别：https://github.com/open-mmlab/mmaction2/blob/master/README_zh-CN.md

Supported Methods.

Action Recognition
C3D (CVPR'2014)	TSN (ECCV'2016)	I3D (CVPR'2017)	I3D Non-Local (CVPR'2018)	R(2+1)D (CVPR'2018)
TRN (ECCV'2018)	TSM (ICCV'2019)	TSM Non-Local (ICCV'2019)	SlowOnly (ICCV'2019)	SlowFast (ICCV'2019)
CSN (ICCV'2019)	TIN (AAAI'2020)	TPN (CVPR'2020)	X3D (CVPR'2020)	OmniSource (ECCV'2020)
MultiModality: Audio (ArXiv'2020)	TANet (ArXiv'2020)	TimeSformer (ICML'2021)
Action Localization
SSN (ICCV'2017)	BSN (ECCV'2018)	BMN (ICCV'2019)
Spatio-Temporal Action Detection
ACRN (ECCV'2018)	SlowOnly+Fast R-CNN (ICCV'2019)	SlowFast+Fast R-CNN (ICCV'2019)	LFB (CVPR'2019)
Skeleton-based Action Recognition
ST-GCN (AAAI'2018)	2s-AGCN (CVPR'2019)	PoseC3D (ArXiv'2021)

Results and models are available in the README.md of each method's config directory. A summary can be found on the model zoo page.

Where is RGBPose-Conv3D?

Regarding the implementation of poseC3D considering both RGB and pose input #1221

Jeba-create opened this issue on 12 Oct 2021 · 15 comments

Hi, Jeba-create, if you want to implement the RGBPoseSlowFast in the PoseC3D paper on ur own:

- You need to create a new dataset, which provides samples consist of RGB videos and 2D skeletons (For example, video file path and skeleton in a single dictionary). (register it in DATASETS)
- You need to create components in the data pipeline to process such samples. (register it in PIPELINES)
- You need to create a two-stream backbone, which takes both RGB frames and heatmap volumes (maybe in a tuple) as input. (register it in BACKBONES)
Implementing RGBPoseSlowFast requires some effort, if you just need RGB+Pose-based predictions, you can fuse predictions of two individual streams directly.

Hi, the rest part of the PoseC3D project (RGBPose-SlowFast & PoseC3D + Kinetics) will be released after the paper gets published.

Ref: https://github.com/kennymckormick/pyskl【dev版本】

[Submitted on 28 Apr 2021 (v1), last revised 2 Apr 2022 (this version, v2)]

Revisiting Skeleton-based Action Recognition

等待。。。

Continue ...

posted @ 2022-04-02 17:15 郝壹贰叁阅读(736) 评论(0) 收藏举报

刷新页面返回顶部

机器学习水很深

We all have two lives. The second one starts when we realize that we only have one. --- Tom Hiddleston

[action] PoseConv3D

Heatmap Volumn

Where is RGBPose-Conv3D?

公告