COCO-WholeBody 注释详解
官方数据集说明:https://github.com/jin-s13/COCO-WholeBody
1. 如何预览 json 文件?
使用 Vscode,右键 格式化文档即可。
2. 字段解读
2.1 关键字段之 "images"
"images": [ { "license": 4, "file_name": "000000397133.jpg", "coco_url": "http://images.cocodataset.org/val2017/000000397133.jpg", "height": 427, "width": 640, "date_captured": "2013-11-14 17:02:52", "flickr_url": "http://farm7.staticflickr.com/6116/6255196340_da26cf2c9e_z.jpg", "id": 397133 # 图像名字中的数字 }, ...
上述 "images" 字段的个数是 图像的个数。
2.2 关键字段之 "annotations"
"segmentation": [[实例分割的点]], "num_keypoints": 10, # 有效的标注的身体关键点, (v>0), (e.g. crowds and small objects, will have num_keypoints=0). "area": 47803.27955, # 身体的 bbox 的面积 "iscrowd": 0, "keypoints": [17*3], # (x,y,v) # a visibility flag v defined as v=0: not labeled (in which case x=y=0), v=1: labeled but not visible, and v=2: labeled and visible. A keypoint is considered visible if it falls inside the object segment. "image_id": 425226, # 对应 images 中的 id "bbox": [], # 身体的 bbox,(x,y,w,h) 左上角,宽高 "category_id": 1, # 物体的类别 "id": 183126, # 关键:目标的唯一编号 "face_box": [], "lefthand_box": [], "righthand_box": [], "lefthand_kpts": [21*3] "righthand_kpts": [ 308.0, 239.0, 0.1408521831035614, ... ], # 21*3 (xyv) hand 的 v 的含义与 其他的关键点不同点在于:v 有时候是 小于1 的小数,v>0 means that the keypoint is reliable. "face_kpts": [68*3], #(x,y,v)v 含义同 body 关键点 # The validity of the face/hand/foot are used to minimize the labelling uncertainty. Only if the face/hand images are clear enough for keypoint labeling (for annotators), the validity is True, otherwise False. Invalid cases may include severely blur or occlusion. We only label keypoints/boxes for valid cases. Invalid boxes/keypoints are simply set as all-zero arrays. "face_valid": false, "lefthand_valid": true, "righthand_valid": true, "foot_valid": true, "foot_kpts": [6*3] #(x,y,v)v 含义同 body 关键点
简记为
annotation{ "face_box": list([x, y, w, h]), "lefthand_box": list([x, y, w, h]), "righthand_box": list([x, y, w, h]), "foot_kpts": list([x, y, v] * 6), "face_kpts": list([x, y, v] * 68), "lefthand_kpts": list([x, y, v] * 21), "righthand_kpts": list([x, y, v] * 21), "face_valid": bool, "lefthand_valid": bool, "righthand_valid": bool, "foot_valid": bool, "[cloned]": ..., } categories[{ "[cloned]": ..., }]
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步
· winform 绘制太阳,地球,月球 运作规律
· 震惊!C++程序真的从main开始吗?99%的程序员都答错了
· AI与.NET技术实操系列(五):向量存储与相似性搜索在 .NET 中的实现
· 【硬核科普】Trae如何「偷看」你的代码?零基础破解AI编程运行原理
· 超详细:普通电脑也行Windows部署deepseek R1训练数据并当服务器共享给他人