目标检测 - VOC - xml标注格式

相对其他计算机视觉任务，目标检测算法的数据格式更为复杂。为了对数据进行统一的处理，目标检测数据一般都会做成VOC或者COCO的格式。

XML标注格式

<annotation>
    <folder>VOC_ROOT</folder>
    <filename>aaaa.jpg</filename>  # 文件名
    <size>                         # 图像尺寸（长宽以及通道数）通过读取图片获取
        <width>500</width>
        <height>332</height>
        <depth>3</depth>
    </size>
    <segmented>1</segmented>       # 是否用于分割（在图像物体识别中无所谓）
    <object>                       # 检测到的物体
        <name>horse</name>         # 物体类别
        <pose>Unspecified</pose>   # 拍摄角度，如果是自己的数据集就Unspecified
        <truncated>0</truncated>   # 是否被截断（0表示完整)
        <difficult>0</difficult>   # 目标是否难以识别（0表示容易识别）
        <bndbox>                   # bounding-box（包含左下角和右上角xy坐标）
            <xmin>100</xmin>
            <ymin>96</ymin>
            <xmax>355</xmax>
            <ymax>324</ymax>
        </bndbox>
    </object>
    <object>                       # 检测到多个物体
        <name>person</name>
        <pose>Unspecified</pose>
        <truncated>0</truncated>
        <difficult>0</difficult>
        <bndbox>
            <xmin>198</xmin>
            <ymin>58</ymin>
            <xmax>286</xmax>
            <ymax>197</ymax>
        </bndbox>
    </object>
</annotation>

构建与解析XML文件

使用 xml.etree.ElementTree 模块构建解析xml文件

posted @ 2023-05-10 18:51 bigroc 阅读(268) 评论(0) 收藏举报

刷新页面返回顶部

bigroc

https://bigroc.cn/

目标检测 - VOC - xml标注格式

目标检测 - VOC - xml标注格式

XML标注格式

构建与解析XML文件

公告