数据集格式转换以及标注框可视化脚本_算法训练可视化脚本资源-CSDN下载

共9个文件

py：9个

需积分: 5 144 浏览量 2024-11-23 15:07:24 上传评论收藏 19KB ZIP 举报

在计算机视觉领域中，数据集的管理和格式转换是核心任务之一。随着深度学习模型在图像处理中的广泛应用，不同格式的数据集对于模型训练和评估至关重要。数据集格式转换脚本提供了一种方便快捷的方式来在不同的标注格式之间进行转换，进而满足不同模型框架的需求。常见的标注格式有COCO和YOLO两种，它们分别代表了不同的数据集格式标准。 COCO数据集格式是一种广泛采用的标注体系，其标注文件通常以.json扩展名存储，包含了丰富的图像信息和标注数据。COCO格式能够详细描述图像中的多个对象，每个对象都有自己的类别、边界框、分割掩码等属性。这种格式的灵活性和丰富的信息使其成为许多计算机视觉任务的首选。 YOLO格式是另一种常见的标注格式，其标注文件通常以.txt扩展名存储，用于YOLO系列模型。YOLO格式通常只记录了图像中的对象的类别和边界框的坐标。YOLO格式的简洁性使得它在实时目标检测任务中非常流行。数据集格式转换以及标注框可视化脚本的出现，大大简化了研究人员和工程师在不同格式数据集间切换的过程。这些脚本允许用户在YOLO和COCO格式之间互相转换，使得数据能够轻松地适应不同的训练框架。对于标注框的可视化，脚本能够将标注框以图形化的方式展示出来，这对于验证标注的准确性、调试算法和进行结果分析都至关重要。在实际应用中，数据集格式转换脚本的使用可以带来以下几点好处： 1. 灵活性：研究人员可以根据自己的需求选择不同的标注格式，无论是进行模型训练还是模型评估，都能够找到合适的标注体系。 2. 高效性：自动化转换脚本能够快速处理大量数据，相比于手动转换大大提高了工作效率，减少了重复劳动。 3. 可视化：能够直观地查看标注框在图像中的位置和形状，对于识别标注错误、评估标注质量以及进一步的图像分析都有极大的帮助。 4. 兼容性：转换脚本确保了数据的一致性，使得在不同的计算机视觉框架之间迁移和使用数据变得更加容易。计算机视觉领域的快速发展对数据集管理提出了更高的要求，格式转换脚本和可视化工具的结合使用，成为了推动该领域发展的重要力量。通过这样的工具，研究者和工程师可以更加专注于模型的开发和创新，而不必担心数据格式的兼容和管理问题。概括来说，数据集格式转换以及标注框可视化脚本的作用是提供一个高效、灵活的工具，帮助用户在不同标注格式间进行快速转换，并通过可视化手段直观地查看和分析标注数据。这不仅提高了数据处理的效率，而且加强了数据集的质量控制，对于促进计算机视觉研究和应用的发展具有重要意义。

资源推荐

资源详情

资源评论

收起资源包目录

dataset-change.zip （9个子文件）

dataset-change

yolo_visial.py 6KB

yolo2coco.py 6KB

coco2yolov2.py 4KB

voc2ylo.py 5KB

yolo2voc.py 5KB

voc_visiual.py 6KB

coco_visiual.py 6KB

voc2coco.py 8KB

coco2voc.py 4KB

import xml.etree.ElementTree as ET import os import json from datetime import datetime import sys import argparse coco = dict() coco['images'] = [] coco['type'] = 'instances' coco['annotations'] = [] coco['categories'] = [] category_set = dict() image_set = set() category_item_id = -1 image_id = 000000 annotation_id = 0 def addCatItem(name): global category_item_id category_item = dict() category_item['supercategory'] = 'none' category_item_id += 1 category_item['id'] = category_item_id category_item['name'] = name coco['categories'].append(category_item) category_set[name] = category_item_id return category_item_id def addImgItem(file_name, size): global image_id if file_name is None: raise Exception('Could not find filename tag in xml file.') if size['width'] is None: raise Exception('Could not find width tag in xml file.') if size['height'] is None: raise Exception('Could not find height tag in xml file.') image_id += 1 image_item = dict() image_item['id'] = image_id image_item['file_name'] = file_name image_item['width'] = size['width'] image_item['height'] = size['height'] image_item['license'] = None image_item['flickr_url'] = None image_item['coco_url'] = None image_item['date_captured'] = str(datetime.today()) coco['images'].append(image_item) image_set.add(file_name) return image_id def addAnnoItem(object_name, image_id, category_id, bbox): global annotation_id annotation_item = dict() annotation_item['segmentation'] = [] seg = [] # bbox[] is x,y,w,h # left_top seg.append(bbox[0]) seg.append(bbox[1]) # left_bottom seg.append(bbox[0]) seg.append(bbox[1] + bbox[3]) # right_bottom seg.append(bbox[0] + bbox[2]) seg.append(bbox[1] + bbox[3]) # right_top seg.append(bbox[0] + bbox[2]) seg.append(bbox[1]) annotation_item['segmentation'].append(seg) annotation_item['area'] = bbox[2] * bbox[3] annotation_item['iscrowd'] = 0 annotation_item['ignore'] = 0 annotation_item['image_id'] = image_id annotation_item['bbox'] = bbox annotation_item['category_id'] = category_id annotation_id += 1 annotation_item['id'] = annotation_id coco['annotations'].append(annotation_item) def read_image_ids(image_sets_file): ids = [] with open(image_sets_file, 'r') as f: for line in f.readlines(): ids.append(line.strip()) return ids def parseXmlFilse(data_dir, json_save_path, split='train'): assert os.path.exists(data_dir), "data path:{} does not exist".format(data_dir) labelfile = split + ".txt" image_sets_file = os.path.join(data_dir, "ImageSets", "Main", labelfile) xml_files_list = [] if os.path.isfile(image_sets_file): ids = read_image_ids(image_sets_file) xml_files_list = [os.path.join(data_dir, "Annotations", f"{i}.xml") for i in ids] elif os.path.isdir(data_dir): # 修改此处xml的路径即可 # xml_dir = os.path.join(data_dir,"labels/voc") xml_dir = data_dir xml_list = os.listdir(xml_dir) xml_files_list = [os.path.join(xml_dir, i) for i in xml_list] for xml_file in xml_files_list: if not xml_file.endswith('.xml'): continue tree = ET.parse(xml_file) root = tree.getroot() # 初始化 size = dict() size['width'] = None size['height'] = None if root.tag != 'annotation': raise Exception('pascal voc xml root element should be annotation, rather than {}'.format(root.tag)) # 提取图片名字 file_name = root.findtext('filename') assert file_name is not None, "filename is not in the file" # 提取图片 size {width,height,depth} size_info = root.findall('size') assert size_info is not None, "size is not in the file" for subelem in size_info[0]: size[subelem.tag] = int(subelem.text) if file_name is not None and size['width'] is not None and file_name not in image_set: # 添加coco['image'],返回当前图片ID current_image_id = addImgItem(file_name, size) print('add image with name: {}\tand\tsize: {}'.format(file_name, size)) elif file_name in image_set: raise Exception('file_name duplicated') else: raise Exception("file name:{}\t size:{}".format(file_name, size)) # 提取一张图片内所有目标object标注信息 object_info = root.findall('object') if len(object_info) == 0: continue # 遍历每个目标的标注信息 for object in object_info: # 提取目标名字 object_name = object.findtext('name') if object_name not in category_set: # 创建类别索引 current_category_id = addCatItem(object_name) else: current_category_id = category_set[object_name] # 初始化标签列表 bndbox = dict() bndbox['xmin'] = None bndbox['xmax'] = None bndbox['ymin'] = None bndbox['ymax'] = None # 提取box:[xmin,ymin,xmax,ymax] bndbox_info = object.findall('bndbox') for box in bndbox_info[0]: bndbox[box.tag] = int(box.text) if bndbox['xmin'] is not None: if object_name is None: raise Exception('xml structure broken at bndbox tag') if current_image_id is None: raise Exception('xml structure broken at bndbox tag') if current_category_id is None: raise Exception('xml structure broken at bndbox tag') bbox = [] # x bbox.append(bndbox['xmin']) # y bbox.append(bndbox['ymin']) # w bbox.append(bndbox['xmax'] - bndbox['xmin']) # h bbox.append(bndbox['ymax'] - bndbox['ymin']) print('add annotation with object_name:{}\timage_id:{}\tcat_id:{}\tbbox:{}'.format(object_name, current_image_id, current_category_id, bbox)) addAnnoItem(object_name, current_image_id, current_category_id, bbox) json_parent_dir = os.path.dirname(json_save_path) if not os.path.exists(json_parent_dir): os.makedirs(json_parent_dir) json.dump(coco, open(json_save_path, 'w')) print("class nums:{}".format(len(coco['categories']))) print("image nums:{}".format(len(coco['images']))) print("bbox nums:{}".format(len(coco['annotations']))) if __name__ == '__main__': """ 脚本说明：本脚本用于将VOC格式的标注文件.xml转换为coco格式的标注文件.json 参数说明： voc_data_dir:两种格式 1.voc2012文件夹的路径，会自动找到voc2012/imageSets/Main/xx.txt 2.xml标签文件存放的文件夹 json_save_path:json文件输出的文件夹 split:主要用于voc2012查找xx.txt,如train.txt.如果用格式2，则不会用到该参数 """ parser = argparse.ArgumentParser() parser.add_argument('-d', '--voc-dir', type=str, default='data/label/voc', help='voc path') parser.add_argument('-s', '--save-path', type=str, default='./dat

评论收藏

内容反馈