Today, while writing a labelmetovoc script to convert labelme annotations into the standard VOC format (following the article referenced here), I ran into the following problem: ValueError: With n_samples=0, test_size=0.2 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters. Most blog posts I found online attribute this to newer versions of scikit-learn and suggest downgrading to something below 0.20.0, but doing so also forces you to reinstall compatible versions of numpy, scipy, and other libraries, so I would not recommend trying that lightly. I asked a senior labmate instead, and he said the problem is in the parameter settings: the number of train or test samples must work out to a whole number. The corrected code is shown below:
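For reference, here is a minimal standalone sketch (not part of the conversion script below; the sample names are made up) of how train_test_split interprets its size arguments: a float test_size is treated as a fraction of the samples, an int as an absolute count, and the call raises exactly the ValueError quoted above whenever the resulting train or test set would be empty, for example when the input list has no elements at all.

from sklearn.model_selection import train_test_split

names = ["000000", "000001", "000002", "000003", "000004"]

# float test_size: take 20% of the 5 samples (1 sample) for validation
train, val = train_test_split(names, test_size=0.2)

# int test_size: take exactly 1 sample for validation
train, val = train_test_split(names, test_size=1)

# An empty input list reproduces the error quoted above:
# ValueError: With n_samples=0, test_size=0.2 and train_size=None, ...
# train_test_split([], test_size=0.2)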
# -*- coding: utf-8 -*-
"""
Created on Fri Apr 10 13:39:17 2020

@author: nihao
"""
import os
import numpy as np
import codecs
import json
import glob
import cv2
import shutil
from sklearn.model_selection import train_test_split

# 1. Label paths
labelme_path = "D:\\PinInspection\\jpgretanglelabel"  # directory with the original labelme annotations
saved_path = "D:\\PinInspection\\improveto300\\VOC2007"  # output directory

# 2. Create the required folders
dst_annotation_dir = os.path.join(saved_path, 'Annotations')
if not os.path.exists(dst_annotation_dir):
    os.makedirs(dst_annotation_dir)
dst_image_dir = os.path.join(saved_path, "JPEGImages")
if not os.path.exists(dst_image_dir):
    os.makedirs(dst_image_dir)
dst_main_dir = os.path.join(saved_path, "ImageSets", "Main")
if not os.path.exists(dst_main_dir):
    os.makedirs(dst_main_dir)

# 3. Collect the files to be processed
org_json_files = sorted(glob.glob(os.path.join(labelme_path, '*.json')))
org_json_file_names = [i.split("\\")[-1].split(".json")[0] for i in org_json_files]
org_img_files = sorted(glob.glob(os.path.join(labelme_path, '*.jpg')))
org_img_file_names = [i.split("\\")[-1].split(".jpg")[0] for i in org_img_files]

# 4. Convert each labelme file to a VOC annotation
for i, json_file_ in enumerate(org_json_files):
    json_file = json.load(open(json_file_, "r", encoding="utf-8"))
    image_path = os.path.join(labelme_path, org_json_file_names[i] + '.jpg')
    img = cv2.imread(image_path)
    height, width, channels = img.shape
    dst_image_path = os.path.join(dst_image_dir, "{:06d}.jpg".format(i))
    cv2.imwrite(dst_image_path, img)
    dst_annotation_path = os.path.join(dst_annotation_dir, '{:06d}.xml'.format(i))
    with codecs.open(dst_annotation_path, "w", "utf-8") as xml:
        xml.write('<annotation>\n')
        xml.write('\t<folder>' + 'Pin_detection' + '</folder>\n')
        xml.write('\t<filename>' + "{:06d}.jpg".format(i) + '</filename>\n')
        # xml.write('\t<source>\n')
        # xml.write('\t\t<database>The UAV autolanding</database>\n')
        # xml.write('\t\t<annotation>UAV AutoLanding</annotation>\n')
        # xml.write('\t\t<image>flickr</image>\n')
        # xml.write('\t\t<flickrid>NULL</flickrid>\n')
        # xml.write('\t</source>\n')
        # xml.write('\t<owner>\n')
        # xml.write('\t\t<flickrid>NULL</flickrid>\n')
        # xml.write('\t\t<name>ChaojieZhu</name>\n')
        # xml.write('\t</owner>\n')
        xml.write('\t<size>\n')
        xml.write('\t\t<width>' + str(width) + '</width>\n')
        xml.write('\t\t<height>' + str(height) + '</height>\n')
        xml.write('\t\t<depth>' + str(channels) + '</depth>\n')
        xml.write('\t</size>\n')
        xml.write('\t\t<segmented>0</segmented>\n')
        for multi in json_file["shapes"]:
            points = np.array(multi["points"])
            xmin = min(points[:, 0])
            xmax = max(points[:, 0])
            ymin = min(points[:, 1])
            ymax = max(points[:, 1])
            label = multi["label"]
            if xmax <= xmin:
                pass
            elif ymax <= ymin:
                pass
            else:
                xml.write('\t<object>\n')
                xml.write('\t\t<name>' + label + '</name>\n')
                xml.write('\t\t<pose>Unspecified</pose>\n')
                xml.write('\t\t<truncated>1</truncated>\n')
                xml.write('\t\t<difficult>0</difficult>\n')
                xml.write('\t\t<bndbox>\n')
                xml.write('\t\t\t<xmin>' + str(xmin) + '</xmin>\n')
                xml.write('\t\t\t<ymin>' + str(ymin) + '</ymin>\n')
                xml.write('\t\t\t<xmax>' + str(xmax) + '</xmax>\n')
                xml.write('\t\t\t<ymax>' + str(ymax) + '</ymax>\n')
                xml.write('\t\t</bndbox>\n')
                xml.write('\t</object>\n')
                print(json_file_, xmin, ymin, xmax, ymax, label)
        xml.write('</annotation>')
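# (Optional sanity check, not part of the original script: parse the first
#  generated XML with the standard library to confirm it is well-formed VOC.)
import xml.etree.ElementTree as ET
_generated = glob.glob(os.path.join(dst_annotation_dir, "*.xml"))
if _generated:
    _root = ET.parse(_generated[0]).getroot()
    print("sample annotation:", _generated[0],
          _root.find("size/width").text, "x", _root.find("size/height").text)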
# 5. Split files for txt
train_file = os.path.join(dst_main_dir, 'train.txt')
trainval_file = os.path.join(dst_main_dir, 'trainval.txt')
val_file = os.path.join(dst_main_dir, 'val.txt')
test_file = os.path.join(dst_main_dir, 'test.txt')

ftrain = open(train_file, 'w')
ftrainval = open(trainval_file, 'w')
fval = open(val_file, 'w')
ftest = open(test_file, 'w')

total_annotation_files = glob.glob(os.path.join(dst_annotation_dir, "*.xml"))
total_annotation_names = [i.split("\\")[-1].split(".xml")[0] for i in total_annotation_files]

# test_filepath = ""
for file in total_annotation_names:
    ftrainval.writelines(file + '\n')
# test
# for file in os.listdir(test_filepath):
# ftest.write(file.split(".jpg")[0] + "\n")
# split
train_files, val_files = train_test_split(total_annotation_names, test_size=0.2)
# train
for file in train_files:
    ftrain.write(file + '\n')
# val
for file in val_files:
    fval.write(file + '\n')

ftrainval.close()
ftrain.close()
fval.close()
# ftest.close()
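One more note on the original error: since the message reports n_samples=0, it is also worth confirming that the Annotations folder actually received files before the split runs. A small guard (reusing the variable names from the script above; this is my own addition, not part of the original code) could be placed just before the train_test_split call:

if not total_annotation_names:
    raise RuntimeError(
        "No .xml files found in " + dst_annotation_dir +
        "; check labelme_path and that the JSON/JPG pairs were converted first.")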