MindX SDK + yolov5 Deployment and Python Image/Video Inference Implementation
I. Preface
This article is based on the MindX SDK + PyTorch yolov5 application case on Huawei Cloud:
https://bbs.huaweicloud.com/forum/thread-118598-1-1.html
The original post uses the pretrained yolov5s.onnx model and implements image inference in C++. Because its Python implementation is incomplete, this article implements both image and video inference in Python.
The overall workflow:
1. Base environment: Atlas800-3000, mxManufacture, Ascend-CANN-toolkit, Ascend Driver
2. Model conversion: export the PyTorch model to ONNX, yolov5s.pt -> yolov5s.onnx (a sketch of steps 2 and 3 follows this list)
3. Simplify the ONNX model, then convert it to an OM model
4. Stream (pipeline) orchestration and configuration (an illustrative pipeline sketch appears after the stream-initialization code in Section II)
5. Python inference code development (image and video)
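Steps 2 and 3 are run once, offline, before the SDK application is started. The snippet below is only a rough sketch of that tool chain under a few assumptions: the Ultralytics yolov5 repository has been cloned (its export.py flags vary between versions), onnx-simplifier is installed, the CANN toolkit's atc tool is on the PATH, the --soc_version value matches your Ascend chip, and all file names are illustrative.

# Sketch of the offline model-conversion chain (see assumptions above).
import subprocess

# Step 2: yolov5s.pt -> yolov5s.onnx, using the yolov5 repo's export script
# (run from inside the cloned repository; flag names differ across versions).
subprocess.run(
    ['python', 'export.py', '--weights', 'yolov5s.pt',
     '--include', 'onnx', '--imgsz', '640', '--opset', '11'],
    check=True)

# Step 3a: simplify the exported ONNX graph with onnx-simplifier.
subprocess.run(['python', '-m', 'onnxsim', 'yolov5s.onnx', 'yolov5s_sim.onnx'],
               check=True)

# Step 3b: convert the simplified ONNX model to an offline .om model with ATC.
subprocess.run(
    ['atc', '--model=yolov5s_sim.onnx', '--framework=5',   # framework 5 = ONNX
     '--output=yolov5s', '--soc_version=Ascend310',
     '--input_shape=images:1,3,640,640'],
    check=True)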
II. Image Inference Workflow Implementation
1. Initialize the stream manager.
import json

import cv2
# StreamManagerApi and MxDataInput come from the MindX SDK Python API.
from StreamManagerApi import StreamManagerApi, MxDataInput

streamManagerApi = StreamManagerApi()
ret = streamManagerApi.InitManager()
if ret != 0:
    print("Failed to init Stream manager, ret=%s" % str(ret))
    exit()

# Build the stream(s) described in the pipeline file.
with open("../pipeline/yolov5x_example.pipeline", 'rb') as f:
    pipelineStr = f.read()
ret = streamManagerApi.CreateMultipleStreams(pipelineStr)
if ret != 0:
    print("Failed to create Stream, ret=%s" % str(ret))
    exit()
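The pipeline file loaded above ("../pipeline/yolov5x_example.pipeline") is the step-4 "stream orchestration" artifact: a JSON description of a plugin chain, keyed by the stream name the code later sends data to (classification+detection). The original post's pipeline is not reproduced here; the sketch below only illustrates the general shape of such a file, with plugin factory and property names taken from the MindX SDK sample pipelines. Treat the exact plugin set, property names, and paths as assumptions to be checked against your SDK version and the original post.

{
    "classification+detection": {
        "stream_config": { "deviceId": "0" },
        "appsrc0": { "factory": "appsrc", "next": "mxpi_imagedecoder0" },
        "mxpi_imagedecoder0": { "factory": "mxpi_imagedecoder", "next": "mxpi_imageresize0" },
        "mxpi_imageresize0": {
            "props": { "resizeHeight": "640", "resizeWidth": "640" },
            "factory": "mxpi_imageresize",
            "next": "mxpi_modelinfer0"
        },
        "mxpi_modelinfer0": {
            "props": {
                "modelPath": "../models/yolov5s.om",
                "postProcessConfigPath": "...",
                "labelPath": "...",
                "postProcessLibPath": "..."
            },
            "factory": "mxpi_modelinfer",
            "next": "mxpi_dataserialize0"
        },
        "mxpi_dataserialize0": {
            "props": { "outputDataKeys": "mxpi_modelinfer0" },
            "factory": "mxpi_dataserialize",
            "next": "appsink0"
        },
        "appsink0": { "factory": "appsink" }
    }
}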
2. Load the image and send it for inference.

dataPath = "dog.jpg"
savePath = "dog_result.jpg"

# Read the test image.
dataInput = MxDataInput()
with open(dataPath, 'rb') as f:
    dataInput.data = f.read()

# Send the image into the stream and fetch the result by unique id.
streamName = b'classification+detection'
inPluginId = 0
uniqueId = streamManagerApi.SendDataWithUniqueId(streamName, inPluginId, dataInput)
if uniqueId < 0:
    print("Failed to send data to stream.")
    exit()
inferResult = streamManagerApi.GetResultWithUniqueId(streamName, uniqueId, 3000)
if inferResult.errorCode != 0:
    print("GetResultWithUniqueId error. errorCode=%d, errorMsg=%s" % (
        inferResult.errorCode, inferResult.data.decode()))
    exit()

3. Parse the inference result, extract the box coordinates and confidence, and draw them on the image. The JSON result is parsed into a dictionary to obtain the two corner points (x0, y0) and (x1, y1) of each detected object and its confidence; OpenCV is then used to load the image and draw the detection box and confidence.
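For reference, the result returned by GetResultWithUniqueId is a serialized JSON string. Based only on the fields accessed by the parsing code below, the decoded dictionary has roughly the following shape (the values here are made up for illustration):

# Illustrative shape of json.loads(inferResult.data.decode()); values are invented.
{
    "MxpiObject": [
        {
            "x0": 132.4, "y0": 218.7, "x1": 310.5, "y1": 540.2,
            "classVec": [
                {"className": "dog", "confidence": 0.89}
            ]
        }
    ]
}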
infer_results = inferResult.data.decode()
temp_dic = json.loads(infer_results)
img = cv2.imread(dataPath)
if 'MxpiObject' in temp_dic.keys():
    for i in range(len(temp_dic["MxpiObject"])):
        # Class name and confidence of the i-th detection.
        name = temp_dic["MxpiObject"][i]["classVec"][0]["className"]
        confidence = temp_dic["MxpiObject"][i]["classVec"][0]["confidence"]
        text = name + ":" + str(confidence)
        # Corner points of the detection box.
        x0 = int(temp_dic["MxpiObject"][i]["x0"])
        y0 = int(temp_dic["MxpiObject"][i]["y0"])
        x1 = int(temp_dic["MxpiObject"][i]["x1"])
        y1 = int(temp_dic["MxpiObject"][i]["y1"])
        img = cv2.rectangle(img, (x0, y0), (x1, y1), (0, 255, 0), 2)
        cv2.putText(img, text, (x0, y0 + 20), cv2.FONT_HERSHEY_SIMPLEX, 0.8, (0, 255, 0), 2)
    cv2.imwrite(savePath, img)
else:
    cv2.putText(img, 'No object detected!', (0, 20), cv2.FONT_HERSHEY_SIMPLEX, 0.8, (0, 255, 0), 2)
    cv2.imwrite(savePath, img)

# Destroy the streams.
streamManagerApi.DestroyAllStreams()

Result:
III. Video Inference Implementation
Since the test video here is in mp4 format, OpenCV is used for video decoding: each frame is decoded, written out as an image, and then run through the same inference flow, so video inference is essentially the same as image inference. Alternatively, you can convert the mp4 file to H.264, since Ascend hardware supports direct H.264/H.265 decoding (a conversion sketch follows).
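If you want the Ascend hardware to do the decoding instead, the mp4 container can first be turned into a raw H.264 stream. A minimal sketch using ffmpeg, assuming ffmpeg is installed and the mp4 already carries an H.264 video track (so no re-encoding is needed); the file names are illustrative:

# Extract the H.264 elementary stream from an mp4 container with ffmpeg.
import subprocess

subprocess.run(
    ['ffmpeg', '-i', 'test.mp4',      # input mp4
     '-c:v', 'copy',                  # copy the video track, no re-encoding
     '-bsf:v', 'h264_mp4toannexb',    # rewrite to an Annex-B byte stream
     '-f', 'h264', 'test.h264'],      # raw H.264 output
    check=True)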
The concrete implementation:
videoCapture = cv2.VideoCapture(videoPath)
# Frame rate, width, and height of the source video.
fps = videoCapture.get(cv2.CAP_PROP_FPS)
width = videoCapture.get(cv2.CAP_PROP_FRAME_WIDTH)
height = videoCapture.get(cv2.CAP_PROP_FRAME_HEIGHT)

# Frames are resized to the 640x640 model input; these factors map the
# detection coordinates back to the original frame size.
model_width = 640
model_height = 640
x_scale = width / model_width
y_scale = height / model_height

size = (int(width), int(height))
videoWriter = cv2.VideoWriter(savePath, cv2.VideoWriter_fourcc('X', 'V', 'I', 'D'), fps, size)

count = 0
success, frame = videoCapture.read()
while success:
    # Resize the frame to the model input size and save it as a temporary JPEG.
    img_temp = 'temp.jpg'
    img = cv2.resize(frame, (model_width, model_height), interpolation=cv2.INTER_LINEAR)
    cv2.imwrite(img_temp, img)

    dataInput = MxDataInput()
    with open(img_temp, 'rb') as f:
        dataInput.data = f.read()
    streamName = b'classification+detection'
    inPluginId = 0
    uniqueId = streamManagerApi.SendDataWithUniqueId(streamName, inPluginId, dataInput)
    if uniqueId < 0:
        print("Failed to send data to stream.")
        exit()

    # Obtain the inference result by specifying streamName and uniqueId.
    inferResult = streamManagerApi.GetResultWithUniqueId(streamName, uniqueId, 3000)
    if inferResult.errorCode != 0:
        print("GetResultWithUniqueId error. errorCode=%d, errorMsg=%s" % (
            inferResult.errorCode, inferResult.data.decode()))
        exit()

    infer_results = inferResult.data.decode()
    temp_dic = json.loads(infer_results)
    # print(infer_results)
    if 'MxpiObject' in temp_dic.keys():
        for i in range(len(temp_dic["MxpiObject"])):
            name = temp_dic["MxpiObject"][i]["classVec"][0]["className"]
            confidence = temp_dic["MxpiObject"][i]["classVec"][0]["confidence"]
            text = name + ":" + str(confidence)
            # Scale the 640x640 coordinates back to the original frame.
            x0 = int(x_scale * temp_dic["MxpiObject"][i]["x0"])
            y0 = int(y_scale * temp_dic["MxpiObject"][i]["y0"])
            x1 = int(x_scale * temp_dic["MxpiObject"][i]["x1"])
            y1 = int(y_scale * temp_dic["MxpiObject"][i]["y1"])
            img = cv2.rectangle(frame, (x0, y0), (x1, y1), (0, 255, 0), 2)
            cv2.putText(frame, text, (x0, y0 + 20), cv2.FONT_HERSHEY_SIMPLEX, 0.8, (0, 255, 0), 2)
    videoWriter.write(frame)
    count += 1
    print(count)
    success, frame = videoCapture.read()

# Release the video handles and destroy the streams.
videoCapture.release()
videoWriter.release()
streamManagerApi.DestroyAllStreams()

Video inference result:
Summary

This post walked through deploying yolov5 with the MindX SDK on Ascend hardware: converting yolov5s.pt to ONNX, simplifying it and converting it to an OM model, configuring the inference pipeline, and implementing Python inference for both images and video.