當前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

DL之YoloV2：Yolo V2算法的简介(论文介绍)、架构详解、案例应用等配图集合之详细攻略

發(fā)布時間：2025/3/21 编程问答 22 豆豆

生活随笔收集整理的這篇文章主要介紹了 DL之YoloV2：Yolo V2算法的简介(论文介绍)、架构详解、案例应用等配图集合之详细攻略小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

DL之YoloV2：Yolo V2算法的簡介(論文介紹)、架構詳解、案例應用等配圖集合之詳細攻略

Yolo V2算法的簡介(論文介紹)

1、YOLOV2的特點、改進、優(yōu)缺點

2、實驗結果

3、不同算法性能(mAP)比較

Yolo V2算法的架構詳解

Yolo V2算法的案例應用

相關文章
CV：人工智能之計算機視覺方向的簡介(常用數(shù)據(jù)集+CV職位)、傳統(tǒng)方法對比CNN類算法、計算機視覺八大應用(知識導圖+經(jīng)典案例)之詳細攻略
CV之OD：計算機視覺之目標檢測(Object Detection)方向的簡介、使用方法、案例應用之詳細攻略
CV之OD：計算機視覺之目標檢測(Object Detection)方向的思維導圖、單階段和兩步階段經(jīng)典算法演化之詳細攻略
DL之Yolo：Yolo算法的簡介(論文介紹)、架構詳解、案例應用等配圖集合之詳細攻略
DL之YoloV2：Yolo V2算法的簡介(論文介紹)、架構詳解、案例應用等配圖集合之詳細攻略
DL之YoloV2：Yolo V2算法的架構詳解
DL之YoloV3：Yolo V3算法的簡介(論文介紹)、架構詳解、案例應用等配圖集合之詳細攻略

Yolo V2算法的簡介(論文介紹)

摘要
? ? ? ??We introduce YOLO9000, a state-of-the-art, real-time object detection system that can detect over 9000 object categories. First we propose various improvements to the YOLO detection method, both novel and drawn from prior work. The improved model, YOLOv2, is state-of-the-art on standard detection tasks like PASCAL VOC and COCO. At 67 FPS, YOLOv2 gets 76.8 mAP on VOC 2007. At 40 FPS, YOLOv2 gets 78.6 mAP, outperforming state-of-the-art methods like Faster RCNN with ResNet and SSD while still running significantly faster. Finally we propose a method to jointly train on object detection and classification. Using this method we train YOLO9000 simultaneously on the COCO detection dataset and the ImageNet classification dataset. Our joint training allows YOLO9000 to predict detections for object classes that don't have labelled detection data. We validate our approach on the ImageNet detection task. YOLO9000 gets 19.7 mAP on the ImageNet detection validation set despite only having detection data for 44 of the 200 classes. On the 156 classes not in COCO, YOLO9000 gets 16.0 mAP. But YOLO can detect more than just 200 classes; it predicts detections for more than 9000 different object categories. And it still runs in real-time.
? ? ? ??我們介紹了YOLO9000，一個最先進的實時對象檢測系統(tǒng)，可以檢測超過9000個對象類別。首先，我們提出了對YOLO檢測方法的各種改進，既新穎又借鑒了前人的工作。改進后的YOLOv2模型在PASCAL VOC和COCO等標準檢測任務上是最先進的。在67 FPS情況下，YOLOv2在VOC 2007上獲得76.8 mAP。在40幀每秒的速度下，YOLOv2獲得了78.6張mAP，性能超過了最先進的方法，比如使用ResNet和SSD的更快的RCNN，同時仍然運行得非常快。最后提出了一種聯(lián)合訓練目標檢測與分類的方法。利用該方法，我們同時對YOLO9000進行了COCO檢測數(shù)據(jù)集和ImageNet分類數(shù)據(jù)集的訓練。我們的聯(lián)合訓練允許YOLO9000預測沒有標記檢測數(shù)據(jù)的對象類的檢測。我們在ImageNet檢測任務上驗證了我們的方法。YOLO9000在ImageNet檢測驗證集上獲得19.7 mAP，盡管在200個類中只有44個類的檢測數(shù)據(jù)。在未使用COCO的156個類中，YOLO9000得到了16.0 mAP。但是YOLO可以檢測超過200個類;它預測了超過9000種不同對象類別的探測。它仍然是實時運行的。
Conclusion ?
? ? ? ??We introduce YOLOv2 and YOLO9000, real-time detection ?systems. YOLOv2 is state-of-the-art and faster ?than other detection systems across a variety of detection ?datasets. Furthermore, it can be run at a variety of image ?sizes to provide a smooth tradeoff between speed and accuracy. ?
? ? ? ??我們介紹了YOLOv2和YOLO9000，實時檢測系統(tǒng)。YOLOv2是最先進的，比其他檢測系統(tǒng)更快地通過各種檢測數(shù)據(jù)集。此外，它可以運行在各種圖像大小，以提供速度和精度之間的平穩(wěn)權衡。
? ? ? ??YOLO9000 is a real-time framework for detection more ?than 9000 object categories by jointly optimizing detection ?and classification. We use WordTree to combine data from ?various sources and our joint optimization technique to train ?simultaneously on ImageNet and COCO. YOLO9000 is a ?strong step towards closing the dataset size gap between detection ?and classification. ?
? ? ? ??YOLO9000是通過聯(lián)合優(yōu)化檢測和分類，實現(xiàn)對9000多個目標類別進行檢測的實時框架。我們使用WordTree來組合來自不同來源的數(shù)據(jù)，并使用我們的聯(lián)合優(yōu)化技術在ImageNet和COCO上同時進行訓練。YOLO9000是縮小檢測和分類之間數(shù)據(jù)集大小差距的有力一步。
? ? ? ??Many of our techniques generalize outside of object detection. ?Our WordTree representation of ImageNet offers a ?richer, more detailed output space for image classification. ?Dataset combination using hierarchical classification would ?be useful in the classification and segmentation domains. ?Training techniques like multi-scale training could provide ?benefit across a variety of visual tasks. ?
? ? ? ??我們的許多技術可以推廣到對象檢測之外。ImageNet的WordTree表示為圖像分類提供了更豐富、更詳細的輸出空間。在分類和分割領域，采用層次分類的數(shù)據(jù)集組合方法是非常有用的。像多尺度訓練這樣的訓練技術可以為各種視覺任務提供好處。
? ? ? ??For future work we hope to use similar techniques for ?weakly supervised image segmentation. We also plan to ?improve our detection results using more powerful matching ?strategies for assigning weak labels to classification data ?during training. Computer vision is blessed with an enormous ?amount of labelled data. We will continue looking ?for ways to bring different sources and structures of data together to make stronger models of the visual world.
? ? ? ??在未來的工作中，我們希望使用類似的技術來進行弱監(jiān)督圖像分割。我們還計劃在訓練中使用更強大的匹配策略來為分類數(shù)據(jù)分配弱標簽，從而提高檢測結果。計算機視覺擁有大量的標記數(shù)據(jù)。我們將繼續(xù)尋找將不同的數(shù)據(jù)源和數(shù)據(jù)結構結合在一起的方法，從而構建更強大的可視化世界模型。

論文
Joseph Redmon , Ali Farhadi.
YOLO9000: Better, Faster, Stronger. CVPR 2017 (Best Paper Honorable Mention)
https://arxiv.org/abs/1612.08242

1、YOLOV2的特點、改進、優(yōu)缺點

1、YOLOV2的特點

YOLOv2是YOLO的第二個版本，其目標是在提高速度的同時顯著提高準確度。
與基于proposal的檢測器相比，YOLOv1定位誤差更高，并且召回率（測量所有目標的定位有多好）更低。
SSD是YOLOv1的強大競爭對手，它在某一方面表現(xiàn)出更高的實時處理精度。

2、YOLOV2的改進處

YOLO v2: 使用一系列的方法對YOLO v1進行了改進，在保持原有速度的同時提升準確度。
YOLO9000: 提出了一種目標分類與檢測的聯(lián)合訓練方法，通過WordTree來混合檢測數(shù)據(jù)集與識別數(shù)據(jù)集之中的數(shù)據(jù)，同時在COCO和ImageNet數(shù)據(jù)集中進行訓練得到YOLO9000，實現(xiàn)9000多種目標的實時檢測。

2、實驗結果

1、VOC2007數(shù)據(jù)集

? ? ? Here is the accuracy improvements after applying the techniques discussed so far:
? ? ??注：anchor機制只是試驗性在YOLOv2上嘗試，一旦有了dimension priors就把anchor拋棄了。最后達到78.6mAP的模型上也沒用anchor boxes。

? ? ? Accuracy comparison for different detectors:

3、不同算法性能(mAP)比較

1、Accuracy comparison for different detectors

Yolo V2算法的架構詳解

后期更新……

Yolo V2算法的案例應用

后期更新……

總結

以上是生活随笔為你收集整理的DL之YoloV2：Yolo V2算法的简介(论文介绍)、架构详解、案例应用等配图集合之详细攻略的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇：成功解决Ubuntu下的make: gc
下一篇： ECS：利用ECS进行深度学习详细攻略

3atv精品不卡视频,97人人超碰国产精品最新,中文字幕av一区二区三区人妻少妇,久久久精品波多野结衣,日韩一区二区三区精品

编程问答

DL之YoloV2：Yolo V2算法的简介(论文介绍)、架构详解、案例应用等配图集合之详细攻略

Yolo V2算法的簡介(論文介紹)

1、YOLOV2的特點、改進、優(yōu)缺點

2、實驗結果

3、不同算法性能(mAP)比較

Yolo V2算法的架構詳解

Yolo V2算法的案例應用

總結