Powered by GitBook

YOLO Algorithm

前言

YOLO是如何訓練，如何執行的。

內容

Training

input: 100x100x3 圖片

output: 3x3x16

3x3是因為將圖片切割成9宮格。

16是因為有2組不同anchor box，且每組anchor box有8個值。

無物體的grid cell的target Y該長怎樣?

有一個或兩個物體的grid cell的target Y該長怎樣?

現實訓練的output是19x19x40。

40 = 5 x 8

Making predictions

無物體時(藍色框)。

有一個物體時(綠色框)。

Outputting the non-max supressed outputs

每一個grid cell都會有兩個bounding box。

先丟棄機率pc低的grid cell，表示此cell可能都沒有物體。

在每個cell中的每個物體都要利用non-max supresseion分別跑過一次，來確保每個物體都擁有正確bounding box。

該學到的

YOLO模型的input, output的shape是如何設計的。
non-max suppression是如何利用在YOLO。

results matching ""

No results matching ""