Questions tagged [yolo]

You Only Look Once (YOLO) is a real-time object detection system written in C. Use this tag for questions about YOLO compilation and installation, usage, and its features and functionality.

You Only Look Once (YOLO) is a real-time object detection system developed for high speed object detection within images.

Useful Links:

1841 questions
48
votes
4 answers

Understanding darknet's yolo.cfg config files

I have searched around the internet but found very little information around this, I don't understand what each variable/value represents in yolo's .cfg files. So I was hoping some of you could help, I don't think I'm the only one having this…
Reda Drissi
  • 1,532
  • 3
  • 20
  • 32
28
votes
5 answers

Darknet YOLO image size

I am trying to train custom object classifier in Darknet YOLO v2 https://pjreddie.com/darknet/yolo/ I gathered a dataset for images most of them are 6000 x 4000 px and some lower resolutions as well. Do I need to resize the images before training to…
Farahats9
  • 535
  • 1
  • 9
  • 22
20
votes
8 answers

How to convert bounding box (x1, y1, x2, y2) to YOLO Style (X, Y, W, H)

I'm training a YOLO model, I have the bounding boxes in this format:- x1, y1, x2, y2 => ex (100, 100, 200, 200) I need to convert it to YOLO format to be something like:- X, Y, W, H => 0.436262 0.474010 0.383663 0.178218 I already calculated the…
20
votes
2 answers

Anchor Boxes in YOLO : How are they decided

I have gone through a couple of YOLO tutorials but I am finding it some what hard to figure if the Anchor boxes for each cell the image is to be divided into is predetermined. In one of the guides I went through, The image was divided into 13x13…
linker
  • 821
  • 1
  • 8
  • 20
19
votes
3 answers

YOLO object detection: how does the algorithm predict bounding boxes larger than a grid cell?

I am trying to better understand how the YOLO2 & 3 algorithms works. The algorithm processes a series of convolutions until it gets down to a 13x13 grid. Then it is able to classify objects within each grid cell as well as the bounding boxes for…
krishnab
  • 9,270
  • 12
  • 66
  • 123
18
votes
1 answer

Training a Keras model yields multiple optimizer errors

So I need to retrain Tiny YOLO using my own dataset. The model I am using can be found here: keras-yolo3 . I started training and I get multiple optimizer errors, added the code of the errors to stop confusion. And I noticed the training is going…
UrosT
  • 181
  • 1
  • 6
16
votes
2 answers

Unsupported gpu architecture compute_30 on a CUDA 5 capable gpu

I'm currently trying to compile Darknet on the latest CUDA toolkit which is version 11.1. I have a GPU capable of running CUDA version 5 which is a GeForce 940M. However, while rebuilding darknet using the latest CUDA toolkit, it said nvcc fatal …
3MP The Rook
  • 173
  • 1
  • 2
  • 5
16
votes
5 answers

How to convert Yolo format bounding box coordinates into OpenCV format

I have Yolo format bounding box annotations of objects saved in a .txt files. Now I want to load those coordinates and draw it on the image using OpenCV, but I don’t know how to convert those float values into OpenCV format coordinates values I…
arush1836
  • 1,327
  • 8
  • 19
  • 37
15
votes
2 answers

One stage vs two stage object detection

I was going through YOLOv4 paper which often uses the term one & two stage object detection. I was unable to understand what's the difference between the two types of object detectors. I am assuming One stage does both region detection + object…
15
votes
2 answers

How many images(minimum) should be there in each classes for training YOLO?

I am trying to implement YOLOv2 on my custom dataset. Is there any minimum number of images required for each class?
Ashna Eldho
  • 454
  • 1
  • 7
  • 22
14
votes
2 answers

Extracting the license plate parallelogram from the surrounding bounding box?

So I have trained an object recognition neural network (YOLOv3) to detect bounding boxes around the license plates of car pictures shot at a variety of tilted and straight angles and the network does it pretty reliably. However now I want to extract…
Moalana
  • 421
  • 3
  • 11
13
votes
3 answers

Using YOLO or other image recognition techniques to identify all alphanumeric text present in images

I have multiple images diagram, all of which contains labels as alphanumeric characters instead of just the text label itself. I want my YOLO model to identify all the numbers & alphanumeric characters present in it. How can I train my YOLO model to…
user11247278
13
votes
9 answers

KeyError: ''val_loss" when training model

I am training a model with keras and am getting an error in callback in fit_generator function. I always run to epoch 3rd and get this error annotation_path = 'train2.txt' log_dir = 'logs/000/' classes_path =…
Phuc Nguyen
  • 371
  • 1
  • 3
  • 14
13
votes
6 answers

OpenCV 4.x+ requires enabled C++11 support compilation darknet fatal error

I just compiled and installed the last version of OpenCV 3.4.0 and I would like to compile darknet (for yolo object detection), but at compilation, I have gcc -Iinclude/ -Isrc/ -DOPENCV `pkg-config --cflags opencv` -DGPU…
FlavienRJ
  • 191
  • 1
  • 1
  • 7
12
votes
2 answers

How to reduce number of classes in YOLOv3 files?

I am using YOLOv3 to detect cars in videos. I downloaded three files used in my code coco.names, yolov3.cfg and yolov3.weights which are trained for 80 different classes of objects to be detected. The code worked but very slowly, it takes more than…
AbdelAziz AbdelLatef
  • 3,650
  • 6
  • 24
  • 52
1
2 3
99 100