Real-time drone object tracking using Python and OpenCV
After flying this past weekend (together with Gabriel and Leandro) with Gabriel’s drone (a handmade APM 2.6-based quadcopter) in our town (Porto Alegre, Brazil), I decided to implement object tracking using OpenCV and Python and to check how the results would look using simple and fast methods like Meanshift. The results were very impressive, and I believe there is plenty of room for optimization, but the algorithm already runs in real time in Python with good results, at a Full HD resolution of 1920×1080 and 30 fps.
Here is the video of the flight that was piloted by Gabriel:
See it in Full HD for more details.
The algorithm is very simple (less than 50 lines of Python) and straightforward:
- A ROI (Region of Interest) is defined, in this case the building that I want to track
- The normalized histogram and back-projection are calculated
- The Meanshift algorithm is used to track the ROI
The entire code for the tracking is described below:
import numpy as np
import cv2


def run_main():
    cap = cv2.VideoCapture('upabove.mp4')

    # Read the first frame of the video
    ret, frame = cap.read()

    # Set the ROI (Region of Interest). Actually, this is a
    # rectangle of the building that we're tracking
    c, r, w, h = 900, 650, 70, 70
    track_window = (c, r, w, h)

    # Create mask and normalized histogram
    roi = frame[r:r+h, c:c+w]
    hsv_roi = cv2.cvtColor(roi, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv_roi, np.array((0., 30., 32.)),
                       np.array((180., 255., 255.)))
    roi_hist = cv2.calcHist([hsv_roi], [0], mask, [180], [0, 180])
    cv2.normalize(roi_hist, roi_hist, 0, 255, cv2.NORM_MINMAX)
    term_crit = (cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 80, 1)

    while True:
        ret, frame = cap.read()
        if not ret:  # end of video or capture failure
            break

        hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
        dst = cv2.calcBackProject([hsv], [0], roi_hist, [0, 180], 1)

        ret, track_window = cv2.meanShift(dst, track_window, term_crit)

        x, y, w, h = track_window
        cv2.rectangle(frame, (x, y), (x+w, y+h), 255, 2)
        cv2.putText(frame, 'Tracked', (x-25, y-10), cv2.FONT_HERSHEY_SIMPLEX,
                    1, (255, 255, 255), 2, cv2.CV_AA)  # cv2.LINE_AA on OpenCV 3+

        cv2.imshow('Tracking', frame)

        if cv2.waitKey(1) & 0xFF == ord('q'):
            break

    cap.release()
    cv2.destroyAllWindows()


if __name__ == "__main__":
    run_main()
I hope you liked it!
Will this be as accurate using the FPV video at standard PAL resolution? It would be incredibly more useful to be able to track and detect from a live feed, and to post-process after a flight.
Yes, it is supposed to work with FPV video, but I believe other methods would be more appropriate, although it depends on whether your FPV video has a good FPS.
Can you please specify which methods you are referring to for real-time object tracking?
This inspires me. I’m joining a few guys who are engineering software for drones and I only have Java / backend experience. This makes me believe I can have some fun with them! 🙂 Thanks
Hey,
What hardware are you using as a board? A Raspberry Pi? What hardware is the drone?
I don’t think the Raspberry Pi could do this frame rate. I am also really interested in knowing which hardware was used!
Hello! Could you give me some tips for a school project?
It consists of a drone (F450) with an APM 2.6 controller, which needs to pass through a frame (window) that can be of any geometric shape (and this has to be done autonomously). I thought of using a camera and a Raspberry Pi to recognize the geometric shape and find its center, so that the aircraft can pass through the middle of it.
Thanks for your attention 😉
Hello Thomas, we’re using an APM 2.6 board for the drone, but the video was recorded with a GoPro and the processing was done on a desktop computer. The video was processed in real time, though, so it could also be used for an FPV transmission, for instance. I also believe that the ARM on the Raspberry Pi has enough resources to process this, but I’m still going to test that.
Hi,
I have done this with a Raspberry Pi and a USB webcam in Python; it works in real time.
Did you document a tutorial because I am interested in this project and I would appreciate an advice in how to start.
I’m getting some errors. What should I do?
Your answer was just what I needed. It’s made my day!
Hi,
and what do you use to send commands from the computer to the APM 2.6?
If you are using OpenCV 3.0, cv2.CV_AA should be cv2.LINE_AA
Good tutorial, thanks.
Thanks for the feedback.
I’ve run into this while installing OpenCV. Does anyone know the history/reasoning behind this change? I’m a neophyte to OpenCV.
In OpenCV version 3.0, a lot of refactoring was done. Due to rising interest, contributions from people around the world, and GSoC, the library was getting a lot bigger. Version 3 seemed like the right time to reduce all the technical debt and provide a cleaner and easier design so that, as you said, neophytes can learn faster.
Did you test how the Meanshift algorithm performs compared with Median-Flow or TLD in this kind of tracking?
I tried some time ago, and Median-Flow had better results in my case.
I didn’t test it; this was my first PoC. I’ll try other methods, probably using better feature extraction like SURF or something like that.
Hi, your case is very simple, so practically any respectable tracking method will give you good results. But for your case (especially with a drone), KLT tracking for video stabilization followed by a mixed KLT/MS tracking will give you much better robustness and accuracy, even with deformable objects (and with OpenCV it is not very complex to code).
I have implemented something similar some time ago for my master thesis, if you are interested in it, you can see the result and download my thesis here: http://namnorireports.duckdns.org/wordpress/?p=104.
Thanks for the information, Namnori, I’ll take a look. This was my first PoC, using only a simple (and very fast) method like Meanshift, but I’ll try other methods for sure.
Take into account that for tracking you usually do not use the image at its original size when you work with corners or edges, because their core information remains almost intact in downsampled images; in the rare cases when you need precision at the original size, you do a pyramidal analysis.
The problem with the usual application of MS is that, with only the color information, it fails a lot when you have similar histograms (which happens a lot in real scenarios). You can easily boost its performance just by adding gradient information to the histogram (counting, for example, the number of edge points) and using the channels in a dependent way (using each RGBE sample as a single value in the histogram; this consumes a lot of memory, but it’s worth the extra cost).
If you later use a KLT method to stabilize the video, then you can go even further, using the features tracked by KLT to learn the MS histogram instead of using a single patch (that will allow you to get much better results on deformable objects, and it will let you use much smarter learning methods).
Hello, can you share your thesis again? I couldn’t open it.
@namnori: please upload your source link again, as it doesn’t open.
Nice job!
Now I would love to see that running in real time on the drone. I guess a typical smartphone-type device would be enough (granted we can use its GPU to speed up the computation).
Imagine what you could do with a modern smartphone’s computational power (without even mentioning connectivity like cellular/3G, GPS, its IMUs, etc.)!
Aaaannnd now I need a drone… :-/
Really good job, man! I was researching something slightly different: I wanted to get the GPS coordinates of the object I am tracking using my drone’s GPS info.
I really want this for a promising project I have in mind. The accuracy is irritating me as well (maybe I would make the drone halt to get a solid GPS lock first).
If it’s possible, or you can demonstrate it, I would be more than thankful, man!
“img2” is assigned to but never used ?
That was a typo, thanks, I removed the assignment, rectangle() actually returns None.
If that means that you can circle an object that you define as the area of interest, and the camera will stay focused on it, that is VERY cool!
Can this be used to make a map of an object’s movements later?
Can your drone track and follow the moving target?
Hi Christian,
your tracking software seems to be fast and robust.
My team and I have created a sales platform for intelligent video analytics. We have a base of clients who would find your software highly interesting.
The platform is called Blepo, and can be found at http://www.blepo.net. As a developer, you can advertise your software entirely for free, to a wide range of users. These users will be able to test your software against their own videos, allowing them to find software that is best suited to their needs.
The platform is cloud based and secure. You may choose to upload a demo version of your IVA software, or host the demo version on your own servers. If hosted on your own servers, you will receive an online request every time a user would like to test it. This ensures complete protection of your software.
I would be delighted to have you join the community – the exposure it will bring your software is significant. You can find more information on our community at
http://www.procams-project.eu/developer/.
Kindest regards,
Marina
How can I fix this problem?
Traceback (most recent call last):
File “C:/Python27/koko realtime.py”, line 45, in
run_main()
File “C:/Python27/koko realtime.py”, line 16, in run_main
roi = frame[r:r+h, c:c+w]
TypeError: ‘NoneType’ object has no attribute ‘__getitem__’
Your frame object is None; that means the capture call to the camera or file is returning None for some reason. Please check the return code.
Which camera did you use? I was just getting started with your code and I’m also getting the same NoneType error.
Great work! Interested in the type of camera that you used!
HI Chris,
Can we implement your solution on a moving target as well? We are interested in your solution.
Hello Elie, sure, you can implement it on a moving target. However, there are other methods that could improve your tracking; this one was just made with a very simple (and fast) method. It depends on how much performance you can get, the quality of the video, etc.
Hi Christian, thanks for the reply. So how can we do the implementation with our drone and gimbal? We are using the Pixhawk. What do we need to be able to do it?
roi = frame[r:r+h, c:c+w]
TypeError: ‘NoneType’ object has no attribute ‘__getitem__’
How do I solve this?
“Excerpts and links may be used, provided that full and clear credit is given to Christian S. Perone with appropriate and specific direction to the original content.”
Yet you used this code without giving any credit – http://docs.opencv.org/master/db/df8/tutorial_py_meanshift.html
Nice.
As far as I know, the OpenCV license is a 3-clause BSD license (http://opencv.org/license.html), and I’m citing the link to the documentation example in the post itself. Also, if you look, the code isn’t actually exactly the same (do you know another way of doing Meanshift without using the same API?). If this is violating the license, please let me know so I can remove the code in question.
How did you choose the numbers on the line where you defined the ROI? I mean, how did you get those numbers?
If you can tell me, that would be a great help…
Thanks!
Hey, will I be able to use this to track traffic signs?
For real-time tracking, how did you determine the pixel values c,r,w,h of the building?
Make a mouse callback function for providing the ROI.
Hi Chris,
Amazing video capture. I need a little help: how did you get “upabove.mp4” into the program? And if I want to track circles on the ground, how would I change the parameters c,r,w,h?
Hey, thank you so much for the code! Actually, my project deals with detecting faces from a drone using FPV (for detecting wanted persons, for example). Can you please tell me if there’s code for detecting faces?
Hi Chris,
I need to implement a follow algorithm for a UAV to follow a car. The UAV receives the GPS coordinates of the car. I would be implementing it on a PX4 FMU. I also want good-quality video recording while following. How could I use your code?
Hey, would this field fall under Computer Science or Computer Engineering, when discussing object tracking in real time, as well as connecting a drone to GPS or computers? Please send me helpful links on these topics if possible.
I want to develop the same thing using a Raspberry Pi and ArduPilot. How can I achieve this? Can you suggest some useful tutorials, as I’m a newbie to drones?
The tracker seems to work fast and reliably!
I have one concern: here the drone is at a constant distance from the object. If we assume the drone is moving toward the object, does the bounding box keep track of the movement? In other words, will the bounding box enlarge as the object gets closer?
Thanks
Saif,
What should I do/edit if I want to use a live feed from the drone’s camera?
Hi !
I need your help. I want to know how we can use the live feed of a drone and process it to detect a person. Is it possible or not? If yes, then how? Can we use any drone to get a live feed on our desktop and process it according to our needs?
I’m waiting for your response.
Thanks
Can I use this with a real-time webcam?
Will it work at the same range?
Hello Mr. Perone,
I was looking at your code, and I am trying to run it using an A.R. Drone 2.0, but nothing is streaming. Am I supposed to add something?
Is there a downloadable executable, like an installable app? I am not a programmer, but I am a drone builder and this is very appealing.
# Set the ROI (Region of Interest). Actually, this is a
# rectangle of the building that we're tracking
c,r,w,h = 900,650,70,70
track_window = (c,r,w,h)
Can you please explain these lines?
Hi Christian
This would be very useful for a Non Profit UAV based anti poaching project I am working on and it would be great if you would be willing to discuss this and the obstacles we are facing with auto camera and drone positioning.
Thanks
Robert
Hi, I tried to run it, but I have this error:
mask = cv2.inRange(hsv_roi, np.array((0., 30.,32.)), np.array((180.,255.,255.)))
error: /Users/travis/build/skvark/opencv-python/opencv/modules/core/src/arithm.cpp:1769: error: (-209) The lower boundary is neither an array of the same size and same type as src, nor a scalar in function inRange
How did you determine the r,c,h,w values? I’m incredibly new to this.
Thanks
Hi
Please have a look at this video: it already has the tracked building, which we are providing as input to the algorithm. Also check the corners of the video while running the code; I don’t think it is working.
https://www.youtube.com/watch?v=m7vOCg8GrAA
Wow! This is amazing. I really liked it. Great job!
It’s good, but how do you get the lat/long of the target object?
Hello Christian, how are you? My name is Roberto Bueno, and I am an undergraduate in Agricultural and Environmental Engineering at the Federal University of Mato Grosso. I would like to use this kind of application for a project, and I would like to know if you have an email address I could use to contact you.
Thanks for your attention.
Traceback (most recent call last):
File “c:\Users\glebk\OneDrive\Рабочий стол\opencv\droneROI\main.py”, line 45, in
run_main()
File “c:\Users\glebk\OneDrive\Рабочий стол\opencv\droneROI\main.py”, line 17, in run_main
hsv_roi = cv2.cvtColor(roi, cv2.COLOR_BGR2HSV)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
cv2.error: OpenCV(4.7.0) D:\a\opencv-python\opencv-python\opencv\modules\imgproc\src\color.cpp:182: error: (-215:Assertion failed) !_src.empty() in function ‘cv::cvtColor’
I have this problem. What is my mistake?