VisionChallenge

Version 35 (Grace Vesom, 2016-06-20 06:47 am)

1 33 Grace Vesom
h1. OpenCV’s People’s Vote Winning Papers and State of the Art Vision Challenge Winners
2 30 Grace Vesom
3 34 Grace Vesom
This is a 2-for-1 "CVPR 2015 Workshop":http://www.pamitc.org/cvpr15/workshops.php covering
4 34 Grace Vesom
* People’s choice awards for winning papers from CVPR 2015 and 
5 34 Grace Vesom
** People voted on the CVPR 2015 papers that they most want to see implemented and this will encourage them to be implemented in "opencv_contrib":https://github.com/Itseez/opencv_contrib
6 34 Grace Vesom
* Winning algorithms of the OpenCV Vision Challenge
7 34 Grace Vesom
** Our attempt to start collecting the baseline best in class algorithms also into "opencv_contrib":https://github.com/Itseez/opencv_contrib
8 30 Grace Vesom
9 30 Grace Vesom
This is a short workshop, one hour before lunch, to announce and describe winners of two separate contests:
10 30 Grace Vesom
11 30 Grace Vesom
<pre>
12 30 Grace Vesom
Location: Room 101 (~123) 
13 30 Grace Vesom
Time: 11am-12pm
14 30 Grace Vesom
</pre>
15 30 Grace Vesom
16 33 Grace Vesom
h2. (1) People’s Choice: Winning Papers CVPR 2015
17 30 Grace Vesom
18 33 Grace Vesom
We will tally the people’s vote for the paper you’d most like to see implemented in CVPR. We’ll present the histogram of results which is an indication of the algorithms people are interested in overall and then list the 5 top winners. We'll also mention the people's choice for best demo.
19 30 Grace Vesom
20 30 Grace Vesom
Prizes will be awarded in two stages: 
21 30 Grace Vesom
* A modest award for winning and 
22 30 Grace Vesom
* a larger award for presenting the code w/in 5 months as a pull request to OpenCV as Detailed here:
23 30 Grace Vesom
** http://code.opencv.org/projects/opencv/wiki/How_to_contribute
24 30 Grace Vesom
25 30 Grace Vesom
*Prizes:*
26 30 Grace Vesom
# Win: $500; Submit code: $6000 
27 30 Grace Vesom
# Win: $300; Submit code: $4000
28 30 Grace Vesom
# Win: $100; Submit code: $3000
29 30 Grace Vesom
# Win: $50; Submit code: $3000
30 30 Grace Vesom
# Win: $50; Submit code: $3000
31 30 Grace Vesom
32 30 Grace Vesom
Results will be listed on OpenCV’s website:
33 30 Grace Vesom
* (user) http://opencv.org/ and 
34 30 Grace Vesom
* (developer) http://code.opencv.org/projects/opencv/wiki 
35 1
36 30 Grace Vesom
h2. (2) State of the Art Vision Challenge
37 31 Grace Vesom
38 33 Grace Vesom
Our aim is to make available state of the art vision in OpenCV. We thus ran a vision challenge to meet or exceed the state of the art in various areas. We will present the results, some of which are quite compelling. The contest details are available at:
39 33 Grace Vesom
40 30 Grace Vesom
http://code.opencv.org/projects/opencv/wiki/VisionChallenge 
41 30 Grace Vesom
42 30 Grace Vesom
*Prizes:*
43 30 Grace Vesom
# Win: $1000; Submit code: $3000 
44 30 Grace Vesom
# Win: $1000; Submit code: $3000
45 30 Grace Vesom
# Win: $1000; Submit code: $3000
46 30 Grace Vesom
# Win: $1000; Submit code: $3000
47 30 Grace Vesom
# Win: $1000; Submit code: $3000
48 30 Grace Vesom
49 30 Grace Vesom
50 30 Grace Vesom
In this contest, if someone does not submit the code, the unclaimed money may be reallocated to those who do at the sole discretion of the prize committee.
51 30 Grace Vesom
52 30 Grace Vesom
53 30 Grace Vesom
h2. Proposers
54 30 Grace Vesom
55 32 Grace Vesom
* *Dr. Gary Rost Bradski,* Chief Scientist, Computer Vision and AI at Magic Leap, Inc.
56 30 Grace Vesom
** [email protected]
57 30 Grace Vesom
* *Vadim Pisarevsky*, Principal Engineer at Itseez
58 30 Grace Vesom
** [email protected]
59 30 Grace Vesom
* *Vincent Rabaud*, Perception Team Manager at Aldebaran Robotics
60 30 Grace Vesom
** [email protected]
61 30 Grace Vesom
* *Grace Vesom*, 3D Vision Senior Engineer at Magic Leap, Inc.
62 30 Grace Vesom
** [email protected]
63 30 Grace Vesom
64 30 Grace Vesom
h2. Presenters
65 30 Grace Vesom
66 30 Grace Vesom
*Dr. Gary Rost Bradski* is Chief Scientist of Computer Vision at Magic Leap.  Gary founded OpenCV at Intel Research in 2000 and is currently CEO of nonprofit OpenCV.org.  He ran the vision team for Stanley, the autonomous vehicle that completed and won the $2M DARPA Grand Challenge robot race across the desert.  Dr. Bradski helped start up NeuroScan (sold to Marmon), Video Surf (sold to Microsoft), and Willow Garage (absorbed into Suitable Tech).  In 2012, he founded Industrial Perception (sold to Google, August 2013).  Gary has more than 100 publications and more than 30 patents and is co-author of a bestseller in its category Learning OpenCV: Computer Vision with the OpenCV Library, O'Reilly Press. 
67 30 Grace Vesom
68 30 Grace Vesom
*Vadim Pisarevsky* is the chief architect of OpenCV.  He graduated from NNSU Cybernetics Department in 1998 with a  Master’s degree in Applied Math.  Afterwards, Vadim worked as software engineer and the team leader of OpenCV project at Intel Corp in 2000-2008.  Since May 2008 he is an employee of Itseez corp and now works full time on OpenCV under a Magic Leap contract.
69 30 Grace Vesom
70 30 Grace Vesom
*Vincent Rabaud* is the perception team manager at Aldebaran Robotics.  He co-founded the non-profit OpenCV.org with Gary Bradski in 2012 while a research engineer at Willow Garage.  His research interests include 3D processing, object recognition and anything that involves underusing CPUs by feeding them fast algorithms.  Dr. Rabaud completed his PhD at UCSD, advised by Serge Belongie.  He also holds a MS in space mechanics and space imagery from SUPAERO and a MS in optimization from the Ecole Polytechnique. 
71 30 Grace Vesom
72 30 Grace Vesom
*Grace Vesom* is a senior engineer in 3D vision at Magic Leap and Director of Development for the OpenCV Foundation.  Previously, she was a research scientist at Lawrence Livermore National Laboratory working on global security applications and completed her DPhil at the University of Oxford in 2010.
73 30 Grace Vesom
74 30 Grace Vesom
75 35 Grace Vesom
h1. Details of the People's Choice Winning Paper -- vote using the CVPR app
76 30 Grace Vesom
77 30 Grace Vesom
78 30 Grace Vesom
79 31 Grace Vesom
h1. Details of the State of the Art Vision Challenge:
80 1
81 29 Grace Vesom
OpenCV is launching a community-wide challenge to update and extend the OpenCV library with state of the art algorithms. An award pool of $20,000 will be provided to the best performing algorithms in the following 11 CV application areas: 
82 1
83 14 Grace Vesom
# image segmentation
84 14 Grace Vesom
# image registration
85 14 Grace Vesom
# human pose estimation
86 14 Grace Vesom
# SLAM
87 14 Grace Vesom
# multi-view stereo matching
88 14 Grace Vesom
# object recognition
89 14 Grace Vesom
# face recognition
90 14 Grace Vesom
# gesture recognition
91 14 Grace Vesom
# action recognition
92 14 Grace Vesom
# text recognition
93 14 Grace Vesom
# tracking
94 8 Grace Vesom
95 22 Grace Vesom
We prepared code to read from existing data sets in each of these areas: "modules/datasets":http://docs.opencv.org/master/modules/datasets/doc/datasets.html
96 10 Grace Vesom
97 1
h2. Conditions:
98 6 Grace Vesom
99 1
The OpenCV Vision Challenge Committee will judge up to five best entries. 
100 1
101 14 Grace Vesom
# You may submit a new algorithm developed by yourself.
102 14 Grace Vesom
# You may submit an existing algorithm *whether or not developed by yourself* (as long as you own or re-implement it yourself).
103 14 Grace Vesom
# Up to 5 winning algorithms will receive $1000 each.
104 29 Grace Vesom
# For an additional $3000 to $15,000*, you must submit your winning code as an OpenCV pull request under a BSD or compatible license.
105 16 Grace Vesom
** You acknowledge that your code may be included, with citation, in OpenCV.
106 6 Grace Vesom
107 6 Grace Vesom
You may explicitly enter code for any work you have submitted to CVPR 2015 or its workshops. We will not unveil it until after CVPR.
108 6 Grace Vesom
109 1
Winners and prizes are at the sole discretion of the committee.
110 14 Grace Vesom
111 22 Grace Vesom
*List of selected datasets and other details described here:* "OpenCV Vision Challenge":http://code.opencv.org/attachments/1672/OpenCVVisionChallenge.pdf
112 14 Grace Vesom
113 29 Grace Vesom
??* We will have a professional programmer assist people with their pull requests. The final amount will be adusted by number of pull requests. The minimum will be $3000 additional dollars for a pull request. The prize committee may adjust the amounts upwards depending on remaining budget at the commitees sole discretion??
114 11 Grace Vesom
115 6 Grace Vesom
h2. Timeline:
116 1
117 1
*Submission Period:*
118 15 Grace Vesom
_Now - May 15th 2015_
119 1
120 1
*Winners Announcement:* 
121 1
_June 8th 2015 at CVPR 2015_
122 1
123 10 Grace Vesom
h2. Contact:
124 1
125 1
[email protected]
126 20 Grace Vesom
127 20 Grace Vesom
h2. Q&A:
128 20 Grace Vesom
129 23 Grace Vesom
*Q.:* _What should be in performance evaluation report? Shall we send any report or paper along with the code?_
130 23 Grace Vesom
*A.:* Participants are required to send source code and a performance evaluation report of their algorithms. Report should be in the standard form of a paper with algorithm description. Evaluation should be performed on at least one of the chosen benchmark datasets associated with the building block. Evaluation methodology should be the same as specified by author of each dataset, this includes using the same train\validation\test splits, evaluation metrics, etc. In additional, we ask to report running time of algorithm and platform details to help with their comparison. Algorithm's accuracy should be compared with state-of-the-art algorithms. In addition, it’ll be useful to compare it with algorithms implemented in OpenCV whenever possible. Source code and supplied documentation should contain clear description on how to reproduce evaluation results. Source code have to be compiled and run under Ubuntu 14.
131 23 Grace Vesom
132 20 Grace Vesom
*Q.:* _Can I participate in this Vision Challenge by addressing building blocks different from the current 11 categories?_
133 20 Grace Vesom
*A.:* For this Vision Challenge, we have selected 11 categories and 21 supporting datasets. To participate in the Vision Challenge you need to address at least one of the building blocks we have selected and get results in at least one of the chosen associated datasets. Results on additional datasets (e.g., depth channel) will be evaluated accordingly by the awarding committee.
134 20 Grace Vesom
This may be just the first one of a series of challenges and we want to hear from the vision community which building blocks should come next, for the possible next challenges. Please, send your suggestions to our e-mail: [email protected].
135 20 Grace Vesom
136 20 Grace Vesom
*Current propositions list:*
137 21 Grace Vesom
* Background Subtraction - 1 vote
138 21 Grace Vesom
* Point Cloud Registration - 1 vote
139 26 Grace Vesom
* Pedestrian Detection - 1 vote
140 27 Grace Vesom
* Text Recognition for Arabic language - 1 vote
141 20 Grace Vesom
142 20 Grace Vesom
*Q.:* _Which external algorithms or libraries can we use?_
143 20 Grace Vesom
*A.:* All used 3rd party code should have Permissive free software licence. The most popular such licenses are: BSD, Apache 2.0, MIT.
144 20 Grace Vesom
145 20 Grace Vesom
*Q.:* _I don't find the tracking dataset loading in opencv_contrib/modules/datasets module._
146 20 Grace Vesom
*A.:* We are not implemented loading-evaluation code for VOT tracking dataset, because it already has its own "toolkit":http://www.votchallenge.net/vot2014/participation.html.
147 1
148 24 Grace Vesom
*Q.:* _Where I can find the Dataset Benchmarks?_
149 25 Grace Vesom
*A.:* They are placed with samples in "modules/datasets/samples":https://github.com/Itseez/opencv_contrib/tree/master/modules/datasets/samples.
150 24 Grace Vesom
151 1
h3. Back to Developer page:
152 1
153 1
"OpenCV":http://code.opencv.org/projects/opencv/wiki