VisionChallenge

Version 65 (Grace Vesom, 2016-06-20 06:47 am)

h1. OpenCV’s People’s Vote Winning Papers and State of the Art Vision Challenge Winners 2015

*Sponsored by Intel*

This is a 2-for-1 "CVPR 2015 Workshop":http://www.pamitc.org/cvpr15/workshops.php covering:
* People’s choice awards for winning papers from CVPR 2015
** _Vote on the CVPR 2015 papers that you most want to see implemented, and we'll pay the winners to implement them in_ "opencv_contrib":https://github.com/Itseez/opencv_contrib
* Winning algorithms of the OpenCV Vision Challenge
** _Start collecting implementations of the best-in-class algorithms in_ "opencv_contrib":https://github.com/Itseez/opencv_contrib

This is a short workshop, one hour before lunch, to announce and describe the winners of two separate contests:

<pre>
Location: Room 101
Time: 11am-12pm
</pre>

h2. (1) People's Choice Best Paper Award: CVPR 2015

We will tally the people’s vote !Vote5.png! for the paper you’d most like to see implemented (you may vote for as many as you want). We'll describe the top 5 winners.

Prizes will be awarded in two stages:
* A modest award for winning and
* a larger award for presenting the code as a pull request to OpenCV by December 1st, 2015, as detailed here:
** http://code.opencv.org/projects/opencv/wiki/How_to_contribute

*Prizes:*
* 1st winner: $500; Submit code: $6000
* 2nd winner: $300; Submit code: $4000
* 3rd winner: $100; Submit code: $3000
* 4th winner: $50; Submit code: $3000
* 5th winner: $50; Submit code: $3000
* 6th winner: $50; Submit code: $2000

h3. *Winners:*

* *1st:* Yuting Zhang, Kihyuk Sohn, Ruben Villegas, Gang Pan, Honglak Lee
** Improving Object Detection With Deep Convolutional Networks via Bayesian Optimization and Structured Prediction
* *2nd:* Richard A. Newcombe, Dieter Fox, Steven M. Seitz
** DynamicFusion: Reconstruction and Tracking of Non-Rigid Scenes in Real-Time
* *3rd:* Anh Nguyen, Jason Yosinski, Jeff Clune
** Deep Neural Networks Are Easily Fooled: High Confidence Predictions for Unrecognizable Images
* *4th:* Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
** Hypercolumns for Object Segmentation and Fine-Grained Localization
* *TIE 5th:* Anton van den Hengel, Chris Russell, Anthony Dick, John Bastian, Daniel Pooley, Lachlan Fleming, Lourdes Agapito
** Part-Based Modelling of Compound Scenes From Images
* *TIE 5th:* Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich
** Going Deeper With Convolutions

There were many favorited tutorials! They cannot win the Vision Challenge, but they are listed here along with their votes:
* *TIE 1st:* Applied Deep Learning for Computer Vision with Torch (19 votes)
* *TIE 1st:* DIY Deep Learning: A Hands-On Tutorial With Caffe (19 votes)
* *2nd:* ImageNet Large Scale Visual Recognition Challenge Tutorial (13 votes)
* *TIE 3rd:* OpenCV 3.0 Technical Tutorials: Beginner to Specialist (7 votes)
* *TIE 3rd:* Energy Minimization and Discrete Optimization (7 votes)
* *TIE 3rd:* Open Source Structure-from-Motion (7 votes)
* *TIE 3rd:* Sparse and Low-Rank Modeling for High-Dimensional Data Analysis (7 votes)
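
The placings above follow what is usually called dense ranking: tied entries share a place, and the next lower vote count takes the very next place (two 1sts, then 2nd). Below is a minimal sketch of that tally, using the tutorial vote counts listed above. The helper name @dense_rank@ is hypothetical, and this is an illustration only, not the committee's actual tallying code.

```python
def dense_rank(votes):
    """Return (place, name, count) tuples using dense ranking.

    `votes` maps an entry name to its vote count. Ties share a place and
    the next distinct count gets the next consecutive place.
    """
    # Sort by descending vote count, then by name for a stable order.
    ranked = sorted(votes.items(), key=lambda item: (-item[1], item[0]))
    results = []
    place = 0
    previous_count = None
    for name, count in ranked:
        if count != previous_count:
            place += 1
            previous_count = count
        results.append((place, name, count))
    return results


# Vote counts copied from the tutorial list above.
tutorial_votes = {
    "Applied Deep Learning for Computer Vision with Torch": 19,
    "DIY Deep Learning: A Hands-On Tutorial With Caffe": 19,
    "ImageNet Large Scale Visual Recognition Challenge Tutorial": 13,
    "OpenCV 3.0 Technical Tutorials: Beginner to Specialist": 7,
    "Energy Minimization and Discrete Optimization": 7,
    "Open Source Structure-from-Motion": 7,
    "Sparse and Low-Rank Modeling for High-Dimensional Data Analysis": 7,
}

for place, name, count in dense_rank(tutorial_votes):
    print(f"{place}. {name} ({count} votes)")
```

With dense ranking, four entries tied at 7 votes all land in 3rd place, which matches the list above.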

Results will be listed on OpenCV’s website:
* (user) http://opencv.org/ and
* (developer) http://code.opencv.org/projects/opencv/wiki

h2. (2) State of the Art Vision Challenge

Our aim is to make state-of-the-art vision available in OpenCV. We thus ran a vision challenge to meet or exceed the state of the art in various areas. We will present the results, some of which are quite compelling. The contest details are available at:

http://code.opencv.org/projects/opencv/wiki/VisionChallenge

*Prizes:*
* Win: $1000; Submit code: $3000
* Win: $1000; Submit code: $3000
* Win: $1000; Submit code: $3000
* Win: $1000; Submit code: $3000
* Win: $1000; Submit code: $3000


h3. *Winners by Category (all are winners; the order does not indicate priority):*

*Tracking* -- _No pull requests_
* Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg (Discriminative Scale Space Tracker) - DCFSIR, DSST, fDSST
** Took first place in the more correct general table; it also has variants that are faster but perform worse.
** No pull request yet

*Image registration*
* Gil Levi and Tal Hassner - LATCH descriptor -- +Accepted pull request+
** State of the art among binary descriptors in accuracy (though its speed is slower). It also outperforms several floating-point descriptors while running much faster than they do.
** Pull request done!

* Olexa Bilaniuk, Hamid Bazargani, and Robert Laganière - RHO for findHomography() -- +Accepted pull request+
** Comparable results to RANSAC with a 25x speedup.
** Already in OpenCV!

*Image segmentation*
* Beat Kueng, Alex Locher, Michael Van den Bergh, Gemma Roig, Xavier Boix, Luc Van Gool - SEEDS Superpixel -- +Accepted pull request+
** State-of-the-art superpixel algorithm in terms of accuracy and computational performance
** Already in OpenCV!

*Gesture recognition*
* Natalia Neverova, Christian Wolf, Graham Taylor and Florian Nebout - ModDrop: adaptive multi-modal gesture recognition -- _No pull requests_
** State of the art on the ChaLearn 2014 gesture challenge (ChaLearn Looking at People, ECCV 2014).
** No pull request. This depends on the GSoC 2015 project's ability to read in any deep net and run it.

h3. More Details

Results for submitted functions on VOT 2014 (algorithms are described "in this paper":http://www.votchallenge.net/vot2014/download/vot_2014_paper.pdf):
Top-10 results from general table with all available algorithms:
!full-top10.png!
Results for algorithms submitted for the contest:
!VOC_submitted_results.png!


h2. Proposers

* *Dr. Gary Rost Bradski,* Chief Scientist, Computer Vision and AI at Magic Leap, Inc.
** [email protected]
* *Vadim Pisarevsky*, Principal Engineer at Itseez
** [email protected]
* *Vincent Rabaud*, Perception Team Manager at Aldebaran Robotics
** [email protected]
* *Grace Vesom*, 3D Vision Senior Engineer at Magic Leap, Inc.
** [email protected]

h2. Presenters

*Dr. Gary Rost Bradski* is Chief Scientist of Computer Vision at Magic Leap. Gary founded OpenCV at Intel Research in 2000 and is currently CEO of the nonprofit OpenCV.org. He ran the vision team for Stanley, the autonomous vehicle that completed and won the $2M DARPA Grand Challenge robot race across the desert. Dr. Bradski helped start up NeuroScan (sold to Marmon), Video Surf (sold to Microsoft), and Willow Garage (absorbed into Suitable Tech). In 2012, he founded Industrial Perception (sold to Google, August 2013). Gary has more than 100 publications and more than 30 patents, and is co-author of _Learning OpenCV: Computer Vision with the OpenCV Library_ (O'Reilly Press), a bestseller in its category.

*Vadim Pisarevsky* is the chief architect of OpenCV. He graduated from the NNSU Cybernetics Department in 1998 with a Master’s degree in Applied Math. Afterwards, Vadim worked as a software engineer and team leader of the OpenCV project at Intel Corp. from 2000 to 2008. Since May 2008 he has been an employee of Itseez Corp. and now works full time on OpenCV under a Magic Leap contract.

*Vincent Rabaud* is the perception team manager at Aldebaran Robotics. He co-founded the non-profit OpenCV.org with Gary Bradski in 2012 while a research engineer at Willow Garage. His research interests include 3D processing, object recognition and anything that involves underusing CPUs by feeding them fast algorithms. Dr. Rabaud completed his PhD at UCSD, advised by Serge Belongie. He also holds an MS in space mechanics and space imagery from SUPAERO and an MS in optimization from the Ecole Polytechnique.

*Grace Vesom* is a senior engineer in 3D vision at Magic Leap and Director of Development for the OpenCV Foundation. Previously, she was a research scientist at Lawrence Livermore National Laboratory working on global security applications and completed her DPhil at the University of Oxford in 2010.


h1. (1) Details of the People's Choice Winning Paper -- vote using the CVPR app

You may vote for as many papers as you want. This is the people's choice of the algorithms they'd most like to have. We will award the winners and encourage them to submit their working code to "opencv_contrib":https://github.com/Itseez/opencv_contrib

h2. Voting instructions:

On your phone,
* search for the official CVPR app; its name is CVF (Computer Vision Foundation). Download it.
* or, on your computer, go to the online app "http://www.cvfapp.org":http://www.cvfapp.org/

Open the app; you'll have to log in the first time.

On the home screen:

!App11.png!

Click on the calendar view (lower-left blue icon), then scroll down to the current day:

!Day22.png!

Find the talk, paper, or demo that you like:

!Chose33.png!

Click on the PeoplesChoiceVote square:

!Vote44.png!

We will record this vote. Vote for the papers you'd really love or need to see implemented. We will award extra money to encourage people to submit working code for these popular algorithms!

h1. (2) Details of the State of the Art Vision Challenge:

OpenCV is launching a community-wide challenge to update and extend the OpenCV library with state-of-the-art algorithms. An award pool of $20,000 will be provided to the best performing algorithms in the following 11 CV application areas:

# image segmentation
# image registration
# human pose estimation
# SLAM
# multi-view stereo matching
# object recognition
# face recognition
# gesture recognition
# action recognition
# text recognition
# tracking

We prepared code to read from existing data sets in each of these areas: "modules/datasets":http://docs.opencv.org/master/modules/datasets/doc/datasets.html

h2. Conditions:

The OpenCV Vision Challenge Committee will select up to five best entries.

# You may submit a new algorithm developed by yourself.
# You may submit an existing algorithm *whether or not developed by yourself* (as long as you own or re-implement it yourself).
# Up to 5 winning algorithms will receive $1000 each.
# For an additional $3000 to $15,000*, you must submit your winning code as an OpenCV pull request under a BSD or compatible license by December 1st.
** You acknowledge that your code may be included, with citation, in OpenCV.

You may explicitly enter code for any work you have submitted to CVPR 2015 or its workshops. We will not unveil it until after CVPR.

Winners and prizes are at the sole discretion of the committee.

*List of selected datasets and other details described here:* "OpenCV Vision Challenge":http://code.opencv.org/attachments/1672/OpenCVVisionChallenge.pdf

??* We will have a professional programmer assist people with their pull requests. Awards are at the prize committee's sole discretion??

h2. Timeline:

*Submission Period:*
_Now - May 15th 2015_

*Winners Announcement:*
_June 8th 2015 at CVPR 2015_

*Submit pull request:*
_December 1st, 2015_

h2. Contact:

[email protected]

h2. Q&A:

*Q.:* _What should be in the performance evaluation report? Should we send a report or paper along with the code?_
*A.:* Participants are required to send source code and a performance evaluation report for their algorithms. The report should be in the standard form of a paper with an algorithm description. Evaluation should be performed on at least one of the chosen benchmark datasets associated with the building block. The evaluation methodology should be the same as specified by the author of each dataset; this includes using the same train/validation/test splits, evaluation metrics, etc. In addition, we ask you to report the running time of the algorithm and platform details to help with comparison. The algorithm's accuracy should be compared with state-of-the-art algorithms. It will also be useful to compare it with algorithms implemented in OpenCV whenever possible. Source code and supplied documentation should contain a clear description of how to reproduce the evaluation results. Source code must compile and run under Ubuntu 14.
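
One possible way to collect the running time and platform details requested above is sketched below, using only the Python standard library. The names @benchmark@ and @placeholder_algorithm@ are hypothetical, the sorting call only stands in for a real submission, and the report fields are an assumption rather than a format required by the committee.

```python
import json
import platform
import statistics
import time


def benchmark(func, inputs, repeats=5):
    """Time `func` over all `inputs`, `repeats` times, and return a report dict."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        for item in inputs:
            func(item)
        timings.append(time.perf_counter() - start)
    return {
        "repeats": repeats,
        "num_inputs": len(inputs),
        "median_seconds": statistics.median(timings),
        "min_seconds": min(timings),
        # Platform details so that timings can be compared across machines.
        "platform": {
            "system": platform.system(),
            "release": platform.release(),
            "machine": platform.machine(),
            "processor": platform.processor(),
            "python": platform.python_version(),
        },
    }


def placeholder_algorithm(values):
    # Stand-in for the submitted algorithm (hypothetical).
    return sorted(values)


inputs = [list(range(1000, 0, -1)) for _ in range(10)]
report = benchmark(placeholder_algorithm, inputs)
print(json.dumps(report, indent=2))
```

Reporting the median over several repeats, alongside the minimum, makes the figure less sensitive to one-off system noise than a single timed run.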

*Q.:* _Can I participate in this Vision Challenge by addressing building blocks different from the current 11 categories?_
*A.:* For this Vision Challenge, we have selected 11 categories and 21 supporting datasets. To participate in the Vision Challenge you need to address at least one of the building blocks we have selected and get results on at least one of the chosen associated datasets. Results on additional datasets (e.g., depth channel) will be evaluated accordingly by the awarding committee.
This may be just the first of a series of challenges, and we want to hear from the vision community which building blocks should come next. Please send your suggestions to our e-mail: [email protected].

*Current propositions list:*
* Background Subtraction - 1 vote
* Point Cloud Registration - 1 vote
* Pedestrian Detection - 1 vote
* Text Recognition for Arabic language - 1 vote

*Q.:* _Which external algorithms or libraries can we use?_
*A.:* All third-party code used should have a permissive free software license. The most popular such licenses are: BSD, Apache 2.0, MIT.

*Q.:* _I can't find tracking dataset loading in the opencv_contrib/modules/datasets module._
*A.:* We have not implemented loading and evaluation code for the VOT tracking dataset, because it already has its own "toolkit":http://www.votchallenge.net/vot2014/participation.html.

*Q.:* _Where can I find the dataset benchmarks?_
*A.:* They are placed with samples in "modules/datasets/samples":https://github.com/Itseez/opencv_contrib/tree/master/modules/datasets/samples.

h3. Back to Developer page:

"OpenCV":http://code.opencv.org/projects/opencv/wiki