Automatic Large-Scale Data Acquisition via Crowdsourcing for Crosswalk Classification: A Deep Learning Approach

De LCAD
Revisão de 13h27min de 10 de outubro de 2017 por Rodrigo Berriel (discussão | contribs)
(dif) ← Edição anterior | Revisão atual (dif) | Versão posterior → (dif)
Ir para: navegação, pesquisa

Authors: Rodrigo F. Berriel, Franco Schmidt Rossi, Alberto F. de Souza, Thiago Oliveira-Santos

DOI: 10.1016/J.CAG.2017.08.004

PDF: [1]

Published in Computers & Graphics

Abstract

Crosswalk-streetview-graphical-abstract.png

Correctly identifying crosswalks is an essential task for the driving activity and mobility autonomy. Many crosswalk classification, detection and localization systems have been proposed in the literature over the years. These systems use different perspectives to tackle the crosswalk classification problem: satellite imagery, cockpit view (from the top of a car or behind the windshield), and pedestrian perspective. Most of the works in the literature are designed and evaluated using small and local datasets, i.e. datasets that present low diversity. Scaling to large datasets imposes a challenge for the annotation procedure. Moreover, there is still need for cross-database experiments in the literature because it is usually hard to collect the data in the same place and conditions of the final application. In this paper, we present a crosswalk classification system based on deep learning. For that, crowdsourcing platforms, such as OpenStreetMap and Google Street View, are exploited to enable automatic training via automatic acquisition and annotation of a large-scale database. Additionally, this work proposes a comparison study of models trained using fully-automatic data acquisition and annotation against models that were partially annotated. Cross-database experiments were also included in the experimentation to show that the proposed methods enable use with real world applications. Our results show that the model trained on the fully-automatic database achieved high overall accuracy (94.12%), and that a statistically significant improvement (to 96.30%) can be achieved by manually annotating a specific part of the database. Finally, the results of the cross-database experiments show that both models are robust to the many variations of image and scenarios, presenting a consistent behavior.

Videos

See the IARA, GOPRO, and NIGHT dataset videos here.


Source-code and Models

Available soon: GitHub.

BibTeX

 @ARTICLE{Berriel2017cag, 
   author    = {Rodrigo Ferreira Berriel and Franco Schmidt Rossi and Alberto Ferreira de Souza and Thiago Oliveira-Santos}, 
   journal   = {Computers & Graphics},
   issn      = {0097-8493}
   title     = {Automatic Large-Scale Data Acquisition via Crowdsourcing for Crosswalk Classification: A Deep Learning Approach},
   volume    = {68}
   year      = {2017},
   month     = {Nov},
   pages     = {32-42},
   doi       = {10.1016/J.CAG.2017.08.004}
 }