Volume XLII-1/W1
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-1/W1, 297-304, 2017
https://doi.org/10.5194/isprs-archives-XLII-1-W1-297-2017
© Author(s) 2017. This work is distributed under
the Creative Commons Attribution 3.0 License.
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-1/W1, 297-304, 2017
https://doi.org/10.5194/isprs-archives-XLII-1-W1-297-2017
© Author(s) 2017. This work is distributed under
the Creative Commons Attribution 3.0 License.

  31 May 2017

31 May 2017

VARIABLE SELECTION FOR ROAD SEGMENTATION IN AERIAL IMAGES

S. Warnke and D. Bulatov S. Warnke and D. Bulatov
  • Fraunhofer IOSB, Ettlingen, Germany

Keywords: Classification, Feature Selection, Logistic Regression, Random Forests, Road Extraction

Abstract. For extraction of road pixels from combined image and elevation data, Wegner et al. (2015) proposed classification of superpixels into road and non-road, after which a refinement of the classification results using minimum cost paths and non-local optimization methods took place. We believed that the variable set used for classification was to a certain extent suboptimal, because many variables were redundant while several features known as useful in Photogrammetry and Remote Sensing are missed. This motivated us to implement a variable selection approach which builds a model for classification using portions of training data and subsets of features, evaluates this model, updates the feature set, and terminates when a stopping criterion is satisfied. The choice of classifier is flexible; however, we tested the approach with Logistic Regression and Random Forests, and taylored the evaluation module to the chosen classifier. To guarantee a fair comparison, we kept the segment-based approach and most of the variables from the related work, but we extended them by additional, mostly higher-level features. Applying these superior features, removing the redundant ones, as well as using more accurately acquired 3D data allowed to keep stable or even to reduce the misclassification error in a challenging dataset.