Volume XLII-2
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2, 707-714, 2018
https://doi.org/10.5194/isprs-archives-XLII-2-707-2018
© Author(s) 2018. This work is distributed under
the Creative Commons Attribution 4.0 License.
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2, 707-714, 2018
https://doi.org/10.5194/isprs-archives-XLII-2-707-2018
© Author(s) 2018. This work is distributed under
the Creative Commons Attribution 4.0 License.

  30 May 2018

30 May 2018

ETALON IMAGES: UNDERSTANDING THE CONVOLUTION NEURAL NETWORKS

V. V. Molchanov, B. V. Vishnyakov, V. S. Gorbatsevich, and Y. V. Vizilter V. V. Molchanov et al.
  • FGUP «State Research Institute of Aviation Systems», 125319, Moscow, Viktorenko street, 7, Russia

Keywords: CNN, deep learning, manifold learning, affine transformations, graphs, etalons

Abstract. In this paper we propose a new technic called etalons, which allows us to interpret the way how convolution network makes its predictions. This mechanism is very similar to voting among different experts. Thereby CNN could be interpreted as a variety of experts, but it acts not like a sum or product of them, but rather represent a complicated hierarchy. We implement algorithm for etalon acquisition based on well-known properties of affine maps. We show that neural net has two high-level mechanisms of voting: first, based on attention to input image regions, specific to current input, and second, based on ignoring specific input regions. We also make an assumption that there is a connection between complexity of the underlying data manifold and the number of etalon images and their quality.