Volume XLII-2/W12
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W12, 237-241, 2019
https://doi.org/10.5194/isprs-archives-XLII-2-W12-237-2019
© Author(s) 2019. This work is distributed under
the Creative Commons Attribution 4.0 License.
Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W12, 237-241, 2019
https://doi.org/10.5194/isprs-archives-XLII-2-W12-237-2019
© Author(s) 2019. This work is distributed under
the Creative Commons Attribution 4.0 License.

  09 May 2019

09 May 2019

DETECTION OF A HUMAN HEAD ON A LOW-QUALITY IMAGE AND ITS SOFTWARE IMPLEMENTATION

D. Yudin, A. Ivanov, and M. Shchendrygin D. Yudin et al.
  • Dept. of Technical Cybernetics, Belgorod State Technological University named after V.G. Shukhov, Belgorod, Russia

Keywords: Image recognition, Human head, Detection, Deep learning, Convolutional neural network, Software

Abstract. The paper considers the task solution of detection on two-dimensional images not only face, but head of a human regardless of the turn to the observer. Such task is also complicated by the fact that the image receiving at the input of the recognition algorithm may be noisy or captured in low light conditions. The minimum size of a person’s head in an image to be detected for is 10 × 10 pixels. In the course of development, a dataset was prepared containing over 1000 labelled images of classrooms at BSTU n.a. V.G. Shukhov. The markup was carried out using a segmentation software tool specially developed by the authors. Three architectures of convolutional neural networks were trained for human head detection task: a fully convolutional neural network (FCN) with clustering, the Faster R-CNN architecture and the Mask R-CNN architecture. The third architecture works more than ten times slower than the first one, but it almost does not give false positives and has the precision and recall of head detection over 90% on both test and training samples. The Faster R-CNN architecture gives worse accuracy than Mask R-CNN, but it gives fewer false positives than FCN with clustering. Based on Mask R-CNN authors have developed software for human head detection on a lowquality image. It is two-level web-service with client and server modules. This software is used to detect and count people in the premises. The developed software works with IP cameras, which ensures its scalability for different practical computer vision applications.