Object recognition and detection are well-studied problems with a developed set of almost standard solutions.
This paper proposes a new architecture based on an artificial convolutional neural network and semantic segmentation for the recognition and detection of identity documents.
The research aims to evaluate the deep learning detection model trained on a mobile identity document video dataset, achieving an accuracy above 0.75 for the intersection over union (IoU) threshold value of 0.8.
The study also verifies the feasibility of running the model on industrial one-board microcomputer or smartphone hardware.