Due to complicated backgrounds and unclear target orientation, automated object detection is difficult in the field of archaeology. Most of the current convolutional neural network (CNN) object‐oriented detection techniques are based on a faster region‐based CNN (R‐CNN) and other one‐stage detectors that often lack adequate processing speeds and detection accuracies. Recently, the two‐stage detector Mask R‐CNN technique achieved impressive results in object detection and instance segmentation problems and was successfully applied in the analysis of archaeological airborne laser scanning (ALS) data. In this study, we outline a modified Mask R‐CNN technique that reliably and efficiently detects relict charcoal hearth (RCH) sites on light dete...