Fig. 2

Detection performance (F1 score) calculated using different numbers of training images for: a the TAMU2015 dataset, b the UGA2015 dataset, and c the UGA2018 dataset. When \(\text{IOU}_{\text{all}}\) (a stricter metric) was used, model performance clearly increased as more training images were used.