Micro-average performance metrics for the labeling of pairs of references. The three metrics are calculated so that they represent the original distribution of Jaccard similarities in the method by resampling from the stratified sample. Each evaluated pair of references contributes equally to the score (regardless of the strata they belong to)