Figure 4.
Workflow performance versus random (unskilled) annotator. The filled grey area shows the distribution of semsim scores when TO terms were selected at random, while the color lines show the distribution of scores by each workflow. The number of terms selected at random for a descriptor was always the same as the number of terms annotated by the workflow. This was repeated 10 000 times, so the black line is a mean distribution