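The Overall score is the macro-average of performance across tasks: each task is scored separately, and the Overall score is the unweighted mean of those per-task scores, so every task counts equally regardless of its size. Details can be found in our publication.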
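As a rough illustration of macro-averaging, the sketch below computes an overall score as the unweighted mean of per-task scores. The function name `macro_average`, the task names, and the score values are hypothetical and chosen only for this example. The exact per-task metrics and any normalization are described in the publication and are not reproduced here.

```python
from statistics import fmean


def macro_average(task_scores: dict[str, float]) -> float:
    """Return the unweighted mean of per-task scores.

    Every task contributes equally, no matter how many
    examples it contains.
    """
    if not task_scores:
        raise ValueError("at least one task score is required")
    return fmean(task_scores.values())


# Hypothetical per-task scores, for illustration only.
example_scores = {"task_a": 80.0, "task_b": 60.0, "task_c": 70.0}
overall = macro_average(example_scores)
print(f"Overall: {overall:.1f}")
```

Because the mean is taken over tasks rather than over individual examples, a small task influences the Overall score as much as a large one.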