Background

What is a data labeling tool?

Labeled datasets are essential for supervised machine learning. A data labeling tool is a system where annotators can create labeled datasets. It typically contains a user interface where the annotator can assign labels to data points. It may also contain algorithm modules (e.g., active learning) to assist the annotation process.
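As a rough illustration of how an active learning module might assist annotation, the sketch below ranks unlabeled items by how unsure a model is about them, so the annotator sees the most uncertain items first. It uses least-confidence sampling, which is only one of several possible strategies. The function name `least_confident`, the item ids, and the probabilities are hypothetical and not taken from any of the tools discussed here.

```python
from typing import Dict, List


def least_confident(probabilities: Dict[str, List[float]], k: int = 1) -> List[str]:
    """Return the ids of the k items whose top class probability is lowest."""
    # Confidence of an item = probability of its most likely class.
    confidence = {item_id: max(probs) for item_id, probs in probabilities.items()}
    # Sort ascending so the least confident items come first.
    ranked = sorted(confidence, key=confidence.get)
    return ranked[:k]


# Hypothetical model outputs for three unlabeled items (two classes each).
unlabeled = {
    "img_001": [0.95, 0.05],
    "img_002": [0.55, 0.45],
    "img_003": [0.70, 0.30],
}

# Items to show the annotator next, most uncertain first.
queue = least_confident(unlabeled, k=2)
print(queue)  # ['img_002', 'img_003']
```

In a real tool the probabilities would come from a model retrained as new labels arrive; here they are fixed values chosen for the example.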

Below, we list some of the most starred open-source data labeling tools on GitHub as examples:

| Tool | Supported Data Type & Label Task |
| --- | --- |
| LabelImg | image bounding box |
| LabelMe (python) | image segmentation |
| Label Studio | multiple |
| CVAT | image & video segmentation |
| doccano | text |
| VoTT | image & video object detection |