The Annotator: Difference between revisions

Latest revision as of 17:53, 4 June 2019

by Algolit

The annotator asks for the guidance of visitors in annotating the archive of Mundaneum.

The annotation process is a crucial step in supervised machine learning where the algorithm is given examples of what it needs to learn. A spam filter in training will be fed examples of spam and real messages. These examples are entries, or rows from the dataset with a label, spam or non-spam.

The labelling of a dataset is work executed by humans, they pick a label for each row of the dataset. To ensure the quality of the labels multiple annotators see the same row and have to give the same label before an example is included in the training data. Only when enough samples of each label have been gathered in the dataset can the computer start the learning process.

In this interface we ask you to help us classify the cleaned texts from the Mundaneum archive to expand our training set and improve the quality of the installation 'Classifying the World' in Oracles.

Concept, code, interface: Gijs de Heij

@@ Line 1: / Line 1: @@
 by Algolit
-The annotator asks for the guidance of the visitor in annotating the archive of Mundaneum.
+[https://gitlab.constantvzw.org/algolit/mundaneum/tree/master/exhibition/4-Informants/the-annotator Sources on Gitlab]
-The annotation process is a crucial step in supervised machine learning where the algorithm is given examples of what it needs to learn. A spam filter in training will be fed examples of spam and real messages. These examples are entries, or rows from the dataset with a label, spam or not-spam.
+The annotator asks for the guidance of visitors in annotating the archive of Mundaneum.
-The labelling of a dataset is work executed by humans, they pick a label for each row of the dataset. To ensure the quality of the labels multiple annotators see the same row and have to give the same label before an example is included in the training data. Only once enough samples of each label have been gathered in the dataset, the computer can start the learning process.
+The annotation process is a crucial step in supervised machine learning where the algorithm is given examples of what it needs to learn. A spam filter in training will be fed examples of spam and real messages. These examples are entries, or rows from the dataset with a label, spam or non-spam.
+The labelling of a dataset is work executed by humans, they pick a label for each row of the dataset. To ensure the quality of the labels multiple annotators see the same row and have to give the same label before an example is included in the training data. Only when enough samples of each label have been gathered in the dataset can the computer start the learning process.
 In this interface we ask you to help us classify the cleaned texts from the Mundaneum archive to expand our training set and improve the quality of the installation 'Classifying the World' in Oracles.

The Annotator: Difference between revisions

From Algolit

Latest revision as of 17:53, 4 June 2019