Datasets

Annotated Dataset

2min

You are uploading a dataset (text) that already has the annotations: how is it going to work?

First of all, you need to prepare the document with the annotations in the correct format. You can either add all the annotations in one column or in multiple.

If they're all in one column each annotation has to be separated by " | ". Here are two examples:

  • For classification projects: happy|proud|joyful
  • NER/POS: if the sentece is "it’s 11am in Spain, 5am in Mexico” the annotations should look like this: time:11am|country:Spain|time:5am|country:Mexico

For further details on how to upload the annotated dataset please go to "Datasets" > "Create a Dataset".

How am I going to see these annotations?

In a project with classes, you will see the annotations already selected and marked as "original". If you go to the next task it will automatically save those annotations and will mark the task as completed. If you want to change anything you can do that too - the "original" tag will remain next to the labels that came with the dataset.

Document image


In other projects such as Part Of Speach or Named-Entity Recognition each word will be tagged and underlined with the annotations. You will be able to modify the tag but the underlined part will stay as a reference.

Document image