doccano is an annotation tool for machine learning data. Its creator has also developed doccano-transformer, which converts the JSONL output that doccano produces into the spaCy format.

Here is the GitHub repository: https://github.com/doccano/doccano-transformer

The code:

dataset.to_spacy(tokenizer=str.split)

gives me a generator object, and I don't know what to do with it. It is clearly not the spaCy-format data I expected. Does anyone know how to convert this generator into the spaCy format?
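For context, a Python generator is lazy: it produces nothing until it is iterated, so the usual way to get at its contents is a `for` loop or `list(...)`. Below is a minimal sketch of that idea. It does not call doccano-transformer; `fake_to_spacy` is a hypothetical stand-in for `dataset.to_spacy(tokenizer=str.split)`, and the shape of the yielded dictionaries is an assumption for illustration only, not the library's actual output.

```python
from typing import Dict, Iterator, List


def fake_to_spacy(texts: List[str]) -> Iterator[Dict]:
    """Hypothetical stand-in for dataset.to_spacy(tokenizer=str.split).

    Yields one record per document; the record layout is assumed.
    """
    for doc_id, text in enumerate(texts):
        yield {"id": doc_id, "tokens": text.split()}


gen = fake_to_spacy(["Hello world", "spaCy is fun"])

# Materialize the generator: each record is produced only as it is consumed.
records = list(gen)

# A generator can be consumed only once; a second pass yields nothing.
leftover = list(gen)

for record in records:
    print(record)
```

If the real generator behaves the same way, `list(dataset.to_spacy(tokenizer=str.split))` should give a list of records that can then be inspected or written to a file with `json.dump`.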

Ajeet Verma