I am new to spaCy. I noticed that the documentation for all of the en_core_web models lists the following NER categories:
'CARDINAL',
'DATE',
'EVENT',
'FAC',
'GPE',
'LANGUAGE',
'LAW',
'LOC',
'MONEY',
'NORP',
'ORDINAL',
'ORG',
'PERCENT',
'PERSON',
'PRODUCT',
'QUANTITY',
'TIME',
'WORK_OF_ART'
I need to access the raw data used to assign each word its category. In other words, what is the list of words labelled 'WORK_OF_ART', and is this list available?
The reason I ask is that I want to build a custom model that uses some of the default NER categories as well as my own.