It seems to me that Parsey has severe issues with correctly tagging questions and any sentence with "is" in it.
Text: Is Barrack Obama from Hawaii?
GCloud Tokens (correct):
- Is - [root] VERB
- Barrack - [nn] NOUN
- Obama - [nsubj] NOUN
- from - [adp] PREP
- Hawaii - [pobj] NOUN
Parsey Tokens (wrong):
- Is - [cop] VERB
- Barrack - [nsubj] NOUN
- Obama - [root] NOUN
- from - [adp] PREP
- Hawaii - [pobj] NOUN
Parsey decides to make the noun (!) Obama the root, which messes up everything else.
Text: My name is Philipp
GCloud Tokens (correct):
- My [poss] PRON
- name [nsubj] NOUN
- is [root] VERB
- Philipp [attr] NOUN
ParseyTokens (incorrect):
- My [poss] PRON
- name [nsubj] NOUN
- is [cop] VERB
- Philipp [root] NOUN
Again parsey chooses the NOUN as root and struggles with COP.
Any ideas why this is happening and how I could fix it?
Thanks, Phil