spaCy is having trouble with POS tagging in Korean. When I set a breakpoint in the code and run the sample sentences from the spaCy documentation, the parser handles them perfectly, so this may be a more insidious bug.
The example below should be split into several tokens, but instead it comes back as one big token.
실제적인효용은크않을지몰라도
Attributes for Token: 실제적인효용은크않을지몰라도
Tag: ncn+jp+ecs
POS: VERB
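
For anyone trying to reproduce this, here is a minimal sketch of the kind of loop that would print the output above. It relies only on the standard spaCy token attributes `text`, `tag_` and `pos_`. The helper name `describe_tokens` is hypothetical, and the stand-in `doc` is built by hand from the values reported above so that the snippet runs without a Korean pipeline installed. It is not output from a real model.

```python
from types import SimpleNamespace


def describe_tokens(doc):
    """Return one formatted report per token of a spaCy-style Doc.

    `doc` can be any iterable of objects exposing `text`, `tag_` and `pos_`,
    which are the attribute names spaCy tokens use.
    """
    reports = []
    for token in doc:
        reports.append(
            f"Attributes for Token: {token.text}\n"
            f"Tag: {token.tag_}\n"
            f"POS: {token.pos_}"
        )
    return reports


# Stand-in for the Doc described in this report (hand-built, not real output).
text = "실제적인효용은크않을지몰라도"
doc = [SimpleNamespace(text=text, tag_="ncn+jp+ecs", pos_="VERB")]

# repr() makes it easy to see exactly what string reached the pipeline.
print("Input:", repr(text))
print("Token count:", len(doc))
for report in describe_tokens(doc):
    print(report)
```

With a real pipeline, the same function can be called as `describe_tokens(nlp(text))`, where `nlp` is whatever Korean pipeline is loaded. Comparing the token count for this sentence against the count for one of the documentation samples should show whether the problem depends on the input.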