Skip to main content
Version: 25.2

operators.spacy.SpacyTokenizer

class operators.spacy.SpacyTokenizer(text_field, tokens_field=None)

Preprocessor that parses document and adds tokens column.

Used by Sequence Tagging applications to add additional document metadata.

Parameters

NameTypeDefaultInfo
text_fieldstrThe field to parse with spacy.
tokens_fieldOptional[str]NoneThe field in which to store the tokens list object.