operators.candidates.extractor.EmailAddressSpanFeaturizer
- class operators.candidates.extractor.EmailAddressSpanFeaturizer(field, col_suffix=None)
Extracts spans (slices of documents) that contain email addresses (using regex)
This operator uses a regex to extract all spans from the parent document that contain properly formatted email addresses according to RFC6530.
Parameters
Parameters
Name Type Default Info field strThe dataframe column to extract email address spans from. col_suffix Optional[str]NoneAn optional suffix for the column containing the extracted spans.