operators.candidates.extractor.DateSpanExtractor
- class operators.candidates.extractor.DateSpanExtractor(field, col_suffix=None)
Extracts spans (slices of documents) that contain dates (using regex)
This operator uses a regex to extract all spans from the parent document that contain dates. The vast majority of standard date formats are supported, including ISO-8601, RFC-3339, and others.
Parameters
Parameters
Name Type Default Info field str
The dataframe column to extract date spans from. col_suffix Optional[str]
None
An optional suffix for the column containing the extracted spans.