snorkelflow.operators.dask_extractor
- class snorkelflow.operators.dask_extractor(*, input_schema, name=None, resources=None, resources_fn=None, output_schema=None)
Bases:
dask_operatorDecorator to define Dask Extractor from a function.
Parameters
Parameters
Name Type Default Info name Optional[str]NoneName of the Operator. resources Optional[Dict[str, Any]]NoneResources passed in to fviakwargsresources_fn Optional[Callable[[], Dict[str, Any]]]NoneA function for generating a dictionary of values passed to fviakwargs, that are too expensive to serialize as resources.input_schema Dict[str, Any]Dictionary mapping from column to dtype, used to validate the dtypes of the input dataframe. output_schema Optional[Dict[str, Any]]NoneDictionary mapping from column to dtype, used to validate the dtypes of the output dataframe.
If not
None, thenfmust not delete any dataframe columns, and all new columns must be specified along with types inoutput_schema.- __init__(*, input_schema, name=None, resources=None, resources_fn=None, output_schema=None)
\_\_init\_\_
__init__
Methods
__init__(*, input_schema[, name, resources, ...])