snorkelflow.ingest.conversation_json_to_parquet
- snorkelflow.ingest.conversation_json_to_parquet(input_json_file_path, output_parquet_file_path)
Generates a SnorkelFlow-ingestible Parquet file from a JSON file. Every column is converted to str using json.dumps(), so call json.loads() on a column's values if you need the original objects later.
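The string conversion described above can be illustrated with the standard library alone. This is a minimal sketch, not the function's actual implementation: the row contents and the column names `id` and `turns` are hypothetical, and the exact layout of the converted file is not shown here.

```python
import json

# Hypothetical row: the column names and values are illustrative only.
original = {"id": 7, "turns": [{"speaker": "user", "text": "hi"}]}

# Mimic the documented behavior: each column value becomes a JSON string.
stored = {column: json.dumps(value) for column, value in original.items()}

# Every stored value is now a plain str, including the numeric "id" column.
assert all(isinstance(value, str) for value in stored.values())

# Decode with json.loads() before treating a value as a list, dict, or number.
restored = {column: json.loads(value) for column, value in stored.items()}
```

After decoding, `restored` holds the same nested objects as `original`, so list and dict operations work again.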
Note: Since v0.73, this function no longer adds the output Parquet file to a dataset. Please use create_datasource to do so.