Skip to main content
Version: 0.91

Data and application limit requirements

This page is a reference for data and application limit requirements for a standard Snorkel Flow installation. When creating uploading new datasets and creating new applications, please refer to the these class, dataset, and datapoint limits for the best performance.

Snorkel Flow enforces a per-file limit size of 100 MB via the user interface. If you are uploading datasets via the SDK, this limit does not apply.

 # Unique classesMax total dataset sizeMax datapoint size (i.e., single row)GPU required?
Text classification (multi-class)2 - 1002 GB10 KBRecommended
Text classification (multi-label)1 - 1002 GB10 KBRecommended
Text extraction  (candidate-based)2 - 1002 GB10 KBRecommended
Text extraction (sequence tagging)1 - 25250 MB10 KBRequired
PDF extraction2 - 201.6 GB1.6 MBRecommended
Image classification1 - 1020 GB500 KBRequired

Reach out to your Snorkel representative for more information about instance sizing.