Managing files in data storage
User files are the assets related to data points in a given data source. These files can be associated with your data sources.
Permissions
To upload user files, ensure you have these instance-level permissions:
- Direct file uploads from the computer that is running the Snorkel AI Data Development Platform.
- Remote file transfer from a cloud storage provider, such as Amazon S3 or Google Cloud Storage.
An administrator may disable upload permissions for the instance you are working on, which could prevent you from uploading or transferring user files.
To upload user files
-
To upload user files, select Data Storage.
-
Select the Import data button.
-
Choose an existing folder from the dropdown or create a new folder.
-
Choose your data source. The available sources are Amazon S3, Google Cloud Storage, and Local File Upload.
-
Select the file type from the use case dropdown. Supported use cases include uploading images, PDFs, CSVs, parquet, and arrow files.
-
Enter your file paths or select files from local storage. Select Import data to finish the data upload.
- If you select the Remote Storage option, your files are transferred in the background. You can check the status of this transfer using the Jobs icon. - If you select the Local Storage option, your files are uploaded. A loading spinner appears, and the user interface is disabled.
importantDo not close the tab. Wait for an alert to confirm the upload finished successfully. It may take several minutes to finish uploading.
Once you have uploaded or transferred files to your Snorkel AI Data Development Platform, you can associate them with your data sources when uploading new datasets.
To download files from a folder
- In the left-hand sidebar, select the file folder from which you want to download.
- Select the
...
icon next to an individual file to open the file options menu. - Select Download.
The file downloads to your browser's default download location.