Upload File for Extraction
Upload a file and trigger an extraction run
Uploads a file and initiates an extraction process using the specified template.
Usage Notes
- Maximum file size: 1GB
- Files are processed according to the specified template
- Use the
metadata
parameter to include custom data (like user IDs, reference numbers) that will be preserved across all extraction API responses and webhooks - Metadata is useful for tying back extractions to your systems, correlation, and application integration purposes
- Configure webhooks for asynchronous notifications when extractions complete
Request
The file to upload and process. Supported formats include PDF (.pdf), Excel (.xlsx, .xls), and CSV (.csv). Only one file can be uploaded per request.
The ID of the template to use for mapping the document data during extraction.
Optional custom filename to use for the uploaded file. If not provided, the original filename will be used.
Optional JSON string containing custom metadata to associate with this extraction. This can include any information you need to reference, such as user IDs, order numbers, or other contextual data. The metadata will be included in all extraction responses (API endpoints and webhooks), making it useful for correlating extractions with your application.
Example: {"user_id": "123", "reference": "INV-2023-04-15", "source": "mobile-app"}
Response
The unique identifier for the new extraction.
The ID of the workspace this extraction belongs to.
The ID of the template used for the extraction. You can also use auto
as the template_id
and TableFlow will select
the best template based on the template’s allowed file types.
The current status of the extraction, typically “processing” for a new upload.
Additional metadata associated with the extraction.
Unix timestamp when the extraction was created.
Unix timestamp when the extraction was last updated.
Error Responses
Error message describing what went wrong.
What Happens After Upload
After successfully uploading a file, the extraction process follows these steps:
- Processing - The file is being analyzed and data is being extracted
- Completed - Data extraction has finished successfully
- Failed - An error occurred during extraction
You can check the status of an extraction using the Get Extraction endpoint:
For real-time notifications when extractions complete, configure webhooks to receive events.
File Type Support
TableFlow supports the following file types:
PDFs
- Digital (text-based) PDFs
- Scanned (image-based) PDFs
- Multi-page documents
Spreadsheets
- Excel files (.xlsx, .xls)
- CSV files (.csv)
- Multi-sheet workbooks