Add your Unstructured.io API key.
You can get an API key at https://unstructured.io/#get-api-key
Select the files you want to process.
Select the strategy for processing your documents.
Four strategies are available for processing PDF and image files: hi_res, fast, ocr_only, and auto. The default strategy, fast, works well for documents that do not have text embedded in images.
Fast
Default
High Resolution
The better choice for PDFs that may have text within embedded images, or for achieving greater precision of element types in the response JSON.
OCR Only
Runs the document through Tesseract for OCR. Note that hi_res currently has difficulty ordering elements in documents with multiple columns.
Auto
The best of all worlds: auto determines when a page can be extracted using fast or ocr_only mode; otherwise, it falls back to hi_res.
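When calling the hosted API directly rather than through this page, the strategy choice above would typically be sent as a form field alongside the file. The following is a minimal sketch, not the app's own code. The endpoint URL, the `unstructured-api-key` header, and the `files` and `strategy` field names are assumptions to check against the current API documentation, and `build_request` and `submit` are hypothetical helper names. It relies on the third-party `requests` package for the upload.

```python
from pathlib import Path

# Strategy names as listed above.
STRATEGIES = ("fast", "hi_res", "ocr_only", "auto")

# Assumed endpoint for the hosted API; verify against the current API docs.
API_URL = "https://api.unstructured.io/general/v0/general"


def build_request(api_key: str, strategy: str = "fast") -> dict:
    """Return the headers and form fields for one processing request."""
    if strategy not in STRATEGIES:
        raise ValueError(f"unknown strategy {strategy!r}; expected one of {STRATEGIES}")
    return {
        # Assumed header name for passing the API key.
        "headers": {"unstructured-api-key": api_key, "accept": "application/json"},
        # Assumed form field name for the processing strategy.
        "data": {"strategy": strategy},
    }


def submit(path: str, api_key: str, strategy: str = "fast") -> list:
    """Upload one file and return the parsed JSON response."""
    import requests  # third-party; imported here so build_request needs no dependencies

    req = build_request(api_key, strategy)
    with Path(path).open("rb") as fh:
        resp = requests.post(
            API_URL,
            headers=req["headers"],
            data=req["data"],
            files={"files": (Path(path).name, fh)},  # assumed multipart field name
            timeout=300,
        )
    resp.raise_for_status()
    return resp.json()
```

A call such as `submit("report.pdf", api_key, strategy="hi_res")` would then correspond to picking High Resolution in the list above; the file name here is only a placeholder.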
Submit documents for processing
Explore and download the generated JSON.
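Once the JSON has been downloaded, a quick way to explore it is to count elements by type and pull out the text of one type. The sketch below assumes the response is a list of objects with `type` and `text` keys. The sample data is made up for illustration and is not real output, and `summarize` and `text_of` are hypothetical helper names.

```python
import json
from collections import Counter

# Illustrative sample only; a real download would be read from a file instead.
sample = """[
  {"type": "Title", "text": "Quarterly Report"},
  {"type": "NarrativeText", "text": "Revenue grew in the third quarter."},
  {"type": "NarrativeText", "text": "Costs were flat."}
]"""


def summarize(elements: list) -> Counter:
    """Count how many elements of each type the response contains."""
    return Counter(el.get("type", "Unknown") for el in elements)


def text_of(elements: list, element_type: str) -> list:
    """Return the text of every element with the given type, in document order."""
    return [el.get("text", "") for el in elements if el.get("type") == element_type]


elements = json.loads(sample)
counts = summarize(elements)
titles = text_of(elements, "Title")

for element_type, n in counts.most_common():
    print(f"{element_type}: {n}")
```

Comparing these counts across strategies on the same file is one simple way to see how the choice of strategy changes the detected element types.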