Train-your-own Extraction Model#
Our API Hub offers a wide variety of pre-built extraction workflows. However, your specific business problem may not be included. With our train-your-own extraction capabilities, you can solve any extraction use case outside of our provided models. You have complete control over the entities you want to extract and can get a model up and running in no time.
The steps required to build your own extraction model are fairly simple and straightforward. Just follow our simple wizard:
- Define the entities you want to extract from your documents
- Upload your documents: You can specify individual templates, or just upload all your documents in one place if they
don't follow specific templates)
- Annotate your documents by selecting an entity on the left and assigning it to the text boxes on the right.
- Train your model and explore the analysis for each entity to see what can be improved.
You can check this blog post to see the steps in detail.
Once training is complete, this workflow will provide you with extractions
and ocr
. Your workflow
identifier is a UUID that is automatically generated during the creation process, i.e. you can call the workflow
like any other workflow using the
at the processing endpoint
i.e. POST /processing/{your_workflow_identifier}
.
Supported return values#
As automatically included in the response JSON, unless otherwise specified via include
query parameters.
ocr
, see OCR Formatextractions
, see below
Credit cost#
A Freemium account allows for up to 100 pages per month, where the cost is 80 credits per page, and 120 credits per document.
Note
A document is usually a bundle of 10 pages.
Extractions format#
The value of the extractions
key for this workflow is based on your specifications in the "Define Fields" wizard step.
Note
For a reference of the structure of each of the extractions
objects see
Extracted Values.
Also, for accessing individual processing results or artifacts, have a look
at Fetch Processing Results and Artifacts.
Important
The structure of extractions
might contain optional paths.
See this and
this part of the documentation.
Previewing Workflow Updates#
natif.ai is constantly improving the model architectures and baselines for self-trained workflows which sometimes requires (beneficial) updates to existing workflows. In order to not interfere with productive usage of your workflow, natif.ai will inform you in advance by email about such updates and will provide a preview version of the upcoming workflow update for you to try out before the automatic migration.
Please refer to the preview endpoint documentation to make use of the endpoint to test the upcoming version of your workflow for production usage and provide feedback to us.