This section provides guidelines for structuring your Invoice Data Extraction API requests to Extracta.ai. Ensure to follow the format below for successful data extraction:
{
"extractionDetails": {
"name": "Extraction Name", // required - Name your extraction process
"language": "Supported Language", // required - Choose from the supported languages
"fields": [
{
"key": "Field Key", // required - Define the key for data extraction
"description": "Field Description", // optional - Describe the field
"example": "Field Example" // optional - Provide an example value
},
...
]
},
"file": "base64String or file URL" // required - Provide the document in base64String format or as a URL
}
In addition to the basic format outlined in the previous sections, Extracta.ai also supports more complex data structures for specialized extraction needs. This advanced format allows the definition of nested objects and arrays, catering to a broader range of data representation.
object
The object type represents a structured object with multiple properties. Each property is defined as an object within an array, and can include its own key, description, type, and example.
{
"key": "personal_info",
"description": "Personal information of the person", // optional
"type": "object",
"properties": [
{
"key": "name",
"description": "Name of the person", // optional
"example": "Alex Smith", // optional
"type": "string" // optional
},
{
"key": "email",
"description": "Email of the person",
"example": "alex.smith@gmail.com",
"type": "string"
},
.....
]
}
array
The array type is used for lists of items, such as a collection of work experiences. The items key contains an object defining the structure of each item in the array.
{
"key": "work_experience",
"description": "Work experience of the person", // optional
"type": "array",
"items": {
"type": "object",
"properties": [
{
"key": "title",
"description": "Title of the job", // optional
"example": "Software Engineer", // optional
"type": "string" // optional
},
{
"key": "start_date",
"description": "Start date of the job",
"example": "2022",
"type": "string"
},
...
]
}
}
object
and array
types, the example
parameter is applicable only for their inner properties/items.type
is specified, it defaults to string
.object
and array
types, the inner fields can only be of type string
. This means that each property within an object or each item within an array should be a string type, ensuring consistency and simplicity in data representation.Extracta.ai is capable of processing documents in image (JPG, PNG), PDF, and DOCX formats. This enhancement allows for a wider range of document types to be submitted for extraction.
Extracta.ai currently supports document extraction in the following languages: Romanian, English, French, Spanish, Arabic, Portuguese, German, Italian. Additional support for 20 more languages is planned by January.
Note: If an unsupported language is specified, the API will return an error message indicating an invalid language choice. Keep updated with our API documentation for new language additions.
Who We Are
Extracta.ai, a pioneer in document data extraction, specializes in transforming a variety of documents into structured data. Serving multiple industries, we enable businesses to automate data processing, increase operational efficiency, and minimize manual intervention. Our expertise extends to extracting both structured and unstructured data from diverse document types.
What We Do
Our specialized product, the Invoice Data Extraction API, is designed to parse detailed information from various formats including PDF, Word, TXT, as well as scanned images in PNG and JPG formats, employing OCR technology where necessary.
Key Features:
Our Mission
Extracta.ai is committed to transforming the way invoice data extraction is perceived and executed, aiming to simplify, accelerate, and enhance accuracy within financial and accounting practices. Innovation and adaptation to our clients’ evolving needs are at the core of our mission.
Our Vision
We see Extracta.ai as a pivotal force in enhancing financial and operational efficiencies, making the extraction of invoice data a catalyst for business development and process optimization. Our goal is to create solutions that are not only robust and effective but also intuitive and centered around the user experience, establishing ourselves as pioneers in the invoice data extraction sector.
Learn more about our Invoice Data Extraction API or get in touch with us directly at www.extracta.ai.
We appreciate your interest in Extracta.ai for your invoice data extraction needs.