This section provides guidelines for structuring your API requests to Extracta.ai. Ensure to follow the format below for successful data extraction:
{
"extractionDetails": {
"name": "Extraction Name", // required - Name your extraction process
"language": "Supported Language", // required - Choose from the supported languages
"fields": [
{
"key": "Field Key", // required - Define the key for data extraction
"description": "Field Description", // optional - Describe the field
"example": "Field Example" // optional - Provide an example value
},
...
]
},
"file": "base64String or file URL" // required - Provide the document in base64String format or as a URL
}
In addition to the basic format outlined in the previous sections, Extracta.ai also supports more complex data structures for specialized extraction needs. This advanced format allows the definition of nested objects and arrays, catering to a broader range of data representation.
object
The object type represents a structured object with multiple properties. Each property is defined as an object within an array, and can include its own key, description, type, and example.
{
"key": "personal_info",
"description": "Personal information of the person", // optional
"type": "object",
"properties": [
{
"key": "name",
"description": "Name of the person", // optional
"example": "Alex Smith", // optional
"type": "string" // optional
},
{
"key": "email",
"description": "Email of the person",
"example": "alex.smith@gmail.com",
"type": "string"
},
.....
]
}
array
The array type is used for lists of items, such as a collection of work experiences. The items key contains an object defining the structure of each item in the array.
{
"key": "work_experience",
"description": "Work experience of the person", // optional
"type": "array",
"items": {
"type": "object",
"properties": [
{
"key": "title",
"description": "Title of the job", // optional
"example": "Software Engineer", // optional
"type": "string" // optional
},
{
"key": "start_date",
"description": "Start date of the job",
"example": "2022",
"type": "string"
},
...
]
}
}
object
and array
types, the example
parameter is applicable only for their inner properties/items.type
is specified, it defaults to string
.object
and array
types, the inner fields can only be of type string
. This means that each property within an object or each item within an array should be a string type, ensuring consistency and simplicity in data representation.Extracta.ai is capable of processing documents in image (JPG, PNG), PDF, and DOCX formats. This enhancement allows for a wider range of document types to be submitted for extraction.
Extracta.ai currently supports document extraction in the following languages: Romanian, English, French, Spanish, Arabic, Portuguese, German, Italian. Additional support for 20 more languages is planned by January.
Note: If an unsupported language is specified, the API will return an error message indicating an invalid language choice. Keep updated with our API documentation for new language additions.
Who We Are
Extracta.ai is at the forefront of document data extraction technology. Our specialization lies in converting various document types into structured data, making us a vital resource across multiple industries. Our commitment is to empower businesses by automating data processing, enhancing operational efficiency, and significantly reducing the need for manual intervention. We excel in handling both structured and unstructured data from an array of document forms.
What We Do
Our flagship product, the Contract Parsing API, is ingeniously designed to automate and streamline data extraction from contracts. This robust tool is adept at parsing comprehensive information from diverse formats, including PDF, Word, TXT, and scanned documents in PNG and JPG formats, utilizing OCR technology for image-based files.
Key Features:
Our Mission
At Extracta.ai, our mission is to transform the way businesses and legal entities handle contract documentation. We strive for continuous innovation to meet the evolving needs of our clients, making contract data extraction an efficient, accurate, and trouble-free aspect of their operations.
Our Vision
We see Extracta.ai as a pivotal contributor to efficiency in legal and procurement sectors, turning contract extraction into a catalyst for business growth and streamlined workflow. Our goal is to develop technology that is not only effective but also user-friendly and intuitive, establishing ourselves as leaders in the contract data extraction arena.
Explore our Contract Parsing API or get in touch with us at www.extracta.ai.
Thank you for relying on Extracta.ai for your contract parsing and data extraction needs.