Run Job Test User
POST/api/v1/extractionv2/jobs/test
Run Job Test User
Request
Cookie Parameters
- application/json
Body
required
- MOD1
- MOD1
- MOD2
- MOD3
- MOD4
- MOD5
- MOD6
Array [
]
- ExtractConfig
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
job_create
object
required
Schema for creating an extraction job.
The id of the extraction agent
The id of the file
data_schema_override
object
The data schema to override the extraction agent's data schema with
anyOf
property name*
object
anyOf
object
string
integer
number
boolean
config_override
object
The config to override the extraction agent's config with
anyOf
Additional parameters for the extraction agent.
The extraction mode specified.
Possible values: [PER_DOC
, PER_PAGE
]
PER_DOC
Whether to handle missing fields in the schema.
false
system_prompt
object
The system prompt to use for the extraction.
anyOf
string
extract_settings
object
All settings for the extraction agent. Only the settings in ExtractConfig are exposed to the user.
The model to use for the extraction.
gpt-4o
The temperature to use for the extraction.
0
The maximum file size (in bytes) allowed for the document.
5242880
The maximum number of pages allowed for the document.
30
The prompt to use for the extraction.
The extracted data using the given JSON schema.
The prompt to use for error handling.
If the text does not contain enough information to comply with the schema, explain the reason. Else, output null and fill out the 'extracted' field.
llama_parse_params
object
Settings that can be configured for how to use LlamaParse to parse files within a LlamaCloud pipeline.
Possible values: [af
, az
, bs
, cs
, cy
, da
, de
, en
, es
, et
, fr
, ga
, hr
, hu
, id
, is
, it
, ku
, la
, lt
, lv
, mi
, ms
, mt
, nl
, no
, oc
, pi
, pl
, pt
, ro
, rs_latin
, sk
, sl
, sq
, sv
, sw
, tl
, tr
, uz
, vi
, ar
, fa
, ug
, ur
, bn
, as
, mni
, ru
, rs_cyrillic
, be
, bg
, uk
, mn
, abq
, ady
, kbd
, ava
, dar
, inh
, che
, lbe
, lez
, tab
, tjk
, hi
, mr
, ne
, bh
, mai
, ang
, bho
, mah
, sck
, new
, gom
, sa
, bgc
, th
, ch_sim
, ch_tra
, ja
, ko
, ta
, te
, kn
], >= 1
false
false
false
false
false
false
false
false
false
false
false
false
false
false
false
false
page_separator
object
anyOf
string
bbox_top
object
anyOf
number
bbox_right
object
anyOf
number
bbox_bottom
object
anyOf
number
bbox_left
object
anyOf
number
false
false
true
false
false
project_id
object
anyOf
string
azure_openai_deployment_name
object
anyOf
string
azure_openai_endpoint
object
anyOf
string
azure_openai_api_version
object
anyOf
string
azure_openai_key
object
anyOf
string
input_url
object
anyOf
string
http_proxy
object
anyOf
string
false
auto_mode_trigger_on_regexp_in_page
object
anyOf
string
auto_mode_trigger_on_text_in_page
object
anyOf
string
false
false
false
structured_output_json_schema
object
anyOf
string
structured_output_json_schema_name
object
anyOf
string
max_pages
object
anyOf
integer
max_pages_enforced
object
anyOf
integer
false
formatting_instruction
object
anyOf
string
complemental_formatting_instruction
object
anyOf
string
content_guideline_instruction
object
anyOf
string
false
job_timeout_in_seconds
object
anyOf
number
job_timeout_extra_time_per_page_in_seconds
object
anyOf
number
false
false
false
false
Responses
- 200
- 422
Successful Response
- application/json
- Schema
- Example (from schema)
Schema
- MOD1
- MOD2
- MOD3
- MOD4
- MOD5
- MOD6
Array [
]
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD1
- MOD2
- MOD3
- MOD4
- MOD5
- MOD6
Array [
]
- MOD1
- MOD1
- MOD2
- MOD3
- MOD4
- MOD5
- MOD6
Array [
]
- MOD1
The id of the extraction job
extraction_agent
object
required
Schema and configuration for creating an extraction agent.
The id of the extraction agent.
The name of the extraction agent.
The ID of the project that the extraction agent belongs to.
data_schema
object
required
The schema of the data.
property name*
object
anyOf
object
string
integer
number
boolean
config
object
required
The configuration parameters for the extraction agent.
The extraction mode specified.
Possible values: [PER_DOC
, PER_PAGE
]
PER_DOC
Whether to handle missing fields in the schema.
false
system_prompt
object
The system prompt to use for the extraction.
anyOf
string
created_at
object
The creation time of the extraction agent.
anyOf
string
updated_at
object
The last update time of the extraction agent.
anyOf
string
Enum for representing the status of a job
Possible values: [PENDING
, SUCCESS
, ERROR
, PARTIAL_SUCCESS
, CANCELLED
]
error
object
The error that occurred during extraction
anyOf
string
file
object
required
Schema for a file.
Unique identifier
created_at
object
Creation datetime
anyOf
string
updated_at
object
Update datetime
anyOf
string
Possible values: non-empty
and <= 3000 characters
The ID of the file in the external system
file_size
object
Size of the file in bytes
anyOf
integer
file_type
object
File type (e.g. pdf, docx, etc.)
anyOf
string
Possible values: non-empty
and <= 3000 characters
The ID of the project that the file belongs to
last_modified_at
object
The last modified time of the file
anyOf
string
resource_info
object
Resource information for the file
anyOf
property name*
object
anyOf
object
string
integer
number
boolean
permission_info
object
Permission information for the file
anyOf
property name*
object
anyOf
object
string
integer
number
boolean
data_source_id
object
The ID of the data source that the file belongs to
anyOf
string
{
"id": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
"extraction_agent": {
"id": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
"name": "string",
"project_id": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
"data_schema": {},
"config": {
"extraction_mode": "PER_DOC",
"handle_missing": false,
"system_prompt": "string"
},
"created_at": "2024-07-29T15:51:28.071Z",
"updated_at": "2024-07-29T15:51:28.071Z"
},
"status": "PENDING",
"error": "string",
"file": {
"id": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
"created_at": "2024-07-29T15:51:28.071Z",
"updated_at": "2024-07-29T15:51:28.071Z",
"name": "string",
"external_file_id": "string",
"file_size": 0,
"file_type": "string",
"project_id": "3fa85f64-5717-4562-b3fc-2c963f66afa6",
"last_modified_at": "2024-07-29T15:51:28.071Z",
"resource_info": {},
"permission_info": {},
"data_source_id": "3fa85f64-5717-4562-b3fc-2c963f66afa6"
}
}
Validation Error
- application/json
- Schema
- Example (from schema)
Schema
Array [
Array [
- MOD1
- MOD2
]
]
detail
object[]
loc
object[]
required
anyOf
string
integer
{
"detail": [
{
"loc": [
"string",
0
],
"msg": "string",
"type": "string"
}
]
}