Extensions

LlamaExtract offers several advanced features that provide additional metadata and insights alongside your extracted data. These extensions are available under Advanced Settings in the UI and return schema-level metadata in the extraction_metadata field of the response.

Citations

Citations provide the source information for every extracted field, allowing you to trace back exactly where each piece of data came from in the original document.

How it works: For every leaf-level field in your schema, citations return:

The page number where the information was found
The verbatim text that was used to extract the field value

Usage: Use the ExtractConfig.cite_sources argument from the SDK to enable this feature.

Use cases:

Compliance and audit requirements
Fact-checking and verification workflows
Understanding extraction quality and accuracy

Limitations:

Only available for MULTIMODAL and PREMIUM extraction modes

Reasoning

Reasoning provides explanations for the extracted values, helping you understand the logic behind each extraction decision.

How it works: For every top-level field in your schema, reasoning returns:

A brief explanation for the extracted value based on the provided text
An error message if the text doesn't contain enough information to extract the field

Usage: Use the ExtractConfig.use_reasoning argument from the SDK to enable this feature.

Use cases:

Debugging extraction results
Understanding model decision-making
Improving schema design based on extraction logic

Limitations:

Only available for BALANCED, MULTIMODAL and PREMIUM extraction modes

Confidence Scores (Beta)

Confidence scores provide quantitative measures of how confident the system is in the extracted values, helping you identify potentially unreliable extractions.

How it works: This feature adds three confidence-related fields to the extraction metadata:

parsing_confidence: Confidence score indicating how well the relevant context was parsed from the source document. Only available for Multimodal extraction mode.
extraction_confidence: Confidence score indicating the relevance of the extraction based on the JSON schema field.
confidence: Combined confidence score that incorporates both parsing and extraction confidence.

Usage: Use the ExtractConfig.confidence_scores argument from the SDK to enable confidence scores.

⚠️ Important: Scores Are Uncalibrated. Critical understanding for proper usage:

Relative scale matters, not absolute values: The confidence scores are not calibrated to real-world accuracy percentages. A score of 0.6 doesn't mean "60% accurate" - it could indicate the model is hallucinating entirely.
Use for comparison, not thresholds: Focus on relative differences between scores rather than absolute values. A field with a score of 0.9 is more reliable than one with 0.6, but neither score directly translates to accuracy.
Longer text fields score lower: Summaries, descriptions, and other lengthy text fields will typically have lower confidence scores on average. This doesn't indicate lower accuracy - it reflects that there are many valid ways to construct longer text, making the model naturally less "confident" about any specific phrasing.
Threshold determination is use-case specific: The confidence score threshold for triggering human review must be determined through testing with your specific documents and use cases. What works for financial data extraction may not work for legal document processing.
Beta feature subject to change: This is an experimental feature. We may modify the computation method as we gather more data, including potentially adding proper calibration in future releases.

Limitations:

Currently has a 100-page size limit
Only available for MULTIMODAL and PREMIUM extraction modes

Use cases:

Quality assurance workflows (with properly tuned thresholds)
Relative ranking of extraction reliability across fields
Identifying documents that may need manual review (after threshold validation)

Performance Considerations

⚠️ Important: Citations and confidence scores will significantly slow down extraction processing time. Enable these features only when the additional metadata is essential for your use case.

Configuration and Usage

For complete examples of how to configure and use these extensions with both the Python SDK and REST API, see the Configuration Options page.

The configuration section includes:

Complete Python SDK examples with extension settings
REST API curl command examples
Configuration reference table with all available options

Quick reference for extensions:

from llama_cloud import ExtractConfig, ExtractMode

config = ExtractConfig(
    cite_sources=True,        # Enable citations  
    use_reasoning=True,       # Enable reasoning
    confidence_scores=True,   # Enable confidence scores (MULTIMODAL/PREMIUM only)
    extraction_mode=ExtractMode.MULTIMODAL  # Required for confidence scores
)

Citations​

Reasoning​

Confidence Scores (Beta)​

Performance Considerations​

Configuration and Usage​

Citations

Reasoning

Confidence Scores (Beta)

Performance Considerations

Configuration and Usage