Structured Output with Pydantic

Ondine provides type-safe structured output parsing using Pydantic models. This ensures LLM responses conform to your expected schema with automatic validation.

Why Use Structured Output?

Without structured output:

# Response: "The brand is Apple and model is iPhone 15 Pro"
# Manual parsing required, error-prone, no type safety

With structured output:

# Response automatically validated and parsed to:
ProductInfo(brand="Apple", model="iPhone 15 Pro", price=999.99, condition="new")

Benefits:

  • Type safety with Pydantic validation
  • Automatic JSON parsing and error handling
  • Schema enforcement (required fields, types, constraints)
  • IDE autocomplete for response fields
  • Validation errors caught early
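
Under the hood this is plain Pydantic. As a minimal sketch (independent of Ondine's parser), here is what "parsed and validated" means for a raw LLM response:

from pydantic import BaseModel, Field, ValidationError

class Sentiment(BaseModel):
    label: str = Field(..., pattern="^(positive|negative|neutral)$")
    confidence: float = Field(..., ge=0.0, le=1.0)

raw = '{"label": "positive", "confidence": 0.93}'

try:
    s = Sentiment.model_validate_json(raw)  # parse + validate in one step
    print(s.label, s.confidence)            # typed access, IDE autocomplete
except ValidationError as e:
    print(e)                                # per-field constraint violations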

Basic Usage

1. Define Your Pydantic Model

from pydantic import BaseModel, Field

class ProductInfo(BaseModel):
    brand: str = Field(..., description="Manufacturer name")
    model: str = Field(..., description="Product model")
    price: float = Field(..., gt=0, description="Price in USD")
    condition: str = Field(..., pattern="^(new|used|refurbished)$")

2. Use with Pipeline

from ondine import PipelineBuilder
from ondine.stages.response_parser_stage import PydanticParser

pipeline = (
    PipelineBuilder.create()
    .from_csv(
        "products.csv",
        input_columns=["product_description"],
        output_columns=["brand", "model", "price", "condition"]
    )
    .with_prompt("""
        Extract product information and return JSON:
        {{
          "brand": "manufacturer name",
          "model": "product model",
          "price": 999.99,
          "condition": "new|used|refurbished"
        }}

        Description: {product_description}
    """)
    .with_llm(provider="openai", model="gpt-4o-mini", temperature=0.0)
    .with_parser(PydanticParser(ProductInfo, strict=True))
    .build()
)

result = pipeline.execute()

3. Access Validated Results

# Results are automatically validated
print(result.data)

#      brand          model   price condition
# 0    Apple  iPhone 15 Pro  999.99       new
# 1  Samsung     Galaxy S24  899.99      used

Pydantic Model Examples

Simple Model

from pydantic import BaseModel

class Sentiment(BaseModel):
    label: str  # "positive", "negative", "neutral"
    confidence: float  # 0.0 to 1.0
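
If you want those allowed values enforced rather than just noted in a comment, typing.Literal is a concise alternative:

from typing import Literal
from pydantic import BaseModel, Field

class Sentiment(BaseModel):
    label: Literal["positive", "negative", "neutral"]  # invalid labels are rejected
    confidence: float = Field(..., ge=0.0, le=1.0)     # range enforced, not just documented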

Model with Validation

from pydantic import BaseModel, Field, model_validator

class Review(BaseModel):
    rating: int = Field(..., ge=1, le=5, description="Rating from 1-5")
    sentiment: str = Field(..., pattern="^(positive|negative|neutral)$")
    summary: str = Field(..., min_length=10, max_length=200)

    @model_validator(mode="after")
    def rating_must_match_sentiment(self):
        if self.sentiment == "positive" and self.rating < 4:
            raise ValueError("Positive sentiment requires rating >= 4")
        if self.sentiment == "negative" and self.rating > 2:
            raise ValueError("Negative sentiment requires rating <= 2")
        return self
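
A quick check that the cross-field rule fires as expected:

from pydantic import ValidationError

try:
    Review(rating=2, sentiment="positive", summary="Great product, loved it!")
except ValidationError as e:
    print(e)  # reports "Positive sentiment requires rating >= 4"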

Nested Model

from pydantic import BaseModel
from typing import List

class Address(BaseModel):
    street: str
    city: str
    country: str
    postal_code: str

class Person(BaseModel):
    name: str
    age: int
    email: str
    addresses: List[Address]
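
Nested structures are validated recursively, and each level becomes a typed object (illustrative data):

raw = '''
{
  "name": "Ada Lovelace",
  "age": 36,
  "email": "ada@example.com",
  "addresses": [
    {"street": "12 St James Square", "city": "London",
     "country": "UK", "postal_code": "SW1Y 4JH"}
  ]
}
'''

person = Person.model_validate_json(raw)
print(person.addresses[0].city)  # "London" -- an Address instance, not a dict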

Model with Optional Fields

from pydantic import BaseModel
from typing import Optional

class Product(BaseModel):
    name: str
    brand: str
    price: float
    description: Optional[str] = None  # Optional field
    sku: Optional[str] = None
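
When the LLM omits an optional field, validation still succeeds and the field defaults to None:

p = Product.model_validate({"name": "Widget", "brand": "Acme", "price": 9.99})
print(p.description)  # None -- optional fields may be absent from the response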

Model with Enums

from pydantic import BaseModel
from enum import Enum

class Category(str, Enum):
    ELECTRONICS = "electronics"
    CLOTHING = "clothing"
    FOOD = "food"
    BOOKS = "books"

class Item(BaseModel):
    name: str
    category: Category
    price: float
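
Because Category mixes in str, the plain string in the LLM's JSON is coerced to an enum member during validation:

item = Item.model_validate({"name": "Dune", "category": "books", "price": 12.50})
print(item.category is Category.BOOKS)  # True -- string coerced to the enum
print(item.category.value)              # "books"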

Complete Example

from pydantic import BaseModel, Field
from ondine import PipelineBuilder
from ondine.stages.response_parser_stage import PydanticParser
import pandas as pd

# Define schema
class EmailClassification(BaseModel):
    category: str = Field(..., pattern="^(spam|important|promotional|personal)$")
    confidence: float = Field(..., ge=0.0, le=1.0)
    priority: int = Field(..., ge=1, le=5)
    action: str = Field(..., pattern="^(archive|flag|delete|respond)$")

# Sample data
data = pd.DataFrame({
    "email": [
        "URGENT: You won $1,000,000! Click here now!",
        "Meeting tomorrow at 2pm with the CEO",
        "50% off sale this weekend only!"
    ]
})

# Build pipeline
pipeline = (
    PipelineBuilder.create()
    .from_dataframe(
        data,
        input_columns=["email"],
        output_columns=["category", "confidence", "priority", "action"]
    )
    .with_prompt("""
        Classify this email and return JSON:
        {{
          "category": "spam|important|promotional|personal",
          "confidence": 0.0-1.0,
          "priority": 1-5,
          "action": "archive|flag|delete|respond"
        }}

        Email: {email}
    """)
    .with_llm(provider="openai", model="gpt-4o-mini", temperature=0.0)
    .with_parser(PydanticParser(EmailClassification, strict=True))
    .build()
)

# Execute with type-safe validation
result = pipeline.execute()
print(result.data)

Output:

                                         email     category  confidence  priority   action
0  URGENT: You won $1,000,000! Click here now!         spam        0.98         5   delete
1         Meeting tomorrow at 2pm with the CEO    important        0.95         1     flag
2              50% off sale this weekend only!  promotional        0.92         3  archive

Strict vs Non-Strict Mode

Strict Mode

.with_parser(PydanticParser(ProductInfo, strict=True))
  • Validation errors stop processing
  • Failed rows are retried (if retry policy configured)
  • Guarantees all results match schema

Non-Strict Mode

.with_parser(PydanticParser(ProductInfo, strict=False))
  • Validation errors logged but processing continues
  • Invalid rows get None values
  • Useful for exploratory analysis

Handling Validation Errors

With Retries

pipeline = (
    PipelineBuilder.create()
    ...
    .with_parser(PydanticParser(ProductInfo, strict=True))
    .with_retry_policy(max_retries=3)  # Retry validation failures
    .build()
)

result = pipeline.execute()

# Check for failed validations
if result.metrics.failed_rows > 0:
    print(f"Failed to validate {result.metrics.failed_rows} rows")
    failed = result.data[result.data['brand'].isna()]
    print(failed)

Custom Error Handling

from pydantic import ValidationError

try:
    result = pipeline.execute()
except ValidationError as e:
    print(f"Validation error: {e}")
    # Handle validation failures

Prompt Engineering for Structured Output

Best Practices

  1. Show example JSON in prompt:

    prompt = """
    Extract product info as JSON:
    {{
      "brand": "Apple",
      "model": "iPhone 15",
      "price": 999.99
    }}
    
    Description: {description}
    """
    

  2. Specify field constraints:

    prompt = """
    Analyze sentiment and return JSON:
    {{
      "label": "positive|negative|neutral",
      "confidence": 0.0-1.0,
      "keywords": ["word1", "word2"]
    }}
    
    Text: {text}
    """
    

  3. Use low temperature:

    .with_llm(provider="openai", model="gpt-4o-mini", temperature=0.0)
    

  4. Include field descriptions (see the schema sketch after this list):

    class Product(BaseModel):
        brand: str = Field(..., description="Manufacturer name (e.g., Apple, Samsung)")
        price: float = Field(..., description="Price in USD, numeric only")
    
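Those descriptions can be put to work: one way to keep the prompt and the model in sync is to render the model's generated JSON schema directly into the prompt. A sketch, assuming Ondine's format-style templates (note the {{ }} escaping shown earlier):

import json

# model_json_schema() includes each Field's description
schema = json.dumps(Product.model_json_schema(), indent=2)

# Escape literal braces so the template engine doesn't read them as fields
schema = schema.replace("{", "{{").replace("}", "}}")

prompt = f"""
Extract product info as JSON matching this schema:
{schema}

Description: {{description}}
"""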

JSON vs Pydantic Parser

JSON Parser (Simple)

from ondine.stages.parser_factory import JSONParser

# Just parses JSON, no validation
.with_parser(JSONParser())

Use when:

  • Schema is simple and flexible
  • You don't need type validation
  • Rapid prototyping

Pydantic Parser (Type-Safe)

from ondine.stages.response_parser_stage import PydanticParser

# Parses AND validates against schema
.with_parser(PydanticParser(MyModel, strict=True))

Use when:

  • You need type safety and validation
  • Schema has constraints (ranges, patterns)
  • Production applications
  • API responses

Advanced Patterns

Multiple Models

For different output types:

from typing import List, Union
from pydantic import BaseModel, Field

class ShortSummary(BaseModel):
    summary: str = Field(..., max_length=100)

class LongSummary(BaseModel):
    summary: str = Field(..., max_length=500)
    key_points: List[str]

# Use Union types
class SummaryResponse(BaseModel):
    content: Union[ShortSummary, LongSummary]
    type: str
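
If the response carries a tag saying which shape to expect, a discriminated union gives unambiguous matching and clearer validation errors. A sketch using Pydantic v2 discriminators; the Literal type tags are an assumption, not an Ondine convention:

from typing import List, Literal, Union
from pydantic import BaseModel, Field

class ShortSummary(BaseModel):
    type: Literal["short"]
    summary: str = Field(..., max_length=100)

class LongSummary(BaseModel):
    type: Literal["long"]
    summary: str = Field(..., max_length=500)
    key_points: List[str]

class SummaryResponse(BaseModel):
    # Pydantic selects the branch by inspecting the "type" tag
    content: Union[ShortSummary, LongSummary] = Field(discriminator="type")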

Post-Validation Processing

from pydantic import BaseModel, field_validator

class Price(BaseModel):
    amount: float
    currency: str = "USD"

    @field_validator("amount")
    @classmethod
    def round_price(cls, v: float) -> float:
        return round(v, 2)

    @property
    def formatted(self) -> str:
        return f"${self.amount:.2f}"
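
For example:

price = Price(amount=19.999)
print(price.amount)     # 20.0 -- rounded by the validator
print(price.formatted)  # "$20.00"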

Dynamic Schema

For runtime schema definition:

from pydantic import create_model

# Create model dynamically
fields = {
    "name": (str, ...),
    "age": (int, ...),
    "email": (str, ...)
}

DynamicModel = create_model("DynamicModel", **fields)

pipeline = (
    PipelineBuilder.create()
    ...
    .with_parser(PydanticParser(DynamicModel))
    .build()
)
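
Since output_columns must mirror the model's fields, they can be derived from the dynamic model rather than typed twice (model_fields is Pydantic v2's field registry):

output_columns = list(DynamicModel.model_fields)  # ["name", "age", "email"]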

Performance Considerations

Validation Overhead

Pydantic v2 validation typically costs on the order of microseconds per row, which is negligible next to LLM latency.

# Illustrative, for 10K rows:
# - Without validation: ~120s (dominated by LLM calls)
# - With Pydantic:      ~120.05s (~0.04% overhead)

Complex Models

Deeply nested models increase validation time:

from typing import Dict, List

# Simple, flat model: fastest to validate
class Simple(BaseModel):
    name: str
    value: float

# Deeply nested model: several times slower per row
class Complex(BaseModel):
    data: List[Dict[str, List[SubModel]]]  # assumes SubModel is defined elsewhere

Tip: Keep models as flat as possible for best performance.
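
To measure the overhead for your own models, a quick micro-benchmark (numbers vary by machine and model complexity):

import time
from pydantic import BaseModel

class Simple(BaseModel):
    name: str
    value: float

payload = {"name": "widget", "value": 1.5}

start = time.perf_counter()
for _ in range(10_000):
    Simple.model_validate(payload)
elapsed = time.perf_counter() - start
print(f"{elapsed / 10_000 * 1e6:.2f} µs per validation")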

Troubleshooting

Common Validation Errors

Missing required field:

ValidationError: Field required [type=missing]

Solution: List every required field in the prompt's example JSON.

Type mismatch:

ValidationError: Input should be a valid number [type=float_parsing]

Solution: Show numeric values without units or symbols in the prompt, and use temperature=0.0.

Pattern mismatch:

ValidationError: String should match pattern [type=string_pattern_mismatch]

Solution: Enumerate the valid values in the prompt example.

Debugging Tips

  1. Test with small sample first:

    df_sample = df.head(10)
    pipeline = builder.from_dataframe(df_sample, ...).build()
    

  2. Use non-strict mode for debugging:

    .with_parser(PydanticParser(Model, strict=False))
    

  3. Check raw responses:

    # Enable debug logging
    import logging
    logging.basicConfig(level=logging.DEBUG)