Pandas Extension¶
openaivec.pandas_ext ¶
Pandas Series / DataFrame extension for OpenAI.
Setup¶
from openai import OpenAI, AzureOpenAI, AsyncOpenAI, AsyncAzureOpenAI
from openaivec import pandas_ext
# Option 1: Use environment variables (automatic detection)
# Set OPENAI_API_KEY or Azure OpenAI environment variables
# (AZURE_OPENAI_API_KEY, AZURE_OPENAI_BASE_URL, AZURE_OPENAI_API_VERSION)
# No explicit setup needed - clients are automatically created
# Option 2: Use an existing OpenAI client instance
client = OpenAI(api_key="your-api-key")
pandas_ext.use(client)
# Option 3: Use an existing Azure OpenAI client instance
azure_client = AzureOpenAI(
api_key="your-azure-key",
base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
api_version="preview"
)
pandas_ext.use(azure_client)
# Option 4: Use async Azure OpenAI client instance
async_azure_client = AsyncAzureOpenAI(
api_key="your-azure-key",
base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
api_version="preview"
)
pandas_ext.use_async(async_azure_client)
# Set up model names (optional, defaults shown)
pandas_ext.responses_model("gpt-4.1-mini")
pandas_ext.embeddings_model("text-embedding-3-small")
This module provides `.ai` and `.aio` accessors for pandas Series and DataFrames to easily interact with OpenAI APIs for tasks like generating responses or embeddings.
Classes¶
OpenAIVecSeriesAccessor ¶
pandas Series accessor (`.ai`) that adds OpenAI helpers.
Source code in src/openaivec/pandas_ext.py
Functions¶
responses_with_cache ¶
responses_with_cache(
instructions: str,
cache: BatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] = str,
**api_kwargs,
) -> pd.Series
Call an LLM once for every Series element using a provided cache.
This is a lower-level method that allows explicit cache management for advanced use cases. Most users should use the standard `responses` method instead.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt prepended to every user message. | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Explicit cache instance for batching and deduplication control. | required |
`response_format` | `type[ResponseFormat]` | Pydantic model or built-in type the assistant should return. Defaults to `str`. | `str` |
`**api_kwargs` | | Arbitrary OpenAI Responses API parameters (e.g. `temperature`, `top_p`) forwarded to the underlying client. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of `response_format`. |
Source code in src/openaivec/pandas_ext.py
responses ¶
responses(
instructions: str,
response_format: type[ResponseFormat] = str,
batch_size: int | None = None,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Call an LLM once for every Series element.
Example
animals = pd.Series(["cat", "dog", "elephant"])
# Basic usage
animals.ai.responses("translate to French")
# With progress bar in Jupyter notebooks
large_series = pd.Series(["data"] * 1000)
large_series.ai.responses(
"analyze this data",
batch_size=32,
show_progress=True
)
# With custom temperature
animals.ai.responses(
"translate creatively",
temperature=0.8
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt prepended to every user message. | required |
`response_format` | `type[ResponseFormat]` | Pydantic model or built-in type the assistant should return. Defaults to `str`. | `str` |
`batch_size` | `int \| None` | Number of prompts grouped into a single request. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of `response_format`. |
Source code in src/openaivec/pandas_ext.py
embeddings_with_cache ¶
Compute OpenAI embeddings for every Series element using a provided cache.
This method allows external control over caching behavior by accepting a pre-configured BatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management.
Example
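A minimal sketch mirroring the async variant later in this page; it assumes `BatchingMapProxy` is importable from `openaivec._proxy`:
import numpy as np
import pandas as pd
from openaivec._proxy import BatchingMapProxy
# Create a shared cache with a custom batch size
shared_cache = BatchingMapProxy[str, np.ndarray](batch_size=64)
animals = pd.Series(["cat", "dog", "elephant"])
embeddings = animals.ai.embeddings_with_cache(cache=shared_cache)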
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`cache` | `BatchingMapProxy[str, ndarray]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`**api_kwargs` | | Additional keyword arguments to pass to the OpenAI API. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are `numpy.ndarray` embeddings. |
Source code in src/openaivec/pandas_ext.py
embeddings ¶
embeddings(
batch_size: int | None = None,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Compute OpenAI embeddings for every Series element.
Example
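A minimal sketch; `dimensions` is a standard OpenAI embeddings parameter for text-embedding-3 models:
import pandas as pd
animals = pd.Series(["cat", "dog", "elephant"])
# Basic usage
embeddings = animals.ai.embeddings()
# With reduced dimensionality and a progress bar
small = animals.ai.embeddings(dimensions=256, show_progress=True)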
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`batch_size` | `int \| None` | Number of inputs grouped into a single request. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (e.g., dimensions for text-embedding-3 models). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are `numpy.ndarray` embeddings. |
Source code in src/openaivec/pandas_ext.py
task_with_cache ¶
task_with_cache(
task: PreparedTask[ResponseFormat],
cache: BatchingMapProxy[str, ResponseFormat],
) -> pd.Series
Execute a prepared task on every Series element using a provided cache.
This mirrors `responses_with_cache` but uses the task's stored instructions, response format, and API parameters. A supplied `BatchingMapProxy` enables cross-operation deduplicated reuse and external batch size / progress control.
Example
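A minimal sketch adapted from the async variant later in this page; the prepared task itself is a placeholder:
import pandas as pd
from openaivec._model import PreparedTask
from openaivec._proxy import BatchingMapProxy
# Shared cache enables deduplication across operations
shared_cache = BatchingMapProxy(batch_size=64)
sentiment_task = PreparedTask(...)  # assume a prepared sentiment task
reviews = pd.Series(["Great product!", "Not satisfied"])
results = reviews.ai.task_with_cache(sentiment_task, cache=shared_cache)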
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | Prepared task (instructions + response_format + sampling params). | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance. | required |
Note
The task's stored API parameters are used. Core routing keys (`model`, system instructions, user input) are managed internally and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Task results aligned with the original Series index. |
Source code in src/openaivec/pandas_ext.py
task ¶
task(
task: PreparedTask,
batch_size: int | None = None,
show_progress: bool = False,
) -> pd.Series
Execute a prepared task on every Series element.
Example
from openaivec._model import PreparedTask
# Assume you have a prepared task for sentiment analysis
sentiment_task = PreparedTask(...)
reviews = pd.Series(["Great product!", "Not satisfied", "Amazing quality"])
# Basic usage
results = reviews.ai.task(sentiment_task)
# With progress bar for large datasets
large_reviews = pd.Series(["review text"] * 2000)
results = large_reviews.ai.task(
sentiment_task,
batch_size=50,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | A pre-configured task containing instructions, response format, and other parameters for processing the inputs. | required |
`batch_size` | `int \| None` | Number of prompts grouped into a single request to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Note
The task's stored API parameters are used. Core batching / routing keys (`model`, `instructions` / system message, user `input`) are managed by the library and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of the task's response format. |
Source code in src/openaivec/pandas_ext.py
parse_with_cache ¶
parse_with_cache(
instructions: str,
cache: BatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
**api_kwargs,
) -> pd.Series
Parse Series values using an LLM with a provided cache.
This method allows external control over caching behavior while parsing Series content into structured data. If no response format is provided, the method automatically infers an appropriate schema by analyzing the data patterns.
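Example
A minimal sketch, assuming `BatchingMapProxy` from `openaivec._proxy` and a hypothetical `Contact` model:
import pandas as pd
from pydantic import BaseModel
from openaivec._proxy import BatchingMapProxy
class Contact(BaseModel):
    name: str
    email: str
cache = BatchingMapProxy(batch_size=32)
texts = pd.Series(["Reach Alice at alice@example.com"])
parsed = texts.ai.parse_with_cache(
    "Extract the contact name and email address",
    cache=cache,
    response_format=Contact,
)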
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of what information to extract (e.g., "Extract customer information including name and contact details"). This guides both the extraction process and schema inference. | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure for the parsed data. Can be a Pydantic model class, built-in type (str, int, float, bool, list, dict), or None. If None, the method infers an appropriate schema based on the instructions and data. Defaults to None. | `None` |
`max_examples` | `int` | Maximum number of Series values to analyze when inferring the schema. Only used when response_format is None. Defaults to 100. | `100` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, frequency_penalty, presence_penalty, seed, etc.) forwarded to the underlying API calls. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series containing parsed structured data. Each value is an instance of the specified response_format or the inferred schema model, aligned with the original Series index. |
Source code in src/openaivec/pandas_ext.py
parse ¶
parse(
instructions: str,
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
batch_size: int | None = None,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Parse Series values into structured data using an LLM.
This method extracts structured information from unstructured text in the Series. When no response format is provided, it automatically infers an appropriate schema by analyzing patterns in the data.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of what information to extract (e.g., "Extract product details including price, category, and availability"). This guides both the extraction process and schema inference. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure for the parsed data. Can be a Pydantic model class, built-in type (str, int, float, bool, list, dict), or None. If None, automatically infers a schema. Defaults to None. | `None` |
`max_examples` | `int` | Maximum number of Series values to analyze when inferring schema. Only used when response_format is None. Defaults to 100. | `100` |
`batch_size` | `int \| None` | Number of requests to process per batch. None enables automatic optimization. Defaults to None. | `None` |
`show_progress` | `bool` | Display progress bar in Jupyter notebooks. Defaults to False. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, frequency_penalty, presence_penalty, seed, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series containing parsed structured data as instances of response_format or the inferred schema model. |
Example
# With explicit schema
from pydantic import BaseModel
class Product(BaseModel):
name: str
price: float
in_stock: bool
descriptions = pd.Series([
"iPhone 15 Pro - $999, available now",
"Samsung Galaxy S24 - $899, out of stock"
])
products = descriptions.ai.parse(
"Extract product information",
response_format=Product
)
# With automatic schema inference
reviews = pd.Series([
"Great product! 5 stars. Fast shipping.",
"Poor quality. 2 stars. Slow delivery."
])
parsed = reviews.ai.parse(
"Extract review rating and shipping feedback"
)
Source code in src/openaivec/pandas_ext.py
infer_schema ¶
Infer a structured data schema from Series content using AI.
This method analyzes a sample of Series values to automatically generate a Pydantic model that captures the relevant information structure. The inferred schema supports both flat and hierarchical (nested) structures, making it suitable for complex data extraction tasks.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of the extraction goal (e.g., "Extract customer information for CRM system", "Parse event details for calendar integration"). This guides which fields to include and their purpose. | required |
`max_examples` | `int` | Maximum number of Series values to analyze for pattern detection. The method samples randomly up to this limit. Higher values may improve schema quality but increase inference time. Defaults to 100. | `100` |
`**api_kwargs` | | Additional OpenAI API parameters for fine-tuning the inference process. | `{}` |
Returns:
Name | Type | Description |
---|---|---|
`InferredSchema` | `InferredSchema` | A comprehensive schema object containing: instructions (refined extraction objective statement); fields (hierarchical field specifications with names, types, descriptions, and nested structures where applicable); inference_prompt (optimized prompt for consistent extraction); model (dynamically generated Pydantic model class supporting both flat and nested structures); task (PreparedTask configured for batch extraction using the inferred schema). |
Example
# Simple flat structure
reviews = pd.Series([
"5 stars! Great product, fast shipping to NYC.",
"2 stars. Product broke, slow delivery to LA."
])
schema = reviews.ai.infer_schema(
"Extract review ratings and shipping information"
)
# Hierarchical structure
orders = pd.Series([
"Order #123: John Doe, 123 Main St, NYC. Items: iPhone ($999), Case ($29)",
"Order #456: Jane Smith, 456 Oak Ave, LA. Items: iPad ($799)"
])
schema = orders.ai.infer_schema(
"Extract order details including customer and items"
)
# Inferred schema may include nested structures like:
# - customer: {name: str, address: str, city: str}
# - items: [{product: str, price: float}]
# Apply the schema for extraction
extracted = orders.ai.task(schema.task)
Note
The inference process uses multiple AI iterations to ensure schema validity. Nested structures are automatically detected when the data contains hierarchical relationships. The generated Pydantic model ensures type safety and validation for all extracted data.
Source code in src/openaivec/pandas_ext.py
count_tokens ¶
Count `tiktoken` tokens per row.
Example
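A minimal sketch; the counts depend on the model configured via `responses_model`:
import pandas as pd
s = pd.Series(["hello world", "a longer sentence to tokenize"])
token_counts = s.ai.count_tokens()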
This method uses the `tiktoken` library to count tokens based on the model name set by `responses_model`.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Token counts for each element. |
Source code in src/openaivec/pandas_ext.py
extract ¶
Expand a Series of Pydantic models/dicts into columns.
Example
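A minimal sketch with a hypothetical `Animal` model; the exact prefixed column names follow the Series-name convention described below:
import pandas as pd
from pydantic import BaseModel
class Animal(BaseModel):
    name: str
    legs: int
s = pd.Series([Animal(name="cat", legs=4)], name="animal")
df = s.ai.extract()  # columns prefixed with the Series name, e.g. "animal_name" (separator assumed)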
This method returns a DataFrame with the same index as the Series, where each column corresponds to a key in the dictionaries. If the Series has a name, extracted columns are prefixed with it.
Returns:
Type | Description |
---|---|
`DataFrame` | pandas.DataFrame: Expanded representation. |
Source code in src/openaivec/pandas_ext.py
OpenAIVecDataFrameAccessor ¶
pandas DataFrame accessor (`.ai`) that adds OpenAI helpers.
Source code in src/openaivec/pandas_ext.py
Functions¶
responses_with_cache ¶
responses_with_cache(
instructions: str,
cache: BatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] = str,
**api_kwargs,
) -> pd.Series
Generate a response for each row after serializing it to JSON using a provided cache.
This method allows external control over caching behavior by accepting a pre-configured BatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management.
Example
from openaivec._proxy import BatchingMapProxy
# Create a shared cache with custom batch size
shared_cache = BatchingMapProxy(batch_size=64)
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
result = df.ai.responses_with_cache(
"what is the animal's name?",
cache=shared_cache
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt for the assistant. | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`response_format` | `type[ResponseFormat]` | Desired Python type of the responses. Defaults to `str`. | `str` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Responses aligned with the DataFrame's original index. |
Source code in src/openaivec/pandas_ext.py
responses ¶
responses(
instructions: str,
response_format: type[ResponseFormat] = str,
batch_size: int | None = None,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Generate a response for each row after serializing it to JSON.
Example
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
# Basic usage
df.ai.responses("what is the animal's name?")
# With progress bar for large datasets
large_df = pd.DataFrame({"id": list(range(1000))})
large_df.ai.responses(
"generate a name for this ID",
batch_size=20,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt for the assistant. | required |
`response_format` | `type[ResponseFormat]` | Desired Python type of the responses. Defaults to `str`. | `str` |
`batch_size` | `int \| None` | Number of requests sent in one batch. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Responses aligned with the DataFrame's original index. |
Source code in src/openaivec/pandas_ext.py
task_with_cache ¶
task_with_cache(
task: PreparedTask[ResponseFormat],
cache: BatchingMapProxy[str, ResponseFormat],
) -> pd.Series
Execute a prepared task on each DataFrame row after serializing it to JSON using a provided cache.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | Prepared task (instructions + response_format + sampling params). | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance. | required |
Note
The task's stored API parameters are used. Core routing keys are managed internally.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Task results aligned with the DataFrame's original index. |
Source code in src/openaivec/pandas_ext.py
task ¶
task(
task: PreparedTask,
batch_size: int | None = None,
show_progress: bool = False,
) -> pd.Series
Execute a prepared task on each DataFrame row after serializing it to JSON.
Example
from openaivec._model import PreparedTask
# Assume you have a prepared task for data analysis
analysis_task = PreparedTask(...)
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
# Basic usage
results = df.ai.task(analysis_task)
# With progress bar for large datasets
large_df = pd.DataFrame({"id": list(range(1000))})
results = large_df.ai.task(
analysis_task,
batch_size=50,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | A pre-configured task containing instructions, response format, and other parameters for processing the inputs. | required |
`batch_size` | `int \| None` | Number of requests sent in one batch to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Note
The task's stored API parameters are used. Core batching / routing keys (`model`, `instructions` / system message, user `input`) are managed by the library and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of the task's response format, aligned with the DataFrame's original index. |
Source code in src/openaivec/pandas_ext.py
parse_with_cache ¶
parse_with_cache(
instructions: str,
cache: BatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
**api_kwargs,
) -> pd.Series
Parse DataFrame rows into structured data using an LLM with a provided cache.
This method processes each DataFrame row (converted to JSON) and extracts structured information using an LLM. External cache control enables deduplication across operations and custom batch management.
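Example
A minimal sketch, assuming `BatchingMapProxy` from `openaivec._proxy` and automatic schema inference:
import pandas as pd
from openaivec._proxy import BatchingMapProxy
cache = BatchingMapProxy(batch_size=32)
df = pd.DataFrame({"order": ["#123 shipped to NYC", "#456 delayed in LA"]})
parsed = df.ai.parse_with_cache(
    "Extract shipping details and order status",
    cache=cache,
)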
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of what information to extract from each row (e.g., "Extract shipping details and order status"). Guides both extraction and schema inference. | required |
`cache` | `BatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` for automatic optimization. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure for parsed data. Can be a Pydantic model, built-in type, or None for automatic schema inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum rows to analyze when inferring schema (only used when response_format is None). Defaults to 100. | `100` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, frequency_penalty, presence_penalty, seed, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series containing parsed structured data as instances of response_format or the inferred schema model, indexed like the original DataFrame. |
Source code in src/openaivec/pandas_ext.py
parse ¶
parse(
instructions: str,
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
batch_size: int | None = None,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Parse DataFrame rows into structured data using an LLM.
Each row is converted to JSON and processed to extract structured information. When no response format is provided, the method automatically infers an appropriate schema from the data.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of extraction goals (e.g., "Extract transaction details including amount, date, and merchant"). Guides extraction and schema inference. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure for parsed data. Can be a Pydantic model, built-in type, or None for automatic inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum rows to analyze for schema inference (when response_format is None). Defaults to 100. | `100` |
`batch_size` | `int \| None` | Rows per API batch. None enables automatic optimization. Defaults to None. | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to False. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Parsed structured data indexed like the original DataFrame. |
Example
df = pd.DataFrame({
'log': [
'2024-01-01 10:00 ERROR Database connection failed',
'2024-01-01 10:05 INFO Service started successfully'
]
})
# With automatic schema inference
parsed = df.ai.parse("Extract timestamp, level, and message")
# Returns Series with inferred structure like:
# {timestamp: str, level: str, message: str}
Source code in src/openaivec/pandas_ext.py
infer_schema ¶
Infer a structured data schema from DataFrame rows using AI.
This method analyzes a sample of DataFrame rows to automatically infer a structured schema that can be used for consistent data extraction. Each row is converted to JSON format and analyzed to identify patterns, field types, and potential categorical values.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of how the extracted structured data will be used (e.g., "Extract operational metrics for dashboard", "Parse customer attributes for segmentation"). This guides field relevance and helps exclude irrelevant information. | required |
`max_examples` | `int` | Maximum number of rows to analyze from the DataFrame. The method will sample randomly up to this limit. Defaults to 100. | `100` |
Returns:
Name | Type | Description |
---|---|---|
`InferredSchema` | `InferredSchema` | An object containing: instructions (normalized statement of the extraction objective); fields (list of field specifications with names, types, and descriptions); inference_prompt (reusable prompt for future extractions); model (dynamically generated Pydantic model for parsing); task (PreparedTask for batch extraction operations). |
Example
df = pd.DataFrame({
'text': [
"Order #123: Shipped to NYC, arriving Tuesday",
"Order #456: Delayed due to weather, new ETA Friday",
"Order #789: Delivered to customer in LA"
],
'timestamp': ['2024-01-01', '2024-01-02', '2024-01-03']
})
# Infer schema for logistics tracking
schema = df.ai.infer_schema(
instructions="Extract shipping status and location data for logistics tracking"
)
# Apply the schema to extract structured data
extracted_df = df.ai.task(schema.task)
Note
Each row is converted to JSON before analysis. The inference process automatically detects hierarchical relationships and creates appropriate nested structures when present. The generated Pydantic model ensures type safety and validation.
Source code in src/openaivec/pandas_ext.py
extract ¶
Flatten one column of Pydantic models/dicts into top‑level columns.
Example
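A minimal sketch with a hypothetical `Animal` model stored in one column:
import pandas as pd
from pydantic import BaseModel
class Animal(BaseModel):
    name: str
    legs: int
df = pd.DataFrame({"animal": [Animal(name="cat", legs=4)]})
flat = df.ai.extract("animal")  # expands "animal" into top-level columns, drops the source column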
This method returns a DataFrame with the same index as the original, where each column corresponds to a key in the dictionaries. The source column is dropped.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`column` | `str` | Column to expand. | required |
Returns:
Type | Description |
---|---|
`DataFrame` | pandas.DataFrame: Original DataFrame with the extracted columns; the source column is dropped. |
Source code in src/openaivec/pandas_ext.py
fillna ¶
fillna(
target_column_name: str,
max_examples: int = 500,
batch_size: int | None = None,
show_progress: bool = False,
) -> pd.DataFrame
Fill missing values in a DataFrame column using AI-powered inference.
This method uses machine learning to intelligently fill missing (NaN) values in a specified column by analyzing patterns from non-missing rows in the DataFrame. It creates a prepared task that provides examples of similar rows to help the AI model predict appropriate values for the missing entries.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`target_column_name` | `str` | The name of the column containing missing values that need to be filled. | required |
`max_examples` | `int` | The maximum number of example rows to use for context when predicting missing values. Higher values may improve accuracy but increase API costs and processing time. Defaults to 500. | `500` |
`batch_size` | `int \| None` | Number of requests sent in one batch to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Returns:
Type | Description |
---|---|
`DataFrame` | pandas.DataFrame: A new DataFrame with missing values filled in the target column. The original DataFrame is not modified. |
Example
df = pd.DataFrame({
'name': ['Alice', 'Bob', None, 'David'],
'age': [25, 30, 35, None],
'city': ['Tokyo', 'Osaka', 'Kyoto', 'Tokyo']
})
# Fill missing values in the 'name' column
filled_df = df.ai.fillna('name')
# With progress bar for large datasets
large_df = pd.DataFrame({'name': [None] * 1000, 'age': list(range(1000))})
filled_df = large_df.ai.fillna('name', batch_size=32, show_progress=True)
Note
If the target column has no missing values, the original DataFrame is returned unchanged.
Source code in src/openaivec/pandas_ext.py
similarity ¶
Compute cosine similarity between two columns containing embedding vectors.
This method calculates the cosine similarity between vectors stored in two columns of the DataFrame. The vectors should be numpy arrays or array-like objects that support dot product operations.
Example
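A minimal sketch; the embedding columns are typically produced with `.ai.embeddings()`:
import pandas as pd
df = pd.DataFrame({"a": ["cat", "dog"], "b": ["kitten", "puppy"]})
df["emb_a"] = df["a"].ai.embeddings()
df["emb_b"] = df["b"].ai.embeddings()
scores = df.ai.similarity("emb_a", "emb_b")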
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`col1` | `str` | Name of the first column containing embedding vectors. | required |
`col2` | `str` | Name of the second column containing embedding vectors. | required |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series containing cosine similarity scores between corresponding vectors in col1 and col2, with values ranging from -1 to 1, where 1 indicates identical direction. |
Source code in src/openaivec/pandas_ext.py
AsyncOpenAIVecSeriesAccessor ¶
pandas Series accessor (`.aio`) that adds OpenAI helpers.
Source code in src/openaivec/pandas_ext.py
Functions¶
responses_with_cache async ¶
responses_with_cache(
instructions: str,
cache: AsyncBatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] = str,
**api_kwargs,
) -> pd.Series
Call an LLM once for every Series element using a provided cache (asynchronously).
This method allows external control over caching behavior by accepting a pre-configured AsyncBatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management. The concurrency is controlled by the cache instance itself.
Example
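A minimal sketch mirroring the DataFrame variant later in this page; it assumes `AsyncBatchingMapProxy` from `openaivec._proxy`:
import pandas as pd
from openaivec._proxy import AsyncBatchingMapProxy
shared_cache = AsyncBatchingMapProxy(batch_size=64, max_concurrency=4)
animals = pd.Series(["cat", "dog", "elephant"])
# Must be awaited
results = await animals.aio.responses_with_cache(
    "translate to French",
    cache=shared_cache,
)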
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt prepended to every user message. | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`response_format` | `type[ResponseFormat]` | Pydantic model or built-in type the assistant should return. Defaults to `str`. | `str` |
`**api_kwargs` | | Additional keyword arguments forwarded verbatim to the underlying client. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of `response_format`. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
responses async ¶
responses(
instructions: str,
response_format: type[ResponseFormat] = str,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Call an LLM once for every Series element (asynchronously).
Example
animals = pd.Series(["cat", "dog", "elephant"])
# Must be awaited
results = await animals.aio.responses("translate to French")
# With progress bar for large datasets
large_series = pd.Series(["data"] * 1000)
results = await large_series.aio.responses(
"analyze this data",
batch_size=32,
max_concurrency=4,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt prepended to every user message. | required |
`response_format` | `type[ResponseFormat]` | Pydantic model or built-in type the assistant should return. Defaults to `str`. | `str` |
`batch_size` | `int \| None` | Number of prompts grouped into a single request. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to `8`. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional keyword arguments forwarded verbatim to the underlying client. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of `response_format`. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
embeddings_with_cache async ¶
Compute OpenAI embeddings for every Series element using a provided cache (asynchronously).
This method allows external control over caching behavior by accepting a pre-configured AsyncBatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management. The concurrency is controlled by the cache instance itself.
Example
from openaivec._proxy import AsyncBatchingMapProxy
import numpy as np
# Create a shared cache with custom batch size and concurrency
shared_cache = AsyncBatchingMapProxy[str, np.ndarray](
batch_size=64, max_concurrency=4
)
animals = pd.Series(["cat", "dog", "elephant"])
# Must be awaited
embeddings = await animals.aio.embeddings_with_cache(cache=shared_cache)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`cache` | `AsyncBatchingMapProxy[str, ndarray]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`**api_kwargs` | | Additional OpenAI API parameters (e.g., dimensions for text-embedding-3 models). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are `numpy.ndarray` embeddings. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
embeddings async ¶
embeddings(
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Compute OpenAI embeddings for every Series element (asynchronously).
Example
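A minimal sketch; must be awaited:
import pandas as pd
animals = pd.Series(["cat", "dog", "elephant"])
embeddings = await animals.aio.embeddings(
    batch_size=32,
    max_concurrency=4,
    show_progress=True,
)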
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`batch_size` | `int \| None` | Number of inputs grouped into a single request. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to `8`. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (e.g., dimensions for text-embedding-3 models). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are `numpy.ndarray` embeddings. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
task_with_cache async ¶
task_with_cache(
task: PreparedTask[ResponseFormat],
cache: AsyncBatchingMapProxy[str, ResponseFormat],
) -> pd.Series
Execute a prepared task on every Series element using a provided cache (asynchronously).
This method allows external control over caching behavior by accepting a pre-configured AsyncBatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management. The concurrency is controlled by the cache instance itself.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | A pre-configured task containing instructions, response format, and other parameters for processing the inputs. | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
Example
from openaivec._model import PreparedTask
from openaivec._proxy import AsyncBatchingMapProxy
# Create a shared cache with custom batch size and concurrency
shared_cache = AsyncBatchingMapProxy(batch_size=64, max_concurrency=4)
# Assume you have a prepared task for sentiment analysis
sentiment_task = PreparedTask(...)
reviews = pd.Series(["Great product!", "Not satisfied", "Amazing quality"])
# Must be awaited
results = await reviews.aio.task_with_cache(sentiment_task, cache=shared_cache)
Additional Keyword Args
Arbitrary OpenAI Responses API parameters (e.g. `frequency_penalty`, `presence_penalty`, `seed`, etc.) are forwarded verbatim to the underlying client. Core batching / routing keys (`model`, `instructions` / system message, user `input`) are managed by the library and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of the task's response format, aligned with the original Series index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
task async ¶
task(
task: PreparedTask,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
) -> pd.Series
Execute a prepared task on every Series element (asynchronously).
Example
from openaivec._model import PreparedTask
# Assume you have a prepared task for sentiment analysis
sentiment_task = PreparedTask(...)
reviews = pd.Series(["Great product!", "Not satisfied", "Amazing quality"])
# Must be awaited
results = await reviews.aio.task(sentiment_task)
# With progress bar for large datasets
large_reviews = pd.Series(["review text"] * 2000)
results = await large_reviews.aio.task(
sentiment_task,
batch_size=50,
max_concurrency=4,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | A pre-configured task containing instructions, response format, and other parameters for processing the inputs. | required |
`batch_size` | `int \| None` | Number of prompts grouped into a single request to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to 8. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Note
The task's stored API parameters are used. Core batching / routing keys (`model`, `instructions` / system message, user `input`) are managed by the library and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of the task's response format, aligned with the original Series index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
parse_with_cache async ¶
parse_with_cache(
instructions: str,
cache: AsyncBatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
**api_kwargs,
) -> pd.Series
Parse Series values into structured data using an LLM with a provided cache (asynchronously).
This async method provides external cache control while parsing Series content into structured data. Automatic schema inference is performed when no response format is specified.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language description of what to extract (e.g., "Extract dates, amounts, and descriptions from receipts"). Guides both extraction and schema inference. | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured async cache for managing concurrent API calls and deduplication. Set `cache.batch_size=None` for automatic optimization. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure for parsed data. Can be a Pydantic model, built-in type, or None for automatic inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum values to analyze for schema inference (when response_format is None). Defaults to 100. | `100` |
`**api_kwargs` | | Additional OpenAI API parameters. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series containing parsed structured data aligned with the original index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
parse async ¶
parse(
instructions: str,
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Parse Series values into structured data using an LLM (asynchronously).
Async version of the parse method, extracting structured information from unstructured text with automatic schema inference when needed.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language extraction goals (e.g., "Extract product names, prices, and categories from descriptions"). | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure. None triggers automatic schema inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum values for schema inference. Defaults to 100. | `100` |
`batch_size` | `int \| None` | Requests per batch. None for automatic optimization. Defaults to None. | `None` |
`max_concurrency` | `int` | Maximum concurrent API requests. Defaults to 8. | `8` |
`show_progress` | `bool` | Show progress bar. Defaults to False. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Parsed structured data indexed like the original Series. |
Example
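A minimal sketch with automatic schema inference; must be awaited:
import pandas as pd
reviews = pd.Series([
    "Great product! 5 stars. Fast shipping.",
    "Poor quality. 2 stars. Slow delivery.",
])
parsed = await reviews.aio.parse(
    "Extract review rating and shipping feedback",
    max_concurrency=4,
)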
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
AsyncOpenAIVecDataFrameAccessor ¶
pandas DataFrame accessor (`.aio`) that adds OpenAI helpers.
Source code in src/openaivec/pandas_ext.py
Functions¶
responses_with_cache async ¶
responses_with_cache(
instructions: str,
cache: AsyncBatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] = str,
**api_kwargs,
) -> pd.Series
Generate a response for each row after serializing it to JSON using a provided cache (asynchronously).
This method allows external control over caching behavior by accepting a pre-configured AsyncBatchingMapProxy instance, enabling cache sharing across multiple operations or custom batch size management. The concurrency is controlled by the cache instance itself.
Example
from openaivec._proxy import AsyncBatchingMapProxy
# Create a shared cache with custom batch size and concurrency
shared_cache = AsyncBatchingMapProxy(batch_size=64, max_concurrency=4)
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
# Must be awaited
result = await df.aio.responses_with_cache(
"what is the animal's name?",
cache=shared_cache
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt for the assistant. | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured cache instance for managing API call batching and deduplication. Set `cache.batch_size=None` to enable automatic batch size optimization. | required |
`response_format` | `type[ResponseFormat]` | Desired Python type of the responses. Defaults to `str`. | `str` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Responses aligned with the DataFrame's original index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
responses async ¶
responses(
instructions: str,
response_format: type[ResponseFormat] = str,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Generate a response for each row after serializing it to JSON (asynchronously).
Example
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
# Must be awaited
results = await df.aio.responses("what is the animal's name?")
# With progress bar for large datasets
large_df = pd.DataFrame({"id": list(range(1000))})
results = await large_df.aio.responses(
"generate a name for this ID",
batch_size=20,
max_concurrency=4,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | System prompt for the assistant. | required |
`response_format` | `type[ResponseFormat]` | Desired Python type of the responses. Defaults to `str`. | `str` |
`batch_size` | `int \| None` | Number of requests sent in one batch. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to `8`. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters (temperature, top_p, etc.). | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Responses aligned with the DataFrame's original index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
task_with_cache async ¶
task_with_cache(
task: PreparedTask[ResponseFormat],
cache: AsyncBatchingMapProxy[str, ResponseFormat],
) -> pd.Series
Execute a prepared task on each DataFrame row using a provided cache (asynchronously).
After serializing each row to JSON, this method executes the prepared task.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | Prepared task (instructions + response_format + sampling params). | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured async cache instance. | required |
Note
The task's stored API parameters are used. Core routing keys are managed internally.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Task results aligned with the DataFrame's original index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
task async ¶
task(
task: PreparedTask,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
) -> pd.Series
Execute a prepared task on each DataFrame row after serializing it to JSON (asynchronously).
Example
from openaivec._model import PreparedTask
# Assume you have a prepared task for data analysis
analysis_task = PreparedTask(...)
df = pd.DataFrame([
{"name": "cat", "legs": 4},
{"name": "dog", "legs": 4},
{"name": "elephant", "legs": 4},
])
# Must be awaited
results = await df.aio.task(analysis_task)
# With progress bar for large datasets
large_df = pd.DataFrame({"id": list(range(1000))})
results = await large_df.aio.task(
analysis_task,
batch_size=50,
max_concurrency=4,
show_progress=True
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`task` | `PreparedTask` | A pre-configured task containing instructions, response format, and other parameters for processing the inputs. | required |
`batch_size` | `int \| None` | Number of requests sent in one batch to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to 8. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Note
The task's stored API parameters are used. Core batching / routing keys (`model`, `instructions` / system message, user `input`) are managed by the library and cannot be overridden.
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Series whose values are instances of the task's response format, aligned with the DataFrame's original index. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
parse_with_cache async ¶
parse_with_cache(
instructions: str,
cache: AsyncBatchingMapProxy[str, ResponseFormat],
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
**api_kwargs,
) -> pd.Series
Parse DataFrame rows into structured data using an LLM with cache (asynchronously).
Async method for parsing DataFrame rows (as JSON) with external cache control, enabling deduplication across operations and concurrent processing.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language extraction goals (e.g., "Extract invoice details including items, quantities, and totals"). | required |
`cache` | `AsyncBatchingMapProxy[str, ResponseFormat]` | Pre-configured async cache for concurrent API call management. | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure. None triggers automatic schema inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum rows for schema inference. Defaults to 100. | `100` |
`**api_kwargs` | | Additional OpenAI API parameters. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Parsed structured data indexed like the original DataFrame. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
parse async ¶
parse(
instructions: str,
response_format: type[ResponseFormat] | None = None,
max_examples: int = 100,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
**api_kwargs,
) -> pd.Series
Parse DataFrame rows into structured data using an LLM (asynchronously).
Async version for extracting structured information from DataFrame rows, with automatic schema inference when no format is specified.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`instructions` | `str` | Plain language extraction goals (e.g., "Extract customer details, order items, and payment information"). | required |
`response_format` | `type[ResponseFormat] \| None` | Target structure. None triggers automatic inference. Defaults to None. | `None` |
`max_examples` | `int` | Maximum rows for schema inference. Defaults to 100. | `100` |
`batch_size` | `int \| None` | Rows per batch. None for automatic optimization. Defaults to None. | `None` |
`max_concurrency` | `int` | Maximum concurrent requests. Defaults to 8. | `8` |
`show_progress` | `bool` | Show progress bar. Defaults to False. | `False` |
`**api_kwargs` | | Additional OpenAI API parameters. | `{}` |
Returns:
Type | Description |
---|---|
`Series` | pandas.Series: Parsed structured data indexed like the original DataFrame. |
Example
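A minimal sketch adapted from the synchronous DataFrame example above; must be awaited:
import pandas as pd
df = pd.DataFrame({
    "log": [
        "2024-01-01 10:00 ERROR Database connection failed",
        "2024-01-01 10:05 INFO Service started successfully",
    ]
})
parsed = await df.aio.parse("Extract timestamp, level, and message")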
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
pipe async ¶
Apply a function to the DataFrame, supporting both synchronous and asynchronous functions.
This method allows chaining operations on the DataFrame, similar to pandas' `pipe` method, but with support for asynchronous functions.
Example
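A minimal sketch passing an asynchronous function; the call must be awaited:
import asyncio
import pandas as pd
async def add_flag(df: pd.DataFrame) -> pd.DataFrame:
    await asyncio.sleep(0)  # stand-in for asynchronous work
    return df.assign(flag=True)
df = pd.DataFrame({"x": [1, 2, 3]})
result = await df.aio.pipe(add_flag)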
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`func` | `Callable[[DataFrame], Awaitable[T] \| T]` | A function that takes a DataFrame as input and returns either a result or an awaitable result. | required |
Returns:
Name | Type | Description |
---|---|---|
`T` | `T` | The result of applying the function, either directly or after awaiting it. |
Note
This is an asynchronous method and must be awaited if the function returns an awaitable.
Source code in src/openaivec/pandas_ext.py
assign async ¶
Asynchronously assign new columns to the DataFrame, evaluating sequentially.
This method extends pandas' `assign` method by supporting asynchronous functions as column values and evaluating assignments sequentially, allowing later assignments to refer to columns created earlier in the same call.
For each key-value pair in `kwargs`:
- If the value is a callable, it is invoked with the current state of the DataFrame (including columns created in previous steps of this `assign` call). If the result is awaitable, it is awaited; otherwise, it is used directly.
- If the value is not callable, it is assigned directly to the new column.
Example
async def compute_column(df):
# Simulate an asynchronous computation
await asyncio.sleep(1)
return df["existing_column"] * 2
async def use_new_column(df):
# Access the column created in the previous step
await asyncio.sleep(1)
return df["new_column"] + 5
df = pd.DataFrame({"existing_column": [1, 2, 3]})
# Must be awaited
df = await df.aio.assign(
new_column=compute_column,
another_column=use_new_column
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`**kwargs` | | Column names as keys and either static values or callables (synchronous or asynchronous) as values. | `{}` |
Returns:
Type | Description |
---|---|
`DataFrame` | pandas.DataFrame: A new DataFrame with the assigned columns. |
Note
This is an asynchronous method and must be awaited.
Source code in src/openaivec/pandas_ext.py
fillna async ¶
fillna(
target_column_name: str,
max_examples: int = 500,
batch_size: int | None = None,
max_concurrency: int = 8,
show_progress: bool = False,
) -> pd.DataFrame
Fill missing values in a DataFrame column using AI-powered inference (asynchronously).
This method uses machine learning to intelligently fill missing (NaN) values in a specified column by analyzing patterns from non-missing rows in the DataFrame. It creates a prepared task that provides examples of similar rows to help the AI model predict appropriate values for the missing entries.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`target_column_name` | `str` | The name of the column containing missing values that need to be filled. | required |
`max_examples` | `int` | The maximum number of example rows to use for context when predicting missing values. Higher values may improve accuracy but increase API costs and processing time. Defaults to 500. | `500` |
`batch_size` | `int \| None` | Number of requests sent in one batch to optimize API usage. Defaults to `None` (automatic batch size optimization). | `None` |
`max_concurrency` | `int` | Maximum number of concurrent requests. Defaults to 8. | `8` |
`show_progress` | `bool` | Show progress bar in Jupyter notebooks. Defaults to `False`. | `False` |
Returns:
Type | Description |
---|---|
`DataFrame` | pandas.DataFrame: A new DataFrame with missing values filled in the target column. The original DataFrame is not modified. |
Example
df = pd.DataFrame({
'name': ['Alice', 'Bob', None, 'David'],
'age': [25, 30, 35, None],
'city': ['Tokyo', 'Osaka', 'Kyoto', 'Tokyo']
})
# Fill missing values in the 'name' column (must be awaited)
filled_df = await df.aio.fillna('name')
# With progress bar for large datasets
large_df = pd.DataFrame({'name': [None] * 1000, 'age': list(range(1000))})
filled_df = await large_df.aio.fillna(
'name',
batch_size=32,
max_concurrency=4,
show_progress=True
)
Note
This is an asynchronous method and must be awaited. If the target column has no missing values, the original DataFrame is returned unchanged.
Source code in src/openaivec/pandas_ext.py
Functions¶
use ¶
Register a custom OpenAI‑compatible client.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`client` | `OpenAI` | A pre-configured OpenAI-compatible client instance (e.g. `OpenAI` or `AzureOpenAI`). | required |
Source code in src/openaivec/pandas_ext.py
use_async ¶
Register a custom asynchronous OpenAI‑compatible client.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`client` | `AsyncOpenAI` | A pre-configured asynchronous OpenAI-compatible client instance (e.g. `AsyncOpenAI` or `AsyncAzureOpenAI`). | required |
Source code in src/openaivec/pandas_ext.py
responses_model ¶
Override the model used for text responses.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`name` | `str` | For Azure OpenAI, use your deployment name. For OpenAI, use the model name (for example, `gpt-4.1-mini`). | required |
Source code in src/openaivec/pandas_ext.py
embeddings_model ¶
Override the model used for text embeddings.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
`name` | `str` | For Azure OpenAI, use your deployment name. For OpenAI, use the model name, e.g. `text-embedding-3-small`. | required |