`ontocast.agent.check_chunks`¶

Chunk management agent for OntoCast.

This module provides functionality for managing document chunks in the OntoCast workflow. It handles chunk processing state transitions and ensures proper workflow progression through the chunk processing pipeline.

The module supports: - Checking if chunks are available for processing - Managing chunk state transitions - Updating processing status based on chunk availability - Logging chunk processing progress

`check_chunks_empty(state)` ¶

Check if chunks are available and manage chunk processing state.

This function checks if there are remaining chunks to process and manages the state transitions accordingly. If chunks are available, it sets up the next chunk for processing. If no chunks remain, it signals completion of the workflow.

The function performs the following operations: 1. Adds the current chunk to the processed list if it exists 2. Checks if there are remaining chunks to process 3. Sets up the next chunk and resets node visits if chunks remain 4. Sets appropriate status for workflow routing

Parameters:

Name	Type	Description	Default
`state`	`AgentState`	The current agent state containing chunks and processing status.	required

Returns:

Name	Type	Description
`AgentState`	`AgentState`	Updated agent state with chunk processing information.

Example

state = AgentState(chunks=[chunk1, chunk2], current_chunk=None) updated_state = check_chunks_empty(state) print(updated_state.current_chunk) # chunk1 print(updated_state.status) # Status.FAILED

Source code in ontocast/agent/check_chunks.py

def check_chunks_empty(state: AgentState) -> AgentState:
    """Check if chunks are available and manage chunk processing state.

    This function checks if there are remaining chunks to process and
    manages the state transitions accordingly. If chunks are available,
    it sets up the next chunk for processing. If no chunks remain,
    it signals completion of the workflow.

    The function performs the following operations:
    1. Adds the current chunk to the processed list if it exists
    2. Checks if there are remaining chunks to process
    3. Sets up the next chunk and resets node visits if chunks remain
    4. Sets appropriate status for workflow routing

    Args:
        state: The current agent state containing chunks and processing status.

    Returns:
        AgentState: Updated agent state with chunk processing information.

    Example:
        >>> state = AgentState(chunks=[chunk1, chunk2], current_chunk=None)
        >>> updated_state = check_chunks_empty(state)
        >>> print(updated_state.current_chunk)  # chunk1
        >>> print(updated_state.status)  # Status.FAILED
    """

    if CHUNK_NULL_IRI not in state.current_chunk.iri:
        state.current_chunk.processed = True
        state.current_chunk.graph.remap_namespaces(
            old_namespace=DEFAULT_CHUNK_IRI, new_namespace=state.current_chunk.namespace
        )
        state.chunks_processed.append(state.current_chunk)

    if state.chunks:
        state.current_chunk = state.chunks.pop(0)
        state.node_visits = defaultdict(int)
        state.status = Status.FAILED
    else:
        state.current_chunk = Chunk(
            text="",
            hid="default",
            doc_iri=CHUNK_NULL_IRI,
        )
        state.status = Status.SUCCESS
        logger.info(
            f"All chunks processed ({len(state.chunks_processed)} total), "
            "setting status to SUCCESS and proceeding to AGGREGATE_FACTS"
        )

    return state

ontocast.agent.check_chunks¶

check_chunks_empty(state) ¶

`ontocast.agent.check_chunks`¶

`check_chunks_empty(state)` ¶