
Workflow Engine Architecture

The qdash.workflow.engine module provides the core infrastructure for calibration workflow execution: task lifecycle management, state tracking, scheduling, data persistence (MongoDB + filesystem), and hardware backend abstraction.

Architecture Diagram

See the Workflow Engine Architecture diagram for a visual overview of the components described below.

Module Structure

```
engine/
├── __init__.py          # Public API exports
├── orchestrator.py      # CalibOrchestrator - session lifecycle
├── config.py            # CalibConfig - session configuration
├── task_runner.py       # Prefect task wrappers
├── params_updater.py    # Backend parameter updates
├── util.py              # Utility functions
│
├── task/                # Task execution layer
│   ├── context.py       # TaskContext - execution context
│   ├── executor.py      # TaskExecutor - task lifecycle
│   ├── state_manager.py # TaskStateManager - state tracking
│   ├── result_processor.py # Result validation
│   └── history_recorder.py # History recording
│
├── execution/           # Execution management layer
│   ├── service.py       # ExecutionService - session tracking
│   ├── state_manager.py # ExecutionStateManager
│   └── models.py        # Execution data models
│
├── scheduler/           # Scheduling layer
│   ├── cr_scheduler.py  # CRScheduler - 2-qubit scheduling
│   ├── one_qubit_scheduler.py  # 1-qubit scheduling
│   └── plugins.py       # Ordering strategies
│
├── repository/          # Data persistence layer
│   ├── protocols.py     # Repository interfaces
│   ├── mongo_impl.py    # MongoDB implementations
│   ├── mongo_execution.py  # Execution repository
│   └── filesystem_impl.py  # Filesystem implementations
│
└── backend/             # Hardware abstraction layer
    ├── base.py          # BaseBackend abstract class
    ├── factory.py       # Backend factory
    ├── qubex.py         # Qubex backend
    └── fake.py          # Fake backend for testing
```

Core Components

1. CalibOrchestrator

Location: engine/orchestrator.py

Purpose: Manages the complete lifecycle of a calibration session.

Responsibilities:

  • Creates directory structure for calibration data
  • Initializes ExecutionService, TaskContext, and Backend
  • Coordinates task execution via run_task()
  • Handles session completion and failure

Usage:

```python
from qdash.workflow.engine import CalibOrchestrator, CalibConfig

config = CalibConfig(
    username="alice",
    chip_id="64Qv3",
    qids=["0", "1"],
    execution_id="20240101-001",
)
orchestrator = CalibOrchestrator(config)
orchestrator.initialize()

# Run tasks
result = orchestrator.run_task("CheckRabi", qid="0")

# Complete session
orchestrator.complete()
```

2. TaskContext

Location: engine/task/context.py

Purpose: Container for task execution state and results.

Key Attributes:

  • execution_id: Current execution identifier
  • task_result: Container for qubit/coupling/global task results
  • calib_data: Calibration data (parameters extracted from tasks)

3. TaskExecutor

Location: engine/task/executor.py

Purpose: Executes individual calibration tasks with proper lifecycle management.

Execution Flow:

See the Task Executor Flow diagram for the complete execution lifecycle, state machine, and repository pattern.

4. TaskStateManager

Location: engine/task/state_manager.py

Purpose: Manages task state transitions and parameter storage.

State Transitions: SCHEDULED → RUNNING → COMPLETED / FAILED / CANCELLED (see Task Executor Flow diagram above)

Key Methods:

  • ensure_task_exists(): Create task entry if not exists
  • start_task(): Mark task as running
  • put_input_parameters(): Store input parameters
  • put_output_parameters(): Store output parameters
  • update_task_status_to_completed(): Mark success
  • update_task_status_to_failed(): Mark failure
  • end_task(): Record end timestamp
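The transition rules above can be modeled as a small state machine. The following is an illustrative, self-contained sketch of those rules; the names and shapes are not the qdash API:

```python
from enum import Enum


class TaskStatus(str, Enum):
    SCHEDULED = "scheduled"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"
    CANCELLED = "cancelled"


# Allowed transitions: SCHEDULED -> RUNNING -> terminal state.
ALLOWED = {
    TaskStatus.SCHEDULED: {TaskStatus.RUNNING, TaskStatus.CANCELLED},
    TaskStatus.RUNNING: {TaskStatus.COMPLETED, TaskStatus.FAILED, TaskStatus.CANCELLED},
}


def transition(current: TaskStatus, target: TaskStatus) -> TaskStatus:
    """Return the new status, rejecting transitions the lifecycle forbids."""
    if target not in ALLOWED.get(current, set()):
        raise ValueError(f"illegal transition: {current.value} -> {target.value}")
    return target
```

Terminal states have no outgoing transitions, so a completed or failed task can never be restarted by mistake.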

5. ExecutionService

Location: engine/execution/service.py

Purpose: Manages workflow execution sessions in MongoDB.

Responsibilities:

  • Creates and tracks execution records
  • Updates task results during execution
  • Manages execution status (RUNNING, COMPLETED, FAILED)
  • Handles tags and metadata

6. Schedulers

CRScheduler (2-Qubit)

Location: engine/scheduler/cr_scheduler.py

Purpose: Schedules 2-qubit (Cross-Resonance) calibration tasks.

Features:

  • Graph coloring for conflict avoidance
  • MUX-aware parallel grouping
  • Multiple coloring strategies
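Graph coloring here means treating qubit pairs as nodes and shared-qubit conflicts as edges: pairs assigned the same color can be calibrated in parallel. A self-contained greedy sketch of the idea (illustrative only, not the qdash implementation):

```python
from collections import defaultdict


def color_pairs(pairs: list[tuple[str, str]]) -> dict[tuple[str, str], int]:
    """Greedy graph coloring: pairs that share a qubit conflict and must
    run in different rounds (colors)."""
    # Build the conflict graph: an edge between pairs sharing a qubit.
    conflicts: dict[tuple[str, str], set] = defaultdict(set)
    for i, a in enumerate(pairs):
        for b in pairs[i + 1:]:
            if set(a) & set(b):
                conflicts[a].add(b)
                conflicts[b].add(a)

    # Assign each pair the smallest color unused by its neighbors.
    colors: dict[tuple[str, str], int] = {}
    for pair in pairs:
        used = {colors[n] for n in conflicts[pair] if n in colors}
        colors[pair] = next(c for c in range(len(pairs)) if c not in used)
    return colors
```

Pairs like ("0","1") and ("1","2") share qubit 1 and land in different rounds, while disjoint pairs can share a round.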

OneQubitScheduler (1-Qubit)

Location: engine/scheduler/one_qubit_scheduler.py

Purpose: Schedules 1-qubit calibration tasks.

Features:

  • Box-aware grouping (BOX_A, BOX_B, BOX_MIXED)
  • Synchronized execution mode
  • Pluggable ordering strategies
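Box-aware grouping amounts to partitioning qubit IDs by their control box so each group can be driven together. A minimal illustrative sketch (the mapping and helper name are assumptions, not the qdash API):

```python
def group_by_box(qid_to_box: dict[str, str]) -> dict[str, list[str]]:
    """Group qubit IDs by their control box (e.g. BOX_A, BOX_B) so each
    group can be scheduled as a unit."""
    groups: dict[str, list[str]] = {}
    for qid, box in sorted(qid_to_box.items()):
        groups.setdefault(box, []).append(qid)
    return groups
```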

7. Repository Layer

Location: engine/repository/

Purpose: Data persistence abstraction using the Repository Pattern.

The Repository Pattern separates data access logic from business logic, enabling:

  • Testability: Swap MongoDB for InMemory implementations in tests
  • Flexibility: Easy to change persistence mechanisms
  • Clean Architecture: Business logic doesn't depend on database details

The Repository Pattern is visualized in the Task Executor Flow diagram (see above).

Protocols (interfaces in protocols.py):

| Protocol | Purpose |
| --- | --- |
| TaskResultHistoryRepository | Task result history recording |
| ChipRepository | Chip configuration access |
| ChipHistoryRepository | Chip history snapshots |
| CalibDataSaver | Figure and raw data saving |
| ExecutionRepository | Execution session records |
| CalibrationNoteRepository | Calibration note storage |
| QubitCalibrationRepository | Qubit calibration data updates |
| CouplingCalibrationRepository | Coupling calibration data updates |
| ExecutionCounterRepository | Atomic execution ID counter |
| ExecutionLockRepository | Project execution locking |
| UserRepository | User preferences |
| TaskRepository | Task name lookup |

MongoDB Implementations:

  • MongoTaskResultHistoryRepository
  • MongoChipRepository
  • MongoChipHistoryRepository
  • MongoExecutionRepository
  • MongoCalibrationNoteRepository
  • MongoQubitCalibrationRepository
  • MongoCouplingCalibrationRepository
  • MongoExecutionCounterRepository
  • MongoExecutionLockRepository
  • MongoUserRepository
  • MongoTaskRepository

InMemory Implementations (for testing):

  • InMemoryExecutionRepository
  • InMemoryChipRepository
  • InMemoryChipHistoryRepository
  • InMemoryTaskResultHistoryRepository
  • InMemoryCalibrationNoteRepository
  • InMemoryQubitCalibrationRepository
  • InMemoryCouplingCalibrationRepository
  • InMemoryExecutionCounterRepository
  • InMemoryExecutionLockRepository
  • InMemoryUserRepository
  • InMemoryTaskRepository

Filesystem Implementations:

  • FilesystemCalibDataSaver: Local filesystem for figures/data

Usage with Dependency Injection:

```python
# Production code (MongoDB)
from qdash.repository import MongoChipRepository

chip_repo = MongoChipRepository()
chip = chip_repo.get_current_chip(username="alice")

# Test code (InMemory)
from qdash.repository.inmemory import InMemoryChipRepository

chip_repo = InMemoryChipRepository()
chip_repo.add_chip("alice", mock_chip)  # Test helper

# With DI in service
scheduler = CRScheduler(
    username="alice",
    chip_id="64Qv3",
    chip_repo=InMemoryChipRepository(),  # Inject for testing
)
```

8. Backend Layer

Location: engine/backend/

Purpose: Hardware abstraction.

BaseBackend Interface:

```python
from abc import ABC, abstractmethod
from typing import Any


class BaseBackend(ABC):
    name: str

    @abstractmethod
    def connect(self) -> None: ...

    @abstractmethod
    def get_instance(self) -> Any: ...

    @abstractmethod
    def save_note(...) -> None: ...

    @abstractmethod
    def update_note(...) -> None: ...
```

Implementations:

  • QubexBackend: Real hardware via qubex library
  • FakeBackend: Simulation for testing
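As a self-contained illustration of implementing this interface, a minimal fake backend might look like the sketch below. It restates an abridged version of the base class inline so the example runs on its own; the real FakeBackend in engine/backend/fake.py may differ:

```python
from abc import ABC, abstractmethod
from typing import Any


class BaseBackend(ABC):
    """Abridged restatement of the interface above, for illustration."""
    name: str

    @abstractmethod
    def connect(self) -> None: ...

    @abstractmethod
    def get_instance(self) -> Any: ...


class FakeBackend(BaseBackend):
    """Simulation backend: records state instead of touching hardware."""
    name = "fake"

    def __init__(self) -> None:
        self.connected = False

    def connect(self) -> None:
        self.connected = True

    def get_instance(self) -> Any:
        # A real backend would return an experiment session here.
        return {"backend": self.name, "connected": self.connected}
```

Because tasks only depend on the BaseBackend interface, swapping QubexBackend for FakeBackend requires no changes to task code.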

Data Flow

The data flow (Preprocess → Run → Postprocess) and persistence flow (TaskStateManager, TaskHistoryRecorder, FilesystemCalibDataSaver, ExecutionService) are illustrated in the Task Executor Flow diagram above.

Cancellation

Overview

Flow cancellation allows users to stop a running calibration from the UI. The cancellation lifecycle involves the API, Prefect, and the workflow engine.

Mechanism

Prefect 3 cancels flows by sending SIGTERM to the worker process, which means Python except blocks do not execute when a flow is cancelled. Instead, Prefect provides an on_cancellation hook that runs in a separate process after the worker has been terminated.

Implementation

All top-level @flow decorators register the on_flow_cancellation hook:

```python
from prefect import flow

from qdash.workflow.service.calib_service import on_flow_cancellation

@flow(on_cancellation=[on_flow_cancellation])
def my_calibration_flow(...):
    ...
```

The hook:

  1. Reads flow run parameters (project_id, flow_run_id) from the Prefect flow run context
  2. Initializes the database connection (since it runs in a new process)
  3. Finds the execution by note.flow_run_id in execution_history
  4. Updates all non-terminal tasks (running/scheduled/pending) to cancelled
  5. Sets the execution status to cancelled
  6. Releases the execution lock

flow_run_id Bridge

QDash uses date-based execution IDs (YYYYMMDD-NNN), while Prefect uses UUIDs for flow runs. The bridge is:

  • At flow start, CalibService._store_flow_run_id() stores the Prefect UUID in execution.note["flow_run_id"]
  • The cancel API accepts the Prefect flow_run_id (UUID) directly
  • The on_cancellation hook uses flow_run_id to look up the QDash execution
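Since the two ID formats are disjoint, code on either side of the bridge can tell them apart cheaply. An illustrative helper (not part of qdash) that classifies an identifier:

```python
import re
import uuid


def id_kind(identifier: str) -> str:
    """Classify an ID as a QDash execution ID (YYYYMMDD-NNN) or a
    Prefect flow-run UUID."""
    if re.fullmatch(r"\d{8}-\d{3}", identifier):
        return "execution_id"
    try:
        uuid.UUID(identifier)
        return "flow_run_id"
    except ValueError:
        return "unknown"
```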

Status Transitions on Cancel

| Entity | Before Cancel | After Cancel |
| --- | --- | --- |
| Execution | running | cancelled |
| Task | running / scheduled / pending | cancelled |
| Task | completed / failed / skipped | (unchanged) |
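The task-side rule reduces to a pure function over statuses: non-terminal tasks become cancelled, terminal ones are left alone. A minimal sketch of that rule (illustrative; the real batch update runs directly against MongoDB):

```python
# Statuses that a cancel must not overwrite.
TERMINAL = {"completed", "failed", "skipped", "cancelled"}


def statuses_after_cancel(task_statuses: dict[str, str]) -> dict[str, str]:
    """Apply the cancellation rules: non-terminal tasks become
    'cancelled'; terminal tasks are unchanged."""
    return {
        task: (status if status in TERMINAL else "cancelled")
        for task, status in task_statuses.items()
    }
```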

CalibService Methods

| Method | Purpose |
| --- | --- |
| on_flow_cancellation() | Prefect hook that runs after SIGTERM |
| cancel_calibration() | In-process cancellation (for the exception path) |
| _cancel_executions_by_flow_run_id() | Direct MongoDB update by flow_run_id |
| _finalize_tasks_on_cancel() | Batch-update non-terminal tasks |
| _is_cancellation(e) | Detect CancelledRun/CancelledError exceptions |

Extension Points

Adding a New Backend

  1. Create engine/backend/your_backend.py:
```python
from typing import Any

from qdash.workflow.engine.backend.base import BaseBackend

class YourBackend(BaseBackend):
    name = "your_backend"

    def connect(self) -> None:
        # Initialize hardware connection
        pass

    def get_instance(self) -> Any:
        # Return experiment session
        pass

    # save_note() and update_note() are also abstract and must be implemented.
```
  2. Register in engine/backend/factory.py
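The registration step could follow a simple name-to-class registry. This is a hypothetical sketch of the pattern; the actual factory.py API may differ:

```python
# Hypothetical backend registry sketch.
BACKENDS: dict[str, type] = {}


def register(name: str):
    """Class decorator that records a backend under a name."""
    def deco(cls: type) -> type:
        BACKENDS[name] = cls
        return cls
    return deco


def create_backend(name: str):
    """Instantiate a registered backend, or fail with a clear error."""
    try:
        return BACKENDS[name]()
    except KeyError:
        raise ValueError(f"unknown backend: {name}") from None


@register("fake")
class FakeBackend:
    name = "fake"
```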

Adding a New Scheduler Strategy

  1. Implement the strategy in engine/scheduler/plugins.py
  2. Register in the scheduler's strategy registry
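An ordering strategy is essentially a function from a list of qubit IDs to a reordered list, looked up by name. A hypothetical sketch of such a registry (plugins.py may differ in detail):

```python
from typing import Callable

# Hypothetical strategy registry: name -> ordering function.
OrderingStrategy = Callable[[list[str]], list[str]]
STRATEGIES: dict[str, OrderingStrategy] = {
    "ascending": lambda qids: sorted(qids, key=int),
    "descending": lambda qids: sorted(qids, key=int, reverse=True),
}


def order_qids(qids: list[str], strategy: str = "ascending") -> list[str]:
    """Apply a named ordering strategy to a list of qubit IDs."""
    return STRATEGIES[strategy](qids)
```

New strategies are added by inserting another entry into the registry, with no changes to scheduler code.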

Adding a New Repository Implementation

  1. Define or use existing protocol from engine/repository/protocols.py:
```python
from typing import Protocol, runtime_checkable

@runtime_checkable
class YourRepository(Protocol):
    def find(self, id: str) -> YourModel | None: ...
    def save(self, model: YourModel) -> None: ...
```
  2. Create MongoDB implementation:

```python
# engine/repository/mongo_your.py
class MongoYourRepository:
    def find(self, id: str) -> YourModel | None:
        doc = YourDocument.find_one({"id": id}).run()
        return self._to_model(doc) if doc else None

    def save(self, model: YourModel) -> None:
        YourDocument.from_model(model).save()
```
  3. Create InMemory implementation for testing:

```python
# engine/repository/inmemory_impl.py
class InMemoryYourRepository:
    def __init__(self):
        self._store: dict[str, YourModel] = {}

    def find(self, id: str) -> YourModel | None:
        return self._store.get(id)

    def save(self, model: YourModel) -> None:
        self._store[model.id] = model

    def clear(self) -> None:  # Test helper
        self._store.clear()
```
  4. Export from engine/repository/__init__.py

  5. Use with dependency injection in services:

```python
class YourService:
    def __init__(self, *, repo: YourRepository | None = None):
        if repo is None:
            from ... import MongoYourRepository
            repo = MongoYourRepository()
        self._repo = repo
```

Released under the Apache 2.0 License.