Architecture¶

AMX is a CLI shell wrapped around a multi-agent inference engine, a backend-neutral metadata adapter, and a local SQLite catalog. Every component is replaceable through configuration — swap LLM providers, change DB backends, plug in different document or code roots — without re-architecting the pipeline.

High-level flow¶

flowchart TB
    subgraph CLI["AMX CLI Shell"]
        Setup["/setup"]
        Run["/run · /run-apply"]
        Ask["/ask · /search"]
        Hist["/history"]
    end

    subgraph Orchestrator["Orchestrator Agent"]
        direction LR
        Profile["Profile Agent<br/>(DB stats)"]
        RAG["RAG Agent<br/>(Documents)"]
        Code["Code Agent<br/>(Codebase)"]
        Merge["Merge &amp; Rank"]
        Profile --> Merge
        RAG --> Merge
        Code --> Merge
    end

    subgraph Sources["Evidence Sources"]
        DB["10 Database<br/>backends"]
        Docs["Local files · GitHub<br/>S3 · GDrive · SharePoint"]
        Repos["Codebase<br/>Python · SQL · Java · …"]
    end

    subgraph Storage["Local State (~/.amx/)"]
        Conf["config.yml"]
        HistDB["history.db<br/>(runs · catalog · sessions)"]
        Chroma["Chroma stores<br/>amx_search · amx_code · docs"]
    end

    LLM["LLM Provider<br/>OpenAI · Anthropic · Gemini · …"]

    Run --> Orchestrator
    Ask --> SearchAgent["Search Agent<br/>(planner · retrieve · verify)"]
    Profile -.reads.-> DB
    RAG -.reads.-> Docs
    RAG -.embeds via.-> Chroma
    Code -.reads.-> Repos
    Code -.embeds via.-> Chroma
    Orchestrator -.calls.-> LLM
    Merge --> HITL["Human-in-the-Loop<br/>Review"]
    HITL -->|writeback| DB
    HITL --> HistDB
    SearchAgent --> HistDB
    SearchAgent -.calls.-> LLM

Components¶

CLI shell (`amx.cli` + `amx.cli_support`)¶

Thin layer over Click and prompt_toolkit. Provides the slash-command REPL, namespace switching (/db, /llm, /code, /docs, /search, /analyze, /history, /session), tab completion, and the interactive review wizard.

Each namespace is a Click sub-command tree under amx.cli_support.commands.*. The CLI is not the public Python API — for programmatic use see Python API.

Multi-agent orchestrator (`amx.agents`)¶

The orchestrator dispatches three independent sub-agents in parallel and merges their suggestions:

Agent	Source	Module	Output
Profile Agent	Database	`amx.agents.profile_agent`	Per-column suggestions backed by stats
RAG Agent	Documents	`amx.agents.rag_agent`	Per-column suggestions backed by doc snippets
Code Agent	Codebase	`amx.agents.code_agent`	Per-column suggestions backed by code references

The merge step is itself an LLM call — the orchestrator prompt uses source precedence rather than averaging conflicting descriptions. See Agents for the details.

Search Agent (`amx.search.agent`)¶

Powers /ask — a separate, multi-step pipeline:

Interpretation — classify intent and pick an answer shape.
Retrieval planning — choose between catalog scan, vector lookup, or live DB probe.
Grounded retrieval — fetch evidence from amx_search Chroma + the SQLite catalog.
Live verification — for high-risk structural claims, run a read-only DB probe.
Synthesis — produce the answer and a follow-up action plan.

Read the Ask & Search page for usage; the answer-shape design is deliberate so superlative questions ("which table has the most rows?") get one fact instead of a dump.

Universal Metadata Interface (`amx.core.metadata`)¶

AbstractEntity and UniversalMetadataAdapter map backend-specific column / table profiles into a backend-neutral shape. This is the seam that lets the agents and review wizard treat PostgreSQL, Snowflake, Databricks, and BigQuery identically.

See Universal metadata.

Database adapters (`amx.db.adapters.*`)¶

Per-backend driver wrappers. Each adapter declares a BackendCapabilities flag set so unsupported list operations (e.g. ClickHouse UPDATE) short-circuit cleanly instead of raising opaque driver errors. Comments are written through the backend's native mechanism — COMMENT ON TABLE for Postgres, sp_addextendedproperty for SQL Server, ALTER … MODIFY COMMENT for ClickHouse, and so on.

Storage layer (`amx.storage`)¶

The local SQLite store at ~/.amx/history.db carries everything: runs, results, app events, the search catalog, session state, and the optional pending_shared_writes outbox for shared mode. Reads are always local; writes are dual-written to a shared backend when shared mode is enabled.

LLM provider (`amx.llm.provider`)¶

Single unified interface over LiteLLM with extensions for batch APIs (OpenAI Batch, Anthropic Message Batches), logprob collection, and reasoning-token budget management for o-series / kimi-k2-thinking / Claude extended thinking routes.

Data flow: a single `/run`¶

sequenceDiagram
    participant U as User
    participant CLI
    participant Orch as Orchestrator
    participant P as Profile Agent
    participant R as RAG Agent
    participant C as Code Agent
    participant LLM
    participant DB

    U->>CLI: /run sap_s6p.t001
    CLI->>Orch: dispatch run(scope=t001)
    par Profile branch
        Orch->>P: profile(t001)
        P->>DB: stats + samples + comments
        P->>LLM: chat(prompt, batch of columns)
        LLM-->>P: suggestions + logprobs
    and RAG branch
        Orch->>R: rag(t001)
        R->>R: vector search amx_docs
        R->>LLM: chat(prompt + retrieved chunks)
        LLM-->>R: suggestions
    and Code branch
        Orch->>C: code(t001)
        C->>C: vector search amx_code + literal scan
        C->>LLM: chat(prompt + code snippets)
        LLM-->>C: suggestions
    end
    Orch->>LLM: chat(merge prompt with all candidates)
    LLM-->>Orch: ranked suggestions per column
    Orch->>CLI: review payload
    CLI->>U: interactive review wizard
    U->>CLI: accept / pick / write / skip
    U->>CLI: /apply
    CLI->>DB: COMMENT ON COLUMN ...
    CLI->>CLI: persist to history.db

Where AMX is opinionated¶

Inference, not loading. AMX assumes the data is already in the database. Importing, ETL, and schema migration are out of scope.
Human review is mandatory by default. /run-apply exists, but the review wizard is the default path.
Local-first state. The local SQLite cache is always the source of truth for reads. Shared mode is a dual-write addition, not a replacement.
Provider-agnostic LLMs. Switch providers per profile. No Anthropic-only or OpenAI-only paths in the agent code.
Token usage is visible. /usage shows token totals; /llm-batch-size and /n-alternatives are explicit knobs you can tune down.

Reading further¶

Agents — what each sub-agent reads, what it sends to the LLM, how merging works.
Universal metadata — the backend-neutral entity model.
Human-in-the-loop — the review wizard's design.
Configuration → config.yml — the on-disk layout that ties the components together.

Architecture¶

High-level flow¶

Components¶

CLI shell (amx.cli + amx.cli_support)¶

Multi-agent orchestrator (amx.agents)¶

Search Agent (amx.search.agent)¶

Universal Metadata Interface (amx.core.metadata)¶

Database adapters (amx.db.adapters.*)¶

Storage layer (amx.storage)¶

LLM provider (amx.llm.provider)¶

Data flow: a single /run¶