Skip to content

Glossary (manifests & experiments)

Terms you will see in YAML files and CLI output every day. Each user guide also explains them in context; this page is the single reference list.

Term Meaning
Manifest A YAML file that declares an agent (agent.yaml), a multi-agent team (mas.yaml), an experiment (experiment.yaml), an overlay, a flavour, etc.
MAS Multi-agent system — several agents with a workflow (who talks to whom). Configured in a mas.yaml manifest.
Agent Single LLM actor with tools, skills, and plugins. Configured in an agent.yaml manifest.
Overlay A small YAML patch (kind: Overlay) merged on top of a manifest. CLI: -o path/to/overlay.yaml. Used to vary one knob (tools, memory, governance) without copying the whole file.
Flavour Deployment preset: model endpoint, instrumentation, transport. Referenced from experiments or config.yaml (e.g. standard:openai).
Lab (definition) A folder *.lab/ (under labs/) with experiment.yaml, datasets, overlays, and optional custom pipeline steps. Data/config, not code.
Package A published wheel (mas-runtime, mas-ctl, mas-lab, …). See packages reference.
Component One of the four lab sub-wheels under lab/components/ (mas-lab-core, mas-lab-bench, mas-lab-controller, mas-lab-content). The interactive tutorial runner (mas-lab-tutorial) is internal-only — see mas-lab-internal. Distinct from the internal mas.lab.* runtime modules — see ADR 0001.
Experiment The experiment: block (usually in experiment.yaml): what to run, how many times, and which pipeline builds results.
Scenario One column in an experiment matrix — a named setup (id) and which overlays apply. Declared under scenarios:.
Dataset Input items (prompts, expected fields) the experiment iterates over. Referenced as dataset: in experiment.yaml.
Run One execution of the agent/MAS for a given (scenario, dataset item, repeat index). Produces traces/events.jsonl.
n_runs How many times to repeat each (scenario, item) for variance.
Benchmark mas-lab benchmark run — executes all runs, then runs the pipeline.
Pipeline Ordered pipeline steps that read run artifacts and write CSV/PNG under results/.
Embedded pipeline The pipeline: list inside experiment.yaml — runs automatically after the benchmark execution phase.
Pipeline step One unit in the pipeline (e.g. extract_trace_stats, plotnine). Declared with name, type, config.
events.jsonl Machine-readable run log: one JSON object per line (model calls, tool calls, governance, routing).
Exchange log Human-readable transcript on stderr during mas-ctl chat (--trace). Not used for experiment scoring.
Trace cache Store of completed run logs; identical inputs reuse the cache instead of calling the model again.
Observability Settings that enable events.jsonl (manifest spec.observability, overlay, or --events).

Runtime internals: see the runtime/docs/ tree in the repository.

Hub: references/index.md · Run logs: cli/observability.md.