MAS-Lab paper¶

MAS-Lab: A Specification-Driven Validation Framework for Reliable Multi-Agent Systems

The article introduces the specification-driven model behind MAS-Lab. Section 5 reports experiments that are reproduced in this repository as three runnable labs.

What you can reproduce¶

Lab	Paper section	What it explores
`design-space.lab`	§5.1	Design patterns and team topologies
`lifecycle-control.lab`	§5.2	Governance and lifecycle control
`extensions.lab`	§5.3	Memory and context overlays

Source under labs/ in the repository.

Each lab is a single benchmark experiment. Run it, and MAS-Lab produces the figures and tables reported in the paper — charts, metrics, and comparison reports — without separate plotting scripts.

Hands-on introduction: Tutorial 3 — Run an experiment.

Quick reproduction¶

After Tutorial 0 (install and LLM access), from the repository root:

task reproduce

This runs all Section 5 experiments in sequence. To run one lab:

mas-lab benchmark run labs/design-space.lab/01-design-patterns/experiment.yaml --progress

Completed runs are cached — re-running refreshes reports without repeating LLM calls when inputs are unchanged. See User configuration for output locations.

Cite MAS-Lab¶

When you use the framework, benchmarks, or methodology, cite the paper — not only this Git Repository.

Authors: Jordan Augé, Giovanna Carofiglio, Giulio Grassi, Jacques Samain (Cisco Systems / Outshift)

arXiv: link will be posted here when the preprint is public. Watch outshift-open/mas-lab releases or this documentation site.

@article{maslab2026,
  title   = {{MAS-Lab}: A Specification-Driven Validation Framework for Reliable Multi-Agent Systems},
  author  = {Aug{\'e}, Jordan and Carofiglio, Giovanna and Grassi, Giulio and Samain, Jacques},
  year    = {2026},
  note    = {arXiv preprint, forthcoming},
  url     = {https://github.com/outshift-open/mas-lab}
}

Update note and url with the arXiv identifier when available.

MAS-Lab paper¶

What you can reproduce¶

Quick reproduction¶

Cite MAS-Lab¶

See also¶