Safety Scope
Mesmer is intended for authorized red-team work, defensive evaluation, and reproducible safety research.
Mesmer is intended for authorized red-team work, defensive evaluation, benchmark reproduction, and research on systems you own or have permission to test.
Public examples use benign canary-style objectives by default. Paper examples can load their original datasets for reproducibility when the user explicitly chooses to run them.
Use Mesmer to make LLM safety testing more inspectable:
- Keep objectives and targets explicit.
- Preserve replay messages and judgement details.
- Record logs and state transitions.
- Compare techniques under shared budgets.
- Avoid hiding target interaction in one-off scripts.
Do not use Mesmer against systems you do not own or have permission to test.