Josh Palmer
7a6c40872d
Agents: add system prompt safety guardrails (#5445)
* 🤖 agents: add system prompt safety guardrails
What:
- add safety guardrails to system prompt
- update system prompt docs
- update prompt tests
Why:
- discourage power-seeking or self-modification behavior
- clarify safety/oversight priority when conflicts arise
Tests:
- pnpm lint (pass)
- pnpm build (fails: DefaultResourceLoader missing in pi-coding-agent)
- pnpm test (not run; build failed)
* 🤖 agents: tighten safety wording for prompt guardrails
What:
- scope safety wording to system prompts/safety/tool policy changes
- document Safety inclusion in minimal prompt mode
- update safety prompt tests
Why:
- avoid blocking normal code changes or PR workflows
- keep prompt mode docs consistent with implementation
Tests:
- pnpm lint (pass)
- pnpm build (fails: DefaultResourceLoader missing in pi-coding-agent)
- pnpm test (not run; build failed)
* 🤖 docs: note safety guardrails are soft
What:
- document system prompt safety guardrails as advisory
- add security note on prompt guardrails vs hard controls
Why:
- clarify threat model and operator expectations
- avoid implying prompt text is an enforcement layer
Tests:
- pnpm lint (pass)
- pnpm build (fails: DefaultResourceLoader missing in pi-coding-agent)
- pnpm test (not run; build failed)
2026-01-31 15:50:15 +01:00
..
2026-01-31 16:19:20 +09:00
2026-01-31 15:50:15 +01:00
2026-01-31 07:59:01 +00:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 18:31:49 +09:00
2026-01-31 12:42:45 +01:00
2026-01-30 03:16:21 +01:00
2026-01-31 12:42:45 +01:00
2026-01-31 16:19:20 +09:00
2026-01-31 18:31:49 +09:00
2026-01-31 21:57:21 +09:00
2026-01-26 17:39:10 -08:00
2026-01-31 21:57:21 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 17:12:28 +09:00
2026-01-31 18:31:49 +09:00
2026-01-31 21:57:21 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:38:03 +09:00
2026-01-31 16:19:20 +09:00
2026-01-30 03:16:21 +01:00
2026-01-31 18:31:49 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-30 03:16:21 +01:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 21:57:21 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-30 03:16:21 +01:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:04:04 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 21:57:21 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 07:54:15 +00:00
2026-01-13 08:40:39 +00:00
2026-01-13 08:40:39 +00:00
2026-01-30 03:16:21 +01:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-14 15:02:19 +00:00
2026-01-30 03:16:21 +01:00
2026-01-30 03:16:21 +01:00
2026-01-31 16:19:20 +09:00
2026-01-18 19:34:21 +00:00
2026-01-14 15:02:19 +00:00
2026-01-14 15:02:19 +00:00
2026-01-30 03:16:21 +01:00
2026-01-18 03:55:14 +00:00
2026-01-31 16:19:20 +09:00
2026-01-31 16:19:20 +09:00
2026-01-30 03:16:21 +01:00