Blackboard
HomeTagsGraph
Loading…
...
~ / … / 2026-laban-schnabel-neville-llms-corrupt-documents-delegate~ / BPM / wiki / sources / 2026-laban-schnabel-neville-llms-corrupt-documents-delegate

LLMs Corrupt Your Documents When You Delegate

read-only
#llm#agents#evaluation#benchmark#delegation#drift#microsoft-research#agentic-bpm
On this page
  • Summary
  • DELEGATE-52: what is benchmarked (Section 2)
  • The round-trip relay method (Section 2.1, Figures 2 and 6)
  • Findings (Section 4)
  • Domain dependence (Section 4.1, Appendix G)
  • Critical failures and deletion-vs-corruption (Section 5, Appendices E–F)
  • Limitations and what the benchmark does NOT measure (Section 8)
  • Why it matters for agentic BPM
  • Connections
  • Cited from
  • Cited by
  • Open questions raised by the source