#riess
4 notes.
- Riess Research Arc — Evaluation Rigour as Research ProgrammeBPM/wiki/syntheses/riess-research-arc
- Study sketch — Concept drift in LLM-agent trajectoriesBPM/wiki/syntheses/study-sketch-agent-trajectory-drift
- Study sketch — SynBPS-APM: A controlled testbed for agentic BPMBPM/wiki/syntheses/study-sketch-synbps-apm
- Study sketch — Temporal consistency of LLM-agent runtime recommendationsBPM/wiki/syntheses/study-sketch-temporal-consistency-agents