Ryan Lee

⚠️ I am in the progress of migrating my website. Thank you for your patience!

Preview graphic for Agents in Practice #6

Agents in Practice #6: Patient Agents and Databricks’ Coding Benchmark

llm

agents

agents-in-practice

Agents in Practice #6 on a benchmark that tests an agent’s ability to wait and detect changes, and the Databricks team’s findings from its internal coding benchmark

Jul 30, 2026

Ryan Lee

Three progressively smaller blue brain forms compressing into an orange module above a compact local computer

Agents in Practice #5: Proactive Memory Agent, Unreliable Benchmarks, and the Rise of Local LLMs

llm

agents

agents-in-practice

Agents in Practice #5 on a memory agent that learns when to intervene, audits uncovering flaws in coding benchmarks, and a personal anecdote about increasingly capable local LLMs.

Jul 23, 2026

Ryan Lee

Open research notebook with a single connected spiral line representing self-referencing agent research logs

Agents in Practice #4: Coding Agent Latents, Test-Writing Benchmark, and AI Scientist Collapse

llm

agents

agents-in-practice

Agents in Practice #4 on coding-agent latent space, a new benchmark for test-writing agents, and a personal anecdote about AI Scientist collapse.

Jul 9, 2026

Ryan Lee

Preview graphic for Agents in Practice #3

Agents in Practice #3: hacking benchmarks, failure attribution, and token-profiling trajectories

llm

agents

agents-in-practice

Agents in Practice #3 on hacking well-known agent benchmarks without solving tasks, a new benchmark for failure attribution, and a personal anecdote on profiling trajectories for token usage.

Jul 2, 2026

Ryan Lee

Preview graphic for Agents in Practice #2

Agents in Practice #2: Specialized Repo Explorer, Agentic Resource Discovery, and Personal Benchmark

llm

agents

agents-in-practice

Agents in Practice #2 on the efficiency of specialized repo explorers for coding agents, the Agentic Resource Discovery protocol for discovering agentic resources, and a personal anecdote about benchmarking agentic models.

Jun 25, 2026

Ryan Lee

Preview graphic for Agents in Practice #1

Agents in Practice #1: Learning from Rollouts, Long-Horizon Coding, and Memory Guardrails

llm

agents

agents-in-practice

Agents in Practice #1 on agents learning from rollouts, long-horizon coding benchmarks, and an anecdote about keeping agent memory from becoming always-loaded clutter.

Jun 18, 2026

Ryan Lee