Blog

Toward Agents That Reason About Their Computation

A short summary of my paper calling for agents to be designed to reason about more of their computational processes.

Multi-ROM, Option execution, and interface for the Torch compiler

Adds support for multiple ROMs, Option execution, and an interface for the Torch compiler.

Technical Discussion Series: Cluster Efficiency

Three concrete techniques for getting more experiments done on shared clusters: checkpointing into short queues, vectorising agents with vmap, and overlapping environment stepping with agent updates.
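
As a rough illustration of the first technique, here is a minimal sketch of checkpointing so that a long experiment can run as a chain of short-queue jobs. It is not taken from the post itself. The function name `run_with_checkpoints`, the dict-shaped state, and the step-count budget (standing in for a wall-clock limit) are all assumptions made for the example.

```python
import os
import pickle


def run_with_checkpoints(total_steps, budget_steps, path):
    """Run at most `budget_steps` steps, resuming from `path` if it exists.

    Hypothetical helper: `budget_steps` stands in for a short queue's time limit.
    The experiment is finished once state["step"] == total_steps.
    """
    # Resume from a previous job's checkpoint, if there is one.
    if os.path.exists(path):
        with open(path, "rb") as f:
            state = pickle.load(f)
    else:
        state = {"step": 0, "total": 0.0}

    end = min(total_steps, state["step"] + budget_steps)
    while state["step"] < end:
        state["total"] += state["step"]  # stand-in for one unit of real work
        state["step"] += 1

    # Write to a temporary file and rename it, so a job killed mid-write
    # cannot leave a corrupted checkpoint behind.
    tmp = path + ".tmp"
    with open(tmp, "wb") as f:
        pickle.dump(state, f)
    os.replace(tmp, path)
    return state
```

Each short job would call this once with the same checkpoint path, and the job is resubmitted until the returned `state["step"]` reaches `total_steps`.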