A short summary of my paper calling for agents to be designed to reason about more of their computational processes.
Support for Multiple ROMs, Option execution, and an interface for the Torch compiler.
Three concrete techniques for getting more experiments done on shared clusters: checkpointing into short queues, vectorising agents with vmap, and overlapping environment stepping with agent updates.