Module X
Agents
Chapter I
From Text to Action
A model that only produces text is a tool for generating language.
Give that model the ability to do things in the world, to run a search, execute code, read a file, send a message, and something changes. The output is no longer just words. It is steps. And steps can be chained.
The model reads a situation, decides on an action, observes the result, and decides what to do next. Each step informs the next. A task that would require many separate human decisions can be handed off entirely.
This is a small change in architecture and a large change in what is possible. A language model was always a reasoning engine of a kind. Connecting it to the world turns that reasoning into action.