Module X

Agents

Chapter I

From Text to Action

A model that only produces text is a tool for generating language.

Give that model the ability to do things in the world, to run a search, execute code, read a file, send a message, and something changes. The output is no longer just words. It is steps. And steps can be chained.

The model reads a situation, decides on an action, observes the result, and decides what to do next. Each step informs the next. A task that would require many separate human decisions can be handed off entirely.

This is a small change in architecture and a large change in what is possible. A language model was always a reasoning engine of a kind. Connecting it to the world turns that reasoning into action.