Chain-of-Thought Reasoning

Chain-of-Thought Reasoning is an LLM technique in which the model reasons in token space before producing its final answer. It was originally described as a prompting technique, Chain-of-Thought Prompting, in which the model was given few-shot input/output examples that included intermediate reasoning. Later, with the introduction of models like OpenAI's o1 and DeepSeek-R1-Zero, models could perform this reasoning without few-shot examples: either by learning to reason during a fine-tuning step (adding reasoning steps to the training data) or via reinforcement learning, where the model is rewarded for applying a thinking process before returning an output.
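As a minimal sketch of the few-shot prompting variant, the prompt below includes a worked example whose answer spells out the intermediate reasoning, so the model is nudged to produce similar step-by-step reasoning before its own answer. The example question and the `build_cot_prompt` helper are illustrative, not from any specific paper or library.

```python
# Few-shot Chain-of-Thought prompting sketch: each demonstration pairs a
# question with an answer that shows its intermediate reasoning steps.
FEW_SHOT_EXAMPLES = [
    {
        "question": "Roger has 5 tennis balls. He buys 2 cans of 3 balls each. "
                    "How many balls does he have now?",
        "reasoning": "Roger starts with 5 balls. 2 cans of 3 balls is 6 balls. "
                     "5 + 6 = 11.",
        "answer": "11",
    },
]

def build_cot_prompt(question: str) -> str:
    """Assemble a few-shot prompt whose examples include reasoning steps."""
    parts = []
    for ex in FEW_SHOT_EXAMPLES:
        parts.append(f"Q: {ex['question']}")
        parts.append(f"A: {ex['reasoning']} The answer is {ex['answer']}.")
    parts.append(f"Q: {question}")
    parts.append("A:")  # the model continues here, ideally reasoning first
    return "\n".join(parts)

prompt = build_cot_prompt(
    "A cafeteria had 23 apples. It used 20 and bought 6 more. "
    "How many apples does it have now?"
)
print(prompt)
```

The same idea carries over to the zero-shot setting by dropping the examples and appending a cue such as "Let's think step by step" after the question.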