Markov Decision Process (MDP)
A mathematical framework for modelling decision-making under uncertainty
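As a rough illustration, an MDP is commonly specified by states, actions, transition probabilities, rewards, and a discount factor. The sketch below runs value iteration on a made-up two-state MDP; the state names, rewards, `gamma`, and the `value_iteration` helper are all assumptions chosen for the example.

```python
# Hypothetical MDP: transitions[state][action] = [(probability, next_state, reward), ...]
transitions = {
    "cool": {
        "slow": [(1.0, "cool", 1.0)],
        "fast": [(0.5, "cool", 2.0), (0.5, "hot", 2.0)],
    },
    "hot": {
        "slow": [(1.0, "cool", 0.0)],
        "fast": [(1.0, "hot", -10.0)],
    },
}


def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Return the optimal state values for a small tabular MDP."""
    values = {state: 0.0 for state in transitions}
    while True:
        new_values = {}
        for state, actions in transitions.items():
            # Bellman optimality update: best expected return over all actions.
            new_values[state] = max(
                sum(p * (reward + gamma * values[nxt]) for p, nxt, reward in outcomes)
                for outcomes in actions.values()
            )
        if max(abs(new_values[s] - values[s]) for s in values) < tol:
            return new_values
        values = new_values


values = value_iteration(transitions)
print(values)
```

The uncertainty lives in the `fast` action from `cool`, which lands in either state with equal probability.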
a tree data structure in which each non-leaf node contains the hash of its child nodes
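A minimal sketch of this idea, assuming SHA-256 and the convention of duplicating the last hash when a level has an odd number of nodes (conventions differ between systems); `merkle_root` is a hypothetical helper name.

```python
import hashlib


def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(leaves: list[bytes]) -> bytes:
    """Hash the leaves, then repeatedly hash pairs of child hashes up to the root."""
    if not leaves:
        raise ValueError("need at least one leaf")
    level = [sha256(leaf) for leaf in leaves]
    while len(level) > 1:
        if len(level) % 2 == 1:
            # Assumption: pad an odd level by repeating its last hash.
            level.append(level[-1])
        # Each parent stores the hash of its two children's hashes.
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]


root = merkle_root([b"tx1", b"tx2", b"tx3"])
print(root.hex())
```

Changing any leaf changes the root, which is what makes the structure useful for integrity checks.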
a connected subgraph of a connected graph that contains every vertex and no cycles
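One way to obtain such a subgraph is a breadth-first traversal that keeps only the edge used to first reach each vertex. The sketch below assumes an undirected, connected graph stored as an adjacency list; the example graph and the `spanning_tree` name are illustrative.

```python
from collections import deque


def spanning_tree(graph: dict[str, list[str]], root: str) -> list[tuple[str, str]]:
    """Return the edges of a BFS spanning tree of a connected graph."""
    visited = {root}
    edges = []
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in visited:
                # Keep only the edge that first discovers v, so no cycle can form.
                visited.add(v)
                edges.append((u, v))
                queue.append(v)
    return edges


graph = {
    "A": ["B", "C"],
    "B": ["A", "C", "D"],
    "C": ["A", "B", "D"],
    "D": ["B", "C"],
}
tree = spanning_tree(graph, "A")
print(tree)
```

A spanning tree of a graph with n vertices always has n - 1 edges, which gives a quick sanity check.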
a distribution-based sorting algorithm that works by dividing elements into buckets
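A minimal sketch of the bucket idea, assuming numeric input, equal-width buckets over the observed range, and Python's built-in `sorted` for ordering each bucket; `num_buckets` is an arbitrary choice.

```python
def bucket_sort(values: list[float], num_buckets: int = 10) -> list[float]:
    """Sort numbers by scattering them into range buckets, then sorting each bucket."""
    if not values:
        return []
    lo, hi = min(values), max(values)
    if lo == hi:
        return list(values)  # all values equal, nothing to reorder
    width = (hi - lo) / num_buckets
    buckets = [[] for _ in range(num_buckets)]
    for v in values:
        # Clamp so the maximum value falls into the last bucket.
        idx = min(int((v - lo) / width), num_buckets - 1)
        buckets[idx].append(v)
    result = []
    for bucket in buckets:
        result.extend(sorted(bucket))
    return result


data = [0.42, 0.32, 0.33, 0.52, 0.37, 0.47, 0.51]
print(bucket_sort(data))
```

The approach works best when values are spread fairly evenly, so that each bucket stays small.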
a parameter that scales the logits before softmax, controlling how peaked (confident) the output distribution is
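The effect can be seen by dividing the logits by the parameter before applying softmax. This is a minimal sketch in plain Python; the logits and temperature values are made up for illustration.

```python
import math


def softmax(logits: list[float], temperature: float = 1.0) -> list[float]:
    """Softmax over logits scaled by 1 / temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]
sharp = softmax(logits, temperature=0.5)    # low temperature: more peaked
default = softmax(logits, temperature=1.0)
flat = softmax(logits, temperature=5.0)     # high temperature: closer to uniform
print(sharp, default, flat)
```

Low values concentrate probability on the largest logit, while high values push the distribution toward uniform.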
Routes LLM tasks to cheaper or more powerful models based on task novelty.
improve zero-shot performance of LLMs by appending “Let’s think step by step” to the prompt, before the model generates its answer
improve the Encoder/Decoder alignment with an Attention Mechanism
a prompting and fine-tuning method that enables LLMs to engage in a "thinking" process before generating responses