Playing Atari with Deep Reinforcement Learning

a classic paper applying neural networks to RL for game playing
a classic paper applying neural networks to RL for game playing
A mathematical framework for modelling decision-making under uncertainty
a distribution-based sorting algorithm that works by dividing elements into buckets
Routes LLM tasks to cheaper or more powerful models based on task novelty.
improve zero-shot prompt performance of LLMs by adding “Let’s think step by step” before each answer