Learning to Reason without External Rewards
aka Self-Confidence is All You Need
aka Self-Confidence is All You Need
on John Carmack's Upperbound 25 Talk Notes
a few comparisons of Google's Imagen 4 vs OpenAI's gpt-image-1
Using evolutionary algorithms with LLM-coding agents
an alternative training method to backprop that does local layer learning
learn to reason without any human-annotated data.
a classic paper applying neural networks to RL for game playing
a reinforcement learning algorithm for finding optimal policies
A mathematical framework for modelling decision-making under uncertainty
a data structure where each node contains the hash of its child nodes