NotesByLex.com

Playing Atari with Deep Reinforcement Learning

May 05, 2025 reference/papers ReinforcementLearning GamePlayingAI

a classic paper applying neural networks to RL for game playing

Read More
Q-Learning

Apr 18, 2025 note ReinforcementLearning

a reinforcement learning algorithm for finding optimal policies

Read More
Markov Decision Process (MDP)

Mar 29, 2025 note ReinforcementLearning

A mathematical framework for modelling decision-making under uncertainty

Read More
Merkle Tree

Mar 15, 2025 note DataStructures Cryptography

a data structure where each node contains the hash of its child nodes

Read More
RSA

Mar 01, 2025 note Cryptography ComputerSecurity

a public-key encryption system reliant on the practical difficulty of factorising large numbers

Read More
Spanning Tree

Feb 16, 2025 note ComputerScience GraphTheory

a sub graph of a connected graph that contains all vertices, but no cycles

Read More
Bucket Sort

Feb 15, 2025 note ComputerScience SortingAlgorithms

a distribution-based sorting algorithm that works by dividing elements into buckets

Read More
Temperature Scaling

Jan 14, 2025 note LargeLanguageModels MachineLearning

a parameter that controls how confident Softmax predictions are

Read More
Few-Shot Knowledge-Distillation

Jan 12, 2025 note LLMPerformance LargeLanguageModels

Routes LLM tasks to cheaper or more powerful models based on task novelty.

Read More
Large Language Models are Zero-Shot Reasoners (May 2022)

Jan 08, 2025 reference/papers LargeLanguageModels PromptingTechniques

improve zero-shot prompt performance of LLMs by adding “Let’s think step by step” before each answer

Read More

All Notes