reference category - Notes by Lex

Articles in the reference category

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

May 12, 2025 reference/papers ReinforcementLearning ReasoningModels LargeLanguageModels

a reinforced self-play method in which a model learns to reason without any human-annotated data

Playing Atari with Deep Reinforcement Learning

May 05, 2025 reference/papers ReinforcementLearning GamePlayingAI

a classic paper applying deep neural networks to reinforcement learning for playing Atari games

Large Language Models are Zero-Shot Reasoners (May 2022)

Jan 08, 2025 reference/papers LargeLanguageModels PromptingTechniques

improve zero-shot prompt performance of LLMs by adding “Let’s think step by step” before each answer

Neural Machine Translation by Jointly Learning to Align and Translate (Sep 2014)

Oct 28, 2024 reference/papers AttentionMechanism

improve the Encoder/Decoder alignment with an Attention Mechanism

Thinking LLMs: General Instruction Following with Thought Generation (Oct 2024)

Oct 16, 2024 reference/papers AgenticReasoning System2Prompting

a prompting and fine-tuning method that enables LLMs to engage in a "thinking" process before generating responses

Mixtral of Experts (Jan 2024)

Oct 15, 2024 reference/papers

a Sparse Mixture of Experts (SMoE) language model

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Oct 14, 2024 reference/papers AgenticReasoning LargeLanguageModels

a comprehensive evaluation of o1-preview across many tasks and domains

AI Meets the Classroom: When Does ChatGPT Harm Learning?

Oct 13, 2024 reference/papers LearningWithAI LearningandTeaching

LLMs can help and also hinder learning outcomes

No 'Zero-Shot' Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

Oct 03, 2024 reference/papers DatasetConcepts

a paper showing that a multimodal model needs exponentially more pretraining examples of a concept to achieve linear improvements in zero-shot performance on it

Neural Codec Language Models are Zero-Shot Text-to-Speech Synthesizers

Jan 31, 2024 reference/papers MachineLearning AudioEngineering SpeechSynthesis

VALL-E can generate speech in anyone's voice with only a 3-second sample of the speaker and some text

