Articles in the reference category
-
-
-
-
NoProp: Training Neural Networks Without Back-Propagation or Forward-Propagation
an alternative training method to backprop that does local layer learning
-
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
learn to reason without any human-annotated data.
-
Playing Atari with Deep Reinforcement Learning
a classic paper applying neural networks to RL for game playing
-
Large Language Models are Zero-Shot Reasoners (May 2022)
improve zero-shot prompt performance of LLMs by adding “Let’s think step by step” before each answer
-
Neural Machine Translation by Jointly Learning to Align and Translate (Sep 2014)
improve the Encoder/Decoder alignment with an Attention Mechanism
-
Thinking LLMs: General Instruction Following with Thought Generation (Oct 2024)
a prompting and fine-tuning method that enables LLMs to engage in a "thinking" process before generating responses
-