Home About

Tags

MachineLearning (26) LinearAlgebra (16) GameDesign (12) ComputerScience (11) SoftwareEngineering (11) LargeLanguageModels (9) AudioEngineering (7) DiscreteMath (6) AutomatedTesting (6) Roblox (5) Zettelkasten (5) AgenticReasoning (4) More

Notes by Lex Toumbourou

Let It Wag

Oct 14, 2024 permanent

A dataset which contains least frequent "concepts" across various web scraped dataset. From paper No 'Zero-Shot' Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance.