Notes by Lex Toumbourou

SkillsBench

Feb 23, 2026 permanent

See SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks