Articles in the permanent category
Few-Shot Knowledge-Distillation
Routes LLM tasks to cheaper or more powerful models based on task novelty.
Waffle Chart
A data visualization that uses squares on a 2D grid to represent proportions.
Scaled Dot-Product Attention
A method of computing a token representation that includes the context of surrounding tokens.
Gale-Shapley Algorithm
An algorithm that matches two equally sized groups based on preferences.
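
As a rough illustration of the matching idea in the last entry, here is a minimal Python sketch of the proposer-driven form of Gale-Shapley. The function name `gale_shapley`, the dictionary-based input format, and the sample names (`a`, `b`, `x`, `y`) are assumptions made for this example and are not taken from the article. It also assumes both groups have the same size and that every member ranks every member of the other group.

```python
from collections import deque


def gale_shapley(proposer_prefs, receiver_prefs):
    """Return a stable matching as {proposer: receiver}.

    Both arguments map a member to a complete preference list over the
    other group, most preferred first (hypothetical input format).
    """
    # rank[r][p] is the position of proposer p in receiver r's list (lower is better).
    rank = {
        r: {p: i for i, p in enumerate(prefs)}
        for r, prefs in receiver_prefs.items()
    }
    next_choice = {p: 0 for p in proposer_prefs}  # index of the next receiver to try
    engaged_to = {}  # receiver -> proposer currently held
    free = deque(proposer_prefs)  # proposers without a partner

    while free:
        p = free.popleft()
        r = proposer_prefs[p][next_choice[p]]
        next_choice[p] += 1
        current = engaged_to.get(r)
        if current is None:
            # Receiver is unmatched, so it accepts provisionally.
            engaged_to[r] = p
        elif rank[r][p] < rank[r][current]:
            # Receiver prefers the new proposer and releases the old one.
            engaged_to[r] = p
            free.append(current)
        else:
            # Receiver rejects; the proposer tries its next choice later.
            free.append(p)

    return {p: r for r, p in engaged_to.items()}


# Made-up preferences for two groups of two.
proposers = {"a": ["x", "y"], "b": ["x", "y"]}
receivers = {"x": ["b", "a"], "y": ["a", "b"]}
matching = gale_shapley(proposers, receivers)
```

With these sample preferences both proposers want `x` first, `x` prefers `b`, and so `a` ends up matched with `y`.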