No 'Zero-Shot' Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
a paper that shows a model needs to see a concept exponentially more times to achieve linear improvements
a paper that shows a model needs to see a concept exponentially more times to achieve linear improvements