The collection contains the artefacts used to do the analysis for the blogpost: Diversity Vs Density: A strategy comparison for fine-tuning VLMs
Akhil Theerthala PRO
Akhil-Theerthala
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 11 hours ago
Akhil-Theerthala/DesnityVsDiversity
commented on
their
article
about 12 hours ago
Diversity Vs Density: A strategy comparison for fine-tuning VLMs
replied to
their
post
about 12 hours ago
Is it better to show a model too many images once (Diversity), or extract as much information from a small set of images?
I have always wanted to do an ablation study on this and recently I got the chance to do exactly that. Why? In applied domains like robotics, manufacturing, or banking, we rarely have the luxury of internet-scale diverse image datasets. We are often "Data Poor" in terms of diversity but "Data Rich" in depth.
The takeaway? Density is efficient for facts but dangerous for reasoning (logical collapse) if you don't have larger scale data.
More details:
https://huggingface.co/blog/Akhil-Theerthala/diversity-density-for-vision-language-models