[R] Adopting a human developmental visual diet yields robust, shape-based AI vision

Happy to announce a new project from the lab: “Adopting a human developmental visual diet yields robust, shape-based AI vision”. It is an exciting case where brain inspiration profoundly changed and improved deep neural network representations for computer vision.

Link: https://arxiv.org/abs/2507.03168

The idea: instead of high-fidelity training from the get-go (the de facto gold standard), we simulate visual development from newborn to 25 years of age by synthesising decades of developmental vision research into an AI preprocessing pipeline (Developmental Visual Diet – DVD).

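To give a rough feel for what such a "developmental diet" preprocessing step could look like, here is a minimal, hypothetical sketch in Python: it degrades acuity, colour, and contrast as a function of simulated age. The function name, schedules, and parameters below are illustrative assumptions, not the ones used in the paper (see the arXiv link for the actual pipeline).

```python
# Hypothetical sketch of an age-dependent "visual diet" transform.
# All schedules below (blur, saturation, contrast vs. simulated age)
# are illustrative assumptions, not the paper's parameters.
from PIL import Image, ImageEnhance, ImageFilter

def dvd_transform(img: Image.Image, age_years: float, max_age: float = 25.0) -> Image.Image:
    """Degrade an image to mimic immature vision at a given simulated age."""
    # Progress from newborn-like (0.0) to adult (1.0) vision.
    p = min(max(age_years / max_age, 0.0), 1.0)

    # Acuity: heavy blur early in "development", sharpening toward adulthood.
    blur_radius = 4.0 * (1.0 - p)  # assumed schedule
    img = img.filter(ImageFilter.GaussianBlur(blur_radius))

    # Colour sensitivity: start near-greyscale, reach full saturation later.
    img = ImageEnhance.Color(img).enhance(0.2 + 0.8 * p)  # assumed schedule

    # Contrast sensitivity: reduced contrast early on.
    img = ImageEnhance.Contrast(img).enhance(0.5 + 0.5 * p)  # assumed schedule
    return img

# During training, the simulated age would typically increase over epochs,
# so early epochs see "newborn-like" images and later epochs see full fidelity.
```
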
We then test the resulting NNs across a range of conditions, each selected because it is challenging for AI:

  1. shape-texture bias (see the sketch after this list)
  2. recognising abstract shapes embedded in complex backgrounds
  3. robustness to image perturbations
  4. adversarial robustness.

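For context on condition 1: shape-texture bias is commonly measured with cue-conflict stimuli, i.e. images whose shape indicates one class and whose texture indicates another (in the spirit of Geirhos et al.). The sketch below shows that standard metric; it is an assumption about the evaluation, not necessarily this paper's exact protocol.

```python
# Minimal sketch of the standard cue-conflict shape-bias metric.
# Each stimulus has a shape label and a conflicting texture label;
# shape bias is the fraction of shape-consistent answers among trials
# where the model picked either cue.
def shape_bias(predictions, shape_labels, texture_labels):
    shape_hits = texture_hits = 0
    for pred, shape, texture in zip(predictions, shape_labels, texture_labels):
        if pred == shape:
            shape_hits += 1
        elif pred == texture:
            texture_hits += 1
    decided = shape_hits + texture_hits
    return shape_hits / decided if decided else float("nan")

# Example: 3 shape-consistent and 1 texture-consistent decision -> bias of 0.75.
print(shape_bias(["cat", "dog", "car", "elephant"],
                 ["cat", "dog", "car", "bird"],
                 ["knife", "clock", "bottle", "elephant"]))
```
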
We report a new SOTA on shape bias (reaching human level), outperform AI foundation models at abstract shape recognition, show better alignment with human behaviour under image degradations, and achieve improved robustness to adversarial noise – all with this one preprocessing trick.

This is observed across all conditions tested, and generalises across training datasets and multiple model architectures.

We are excited about this because DVD may offer a resource-efficient path toward safer, perhaps more human-aligned AI vision. This work suggests that biology, neuroscience, and psychology have much to offer in guiding the next generation of artificial intelligence.

