September 18th, 2024

When computer vision works more like a brain, it sees more like people do

MIT researchers developed a computer vision model mimicking human brain processing, trained on monkey IT cortex data, enhancing object recognition and resistance to adversarial attacks, promoting collaboration between neuroscience and AI.

Read original articleLink Icon
When computer vision works more like a brain, it sees more like people do

Researchers at MIT have developed a computer vision model that mimics the brain's processing of visual information, specifically targeting the inferior temporal (IT) cortex, which is crucial for object recognition in humans and primates. Led by Professor James DiCarlo, the team trained an artificial neural network using data from the IT cortex of monkeys, resulting in a model that not only improved its ability to identify objects but also aligned more closely with human visual perception. This neurally aligned model demonstrated enhanced robustness against adversarial attacks—small distortions in images designed to confuse AI systems—indicating a more human-like processing capability. The findings suggest that integrating biological principles into AI development can yield significant advancements in computer vision, benefiting both fields by providing insights into human vision mechanisms and improving AI robustness. The research highlights the potential for a collaborative exchange between neuroscience and artificial intelligence, fostering progress in understanding and replicating human-like vision in machines.

- MIT researchers have created a computer vision model that mimics human brain processing.

- The model was trained using neural data from the monkey IT cortex, enhancing object recognition.

- It showed improved resistance to adversarial attacks compared to standard models.

- The study emphasizes the benefits of integrating neuroscience insights into AI development.

- This research fosters collaboration between neuroscience and artificial intelligence fields.

Link Icon 2 comments
By @mikewarot - 7 months
Tangent: We don't see the way we commonly think we do.

Unlike the human vision system, there's no mention of the fovea, the foveated gaze which forces us to spend attention and build a mental model of the world, nor the need to focus on objects at various distances. While it's astounding how well deep networks have handled image input, it's definitely not like that of a human, or most other animals.

By @gary_0 - 7 months
(2023)