CVPR 2025 Insights #5: Dialogues in AI Research
As CVPR 2025 wrapped up in Nashville, a spark of inspiration stayed with us — not just from the keynotes, papers, and posters, but from the passionate individuals behind the science. That’s why we created the Best of CVPR Series: meetups and short interviews spotlighting the authors behind some of the most intriguing research presented this year. These weren’t just technical Q&As. We discussed career journeys, real-world impact, and dreams, and occasionally gave a shoutout to a mother who inspired it all.
Join us on July 9th, 10th, and 11th to meet these authors in our online meetup series and dive deeper into their groundbreaking work. Now let’s meet the researchers and their stories.
OpticalNet: Seeing Beyond the Diffraction Limit
Ruyi and Benquan from NTU Singapore and UT Austin are using AI to challenge physical limits in microscopy. Their fully software-based solution could disrupt semiconductor inspection with subwavelength imaging, eliminating the need for specialized optics. What excites them is that AI is beginning to push against fundamental physical limits. They’re calling on the CVPR community to collaborate using their rich experimental datasets.
“AI at the edge of Big Data really has the ability to challenge some physical laws.” — Benquan
Nonisotropic Gaussian Diffusion for Human Motion Prediction
Cecilia Corelli from TUM brings a deeply human angle to technical innovation. Her work enables predictive motion modeling in robots, which is essential for elderly care and autonomous vehicles. With a solid grasp of mathematics and genuine empathy, she reminds us that understanding AI means understanding your tools, not treating them like infallible gods.
“Know your data. Know your tools. And stay grounded.” — Cecilia
Interpretable Medical AI with Human-in-the-Loop Models
Huy from the University of Adelaide is building medical imaging tools that talk back to doctors. He’s bridging trust and performance in clinical AI by creating models that improve with expert feedback. Next up? Enabling doctors to literally chat with AI for collaborative diagnostics.
“The power of AI and human is complementary, not competitive.” — Huy
From Occlusion to Imagination: OFER’s Face Reconstruction Breakthrough
Pratheba of the Max Planck Institute is helping AI imagine faces behind occlusions using diffusion models. Initially sparked during her Microsoft internship in a COVID-era project, her work shows how inspiration meets technical depth. Her next goal? Tackling temporal consistency and audio-visual integration in videos.
“You fail at so many attempts, but you push through. Then suddenly… something clicks.” — Pratheba
Driving with Foundation Models: A Vision for Autonomous Cars
Tin, a PhD researcher at Porsche and KIT, is envisioning a “Knight Rider” world, where foundation models can comprehend traffic scenes with context and physics. His advice is deceptively simple: follow your gut and keep it fun.
“Think simple. Don’t let people talk you into a niche. Just do what makes sense for you.” — Tin
RANGE: Encoding the Earth’s Secrets
Ayush from Washington University is pioneering representation learning for geospatial data. His RANGE model achieves strong performance on downstream tasks by leveraging the unique characteristics of GPS-based data. He’s now moving into generative territory, asking: What if we could generate our planet’s patterns?
“Find exciting problems. Read widely. Look for the gaps. Then start filling them.” — Ayush
Few-Shot Agriculture Vision with Grounding DINO
Dr. Sudhir Sornapudi from Corteva Agriscience is a leading innovator in AgTech. His team fine-tuned vision-language models for crop detection with only a handful of labels. Next? Optimizing for edge devices and applying LoRA techniques. He invites researchers to explore AgTech, where real-world impact meets scientific challenge.
“Ag is complex — and that’s what makes it beautiful. Come work with biologists and change the world.” — Sudhir
SmartHome-Bench: Multimodal AI for Real-World Anomalies
Xinyi & Congjing from the University of Washington created a benchmark for video anomaly detection using multimodal LLMs. Their work, rooted in user-centric design, achieved an 11% performance boost. Their next step? Personalizing anomaly detection based on household-specific preferences.
“Don’t just study what happened. Study why it happened.” — Xinyi & Congjing
Multi-View Anomaly Detection with Normalizing Flows
Mathis from Leibniz University Hannover is adding a new dimension — literally — to anomaly detection. His model achieves higher accuracy and better object understanding by leveraging multiple views. The next stop is flow matching and even 3D modeling.
“Pick a topic you’re passionate about. Then it won’t feel like work.” — Mathis
CLIP++: Scaling Down to Go Further
Rui from TUM trained CLIP-like models with synthetic captions and just 30 million images, outperforming the original at scale. His work unlocks new fine-grained representations for large language models and multimodal tasks.
“You don’t need a billion images. You need a clever setup.” — Rui
Maxing Out Medical Benchmarks
Max Gutbrod from OTH Regensburg introduced a new benchmark for out-of-distribution detection in medical imaging. His work reveals that what works in natural images might not translate to healthcare. Max is now returning to the core challenge — creating more trustworthy detection methods in medicine.
“Medical imaging needs fairness, trust, and robust benchmarks. That’s where the real impact lies.” — Max
Why This Series Matters
This series is more than just research summaries. It’s about people. Their stories, setbacks, motivations, and advice for the next generation. Whether you’re a seasoned researcher, an industry practitioner, or a student exploring AI, these voices offer inspiration and insight.
Join Us for the Best of CVPR Series Meetups
What’s Next?
If you’re interested in following along as I dive deeper into the world of AI and continue to grow professionally, feel free to connect or follow me on LinkedIn. Let’s inspire each other to embrace change and reach new heights!