AI, Machine Learning and Computer Vision Meetup
Feb 20, 2025 at 10 AM Pacific

Register for the Zoom
By submitting you (1) agree to Voxel51’s Terms of Service and Privacy Statement and (2) agree to receive occasional emails.
Exploring DeepSeek’s Janus-Pro Visual Question Answer Capabilities

Harpreet Sahota
Voxel51
DeepSeek‘s Janus-Pro is an advanced multimodal model designed for both multimodal understanding and visual generation, with a particular emphasis on improvements in understanding tasks. The model’s architecture is built upon the concept of decoupled visual encoding, which allows it to handle the differing representation needs of these two types of tasks more effectively.
In this talk, we’ll explore Janus-Pro’s Visual Question Answer (VQA) capabilities using FiftyOne’s Janus-Pro VQA Plugin.
The plugin provides a seamless interface to Janus Pro’s visual question understanding capabilities within FiftyOne, offering:
- Vision-language tasks
- Hardware acceleration (CUDA/MPS) when available
- Dynamic version selection from HuggingFace
- Full integration with FiftyOne’s Dataset and UI
Can’t wait to see it for yourself? Check out the FiftyOne Quickstart with Janus-Pro.
About the Speaker
Harpreet Sahota is a hacker-in-residence and machine learning engineer with a passion for deep learning and generative AI. He’s got a deep interest in RAG, Agents, and Multimodal AI.
Getting the Most Out of FiftyOne Open-Source for Gen AI Workflows

Maxime Brénon
Finegrain
In this talk we’ll explore how we maximize the potential of the FiftyOne open source SDK and App to efficiently store and annotate training data critical to Finegrain‘s Generative AI workflows. We will provide an overview of our cloud-based storage and hosting architecture, showcase how we leverage FiftyOne for training and applying models for semi-automatic data annotation, and demonstrate how we extend the CVAT integration to enable pixel-perfect side-by-side evaluation of our Generative AI models.
About the Speaker
Maxime Brénon is a machine learning and data engineer. An Xoogler he started his machine learning journey at Moodstocks when AlexNet was all the rage.
Fine Tuning Moondream2

Parsa Khazaeepou
Moondream AI
Stay tuned for the talk abstract!
About the Speaker
Parsa Khazaeepoul is the Head of Developer Relations at Moondream AI, where he focuses on making computer vision more accessible. A Summa Cum Laude graduate of the University of Washington’s Informatics program, Parsa also spearheaded developer relations at the AI2 Incubator and co-founded Turing Minds, a renowned speaker series featuring Turing Award winners and other leading figures in computer science. His work has impacted thousands through projects like CourseFinder and uwRMP, and he’s a recognized innovator in the Seattle tech scene, named to the Seattle Inno Under 25 Class of 2024.
Find a Meetup Near You
Join 12,000+ AI and ML enthusiasts who have already become members
The goal of the AI, Machine Learning, and Data Science Meetup network is to bring together a community of data scientists, machine learning engineers, and open source enthusiasts who want to share and expand their knowledge of AI and complementary technologies. If that’s you, we invite you to join the Meetup closest to your timezone.