Register for the event
In-person
EMEA
Meetups

Zurich AI, ML and Computer Vision Meetup - July 30, 2026

Jul 30, 2026
5:30 PM - 8:30 PM CEST
Westhive, Hardturmstrasse 161, 8005 Zürich, Switzerland
Speakers
About this event
Join our in-person meetup to hear talks from experts on cutting-edge topics across AI, ML, and computer vision. View more Computer Vision events here.
Schedule
Building Real-World Computer Vision Systems with Voxel51
This talk will explore practical workflows for building, evaluating, and improving modern computer vision systems. We'll dive into real-world approaches to dataset curation, model analysis, multimodal AI workflows, and production-ready vision pipelines using open-source technologies.
The session is designed for engineers, researchers, and AI practitioners looking to better understand how teams are developing and scaling computer vision applications today. Expect practical demos, technical insights, and discussions around the evolving AI tooling ecosystem.
Creating Micro Drama videos using Apertus, the Swiss Open Weights LLM
Independent authors struggle to reach younger audiences in the short-form video era, yet typical AI video generation remains a "prompt and pray" manual process. This session introduces ReelaTales, an agentic multimedia production pipeline that acts as an autonomous AI director to convert static manuscripts into cinematic, social-ready video trailers.
We will dive into the architecture of our orchestration fleet, detailing how we moved beyond third-party dependencies to leverage Apertus, the open-weights Swiss LLM, running on local infrastructure to handle semantic text analysis, emotional beat mapping, and cinematic shot choreography. Attendees will walk away with practical insights on chaining local foundation models with heavy video APIs to maintain strict narrative integrity and character consistency across automated production pipelines.
Beyond the hype: deploying foundation models for enterprise visual inspection
Automated visual inspection represents a massive market opportunity with the profound potential to save lives. In this talk, I will highlight recent success stories driven by algorithmic advancements in foundation models for visual inspection. I will then explore two real-world use cases (medical diagnostics and civil infrastructure monitoring) that IBM Research has tackled over the past few years.
Finally, I will conclude by outlining the remaining open challenges in deploying these systems to inspire future work within the community.
Agentic RAG and the complexity of navigating the unknown
Traditional Agentic RAG is great when we know what we’re looking for, or when the LLM knows how to plan the task. But what happens when our questions sound more like: Does a falling apple have anything to do with why we walk? Would AI, specifically Agentic RAG, know where to begin?
LLMs are promising, but they still have limits: they can over-rely on semantic similarity and miss the strange, indirect connections that often drive discovery. Innovation means navigating new territory, where the unknown is not a bug but the constant.
How do we use the technology we have to explore what we don’t yet fully know? How do we build AI systems that help us get deep insights into the topic we are interested on.