From Research to Reality: Building GUI Agents That Actually Work - August 15, 2025
Aug 15, 2025
9 AM Pacific
Online. Register for the Zoom!
About this event
Welcome to the Visual Agents Workshop Series, your virtual pass to learning about visual agents: how they work, how to develop them, and how to fine-tune them.

Part 1: Navigating the GUI Agent Landscape

Understanding the Foundation Before Building
The GUI agent field is evolving rapidly, but success requires an understanding of what came before. In this opening session, we'll map the terrain of GUI agent research—from the early days of MiniWoB's simplified environments to today's complex, multimodal systems tackling real-world applications. You'll discover why standard vision models fail catastrophically on GUI tasks, explore the annotation bottlenecks that make GUI datasets so expensive to create, and understand the platform fragmentation that makes "click a button" mean twenty different things across datasets.
We'll dissect the most influential datasets (Mind2Web, AITW, Rico) and models that have shaped the field, examining their strengths, limitations, and the research gaps they reveal. By the end, you'll have a clear picture of where GUI agents excel, where they struggle, and, most importantly, where the opportunities lie for your own contributions.