W375 Westgate Building
10:00AM
Modern users face an unprecedented challenge: critical knowledge is distributed across massive, heterogeneous, and constantly evolving data sources, from the web to scientific literature and structured databases. While search engines excel at retrieving relevant documents and large language models (LLMs) excel at language understanding and reasoning, neither alone can reliably support complex, multi-step information-seeking tasks that require searching, reasoning, and acting in concert.
In this talk, I present my research on the science of information-seeking intelligence—a principled framework for building intelligent systems that can autonomously index large-scale data, reason over retrieved information, and take goal-directed actions in complex environments. I will describe three layers of contributions.
First, I introduce architecture foundations for modeling heterogeneous, structure-rich data, showing how jointly representing text and structural signals leads to more effective indexing and understanding. Second, I establish the core capabilities required for information-seeking intelligence, including generative retrieval and robust reasoning over noisy or partially relevant evidence. Finally, I present agentic systems trained via reinforcement learning that learn to interleave reasoning, search, and tool use without labeled trajectories, enabling scalable and adaptive behavior in real-world information environments.
Across these contributions, I demonstrate how integrating representation learning, retrieval, reasoning, and reinforcement learning yields systems that are both more capable and more interpretable. I conclude by outlining a research agenda toward general-purpose information-seeking agents, with implications for autonomous research assistants, scientific discovery, and reliable AI systems that broaden access to knowledge.
Additional Information:
Bowen Jin is a Ph.D. candidate in Computer Science at the University of Illinois at Urbana-Champaign, advised by Prof. Jiawei Han. His research focuses on the science of information-seeking intelligence, developing agentic systems that can search, reason, and act over large-scale, heterogeneous data by integrating foundation models, information retrieval, and reinforcement learning. Bowen has published extensively in top venues including NeurIPS, ICML, ICLR, KDD, ACL, SIGIR, and WWW, many of which are widely recognized. His research has been adopted in both academia and industry, including deployments at Google Cloud and widespread open-source use. He has received several honors, including the Apple PhD Fellowship, the Yunni & Maxine Pao Memorial Fellowship and NeurIPs top reviewer award.
Details...