Sessions

Plenary Sessions

Opening Plenary Session: TPC Vision

Monday, June 1, 14:00

This talk marks a pivotal moment in the evolution of scientific discovery, as AI, advanced computing, and human expertise converge to unlock a new era of continuous, real‑time innovation. It highlights the Genesis Mission as a bold national effort uniting national laboratories, industry, and academia to build an unprecedented scientific platform capable of accelerating breakthroughs at extraordinary speed. More than a technological shift, it is a call to action — an invitation for researchers and institutions to help shape a future where bold collaboration, shared purpose, and transformative innovation redefine what humanity can achieve.

Dario Gil, Under Secretary for Science, Department of Energy

The NAIRR Pilot, launched more than two years ago to connect the US-based research and education communities to critical AI resources including compute, data, software, models and expertise, is now supporting over 700 projects and enabling more than 7,000 students. The NAIRR Pilot has produced high-impact discoveries and novel AI models for science domains, and spurred start-up companies. The pilot is now transitioning to a sustainable model with a funded operations center that will be announced later this year. In this talk, NSF will discuss this novel public-private partnership and interagency collaboration model, key outcomes, lessons learned from the NAIRR Pilot, and future directions.

Katie Antypas, Director, Office of Advanced Cyberinfrastructure, US National Science Foundation

Our panelists collectively steward hundreds of millions of dollars in AI and computing infrastructure, along with world-class scientific talent, vast experimental datasets, major instruments and laboratories, and leadership-class supercomputers across some of the world's leading national and regional initiatives. What could that combined capacity achieve if aimed together at shared scientific challenges? Could coordinated frontier AI systems deliver vaccines in days rather than months the next time a pandemic emerges? Could agentic models working across international infrastructures cut aircraft fuel consumption by 25% or more, or discover energy storage materials and fuels that outperform anything available today? What pre-competitive scientific grand challenges — ones that rise above regional interests — are ripe for a coordinated global push? And how do we organize now to be at-the-ready when the moment demands it?

Moderator: Debra Goldfarb, Amazon Web Services

Dario Gil, Department of Energy
Katie Antypas, US National Science Foundation
Rick Stevens, Argonne National Laboratory
Satoshi Matsuoka, RIKEN R-CCS
Per Öster, CSC, IT Center for Science

Plenary Session 2: Industry / Lab / Academia

Monday, June 1, 16:30

Agentic science reframes scientific practice around human–AI teams that co-generate hypotheses, run experiments, and analyze results. Drawing on examples from our replication project, we develop a resource model spanning input tokens, human review time, HPC cycles for training and testing models, and compute for running experiments and analyzing data. We ask: how do we practically accelerate science, and at what cost? Holding scientist headcount fixed, we explore how per-scientist AI investment shifts productivity — rethinking metrics beyond paper counts toward hypothesis depth, saturation, quality, and "deeper" science with greater per-paper impact. We frame all this within the DOE Genesis Mission.

Rick Stevens, Associate Laboratory Director – CELS and Argonne Distinguished Fellow, Argonne National Laboratory | Professor of Computer Science, The University of Chicago

AI is rewriting science’s operating system, replacing human-driven, literature-centric research with autonomous discovery loops. But unlike commercial LLMs, AI for Science demands a vast ecosystem of specialized models and tools — ultra-resolution vision foundation models (ORBIT-2, SC'25 Best Paper), surrogate simulators, gigapixel pathology learners, and agentic co-scientists orchestrating thousands of domain tools. Serving Japan’s 500,000 researchers alone would require ~700K Blackwell-equivalent GPUs; agentic workloads remain largely unquantified. No nation or hyperscaler can build this capability solo. Global open collaboration — shared compute, open models, federated data, interoperable stacks — exemplified by the DOE Genesis–RIKEN ARiSE partnership, is absolutely necessary.

Satoshi Matsuoka, Director, RIKEN R-CCS

Scientific discovery is being transformed by the convergence of high-performance computing, AI for science, and quantum computing, spanning seamlessly from on-premises to cloud. Compressing time to innovation is now the defining challenge, so let’s explore real-world examples of how the industry is empowering the scientific community to harness this boundary-free compute model for breakthrough discovery.

Thierry Pellegrino, Global Head of Advanced Computing, Amazon Web Services

The next leap in AI for science will not come from models alone, but from the systems that sustain them. Building on DOE’s Exascale Initiative and Project, the Genesis Mission marks a shift from standalone machines to persistent, AI-enabled discovery platforms. AI accelerates hypothesis generation, while high-fidelity simulation ensures validation and trust. As workflows become agentic, computation shifts toward tightly integrated systems spanning orchestration, data, and execution. DOE’s LUX and Discovery exemplify this transition, enabling continuous discovery at scale. Leadership will be defined by the ability to build, operate, and sustain trusted scientific ecosystems.

Thomas Zacharia, Senior Vice President, Strategic Technical Partnership and Public Policy, AMD

Plenary Session 3: Frontier Models and Systems

Tuesday, June 2, 8:30

Modern AI systems are often cast as products: a model, a chatbot, an API. But when we think about using AI for science, that framing is too narrow. What scientific communities need is open AI infrastructure: data, codebases, pre- and post-training recipes, documentation, evaluations, and access to model flows across stages of development, not just a final released set of model weights. This talk will use Ai2’s Olmo project portfolio as a case study in what it means to build that infrastructure in the open. Drawing on recent results from our team, including work that exposes and studies multiple stages of model construction rather than only final models, we will show that openness at the level of infrastructure is not only a scientific virtue, but a practical necessity. If researchers are going to build AI systems for their own communities, and if universities, nonprofits, and governments are going to harness AI to serve the public interest, they must be able to invest in, contribute to, and use open infrastructure. Our goal is not to reproduce commercial AI, but to create a healthier open ecosystem to accelerate scientific discovery.

Noah A. Smith, Vice Provost for AI, Charles and Lisa Simonyi Endowed Chair for Artificial Intelligence and Emerging Technologies, and Professor, Paul G. Allen School of Computer Science & Engineering, University of Washington

The Bavarian proposal for the "Blue Swan" European AI Gigafactory addresses the critical need for specialized computing infrastructure to train large-scale foundation models within the European research and industrial landscape. The technical framework integrates high-end GPUs into a coherent, HPC-oriented cluster architecture, building on Leibniz Supercomputing Center's (LRZ’s) pioneering work on holistic energy efficiency. This includes hot water direct liquid cooling, utilization of 100% renewable energy sources, and reuse of waste heat to achieve carbon-neutral operations. On this technological basis, Blue Swan employs a scientific approach to analyze demand and enable and scale industrial applications in dedicated domains. It also integrates national European data spaces to facilitate latency-optimized interoperability between industrial applications and academic research. Thus, Blue Swan intends to serve as a substantial AI resource and a technological validation point for energy-efficient, sovereign AI infrastructures in the exascale range.

Dieter Kranzlmüller, Chairman of the Board of the Leibniz Supercomputing Centre (LRZ) | Full Professor, Ludwig-Maximilians-University Munich (LMU)

Large models are usually measured by what goes in: parameters, tokens, compute. Agents shift the focus to what happens over time: plans, tool calls, experiments, revisions, failures, recoveries, and discoveries. This talk frames agents as the machinery that turns trillion-parameter models from predictors into participants in scientific and technical work. We’ll discuss why agentic systems change how we think about scale, evaluation, reliability, and control, and why the next frontier may be not just bigger models, but larger and more consequential processes built around them.

Ian Foster, Data Science and Learning Division Director, Argonne National Laboratory

Japan’s LLM ecosystem is rapidly moving from adaptation to original capability building. This talk will present lessons from the Swallow and LLM-jp projects: open collaboration, Japanese-centric data curation, multilingual evaluation, and scalable training on domestic infrastructure. Recent work will be presented that shows how pre-training data can be rewritten to improve math and code performance, and how mixture-of-experts sparsity should be optimized for reasoning through active FLOPs and tokens-per-parameter. Together, these efforts point toward transparent, reproducible, and locally-grounded LLM development with global scientific impact.

Rio Yokota, Professor, Institute of Science Tokyo | Team Principal, RIKEN Center for Computational Science

Plenary Session 4: Workforce & Emerging Leaders

Tuesday, June 2, 11:00

Scientific discovery is entering a new era in which frontier AI is becoming essential for exploring phenomena that are too complex, data-intensive, or time-consuming for traditional approaches alone. From accelerating the understanding of biological systems to enabling new approaches for hardware-software co-design, emerging scientific applications are driving demand for increasingly capable AI systems. This talk will highlight representative efforts from the Genesis Mission Seed Model Teams, the RIKEN Center for Computational Science, and the Barcelona Supercomputing Center, illustrating how scientific researchers are shaping the needs for the next generation of AI infrastructure, computing systems, and large-scale computational workflows.

Valerie Taylor, Director, Mathematics and Computer Science Division, Argonne National Laboratory

With access to powerful AI, the scarce resource shifts from information and knowledge to judgment and intellectual leaps. The next generation’s edge will NOT simply be doing what machines cannot, but holding what must remain non-delegable for a responsible scientist. We argue for redefining “fundamentals” as transferable operations: inference, calibration, causality, verification, taste. Training in the foundational disciplines of physics, chemistry, biology and mathematics will become more important (not less) and will also serve as domains where intellectual thinking and judgment are developed. We educate scientists not only to stay useful in an AI economy, but because understanding is what science is for.

Karthik Duraisamy, Professor of Aerospace Engineering, and Director, Michigan Institute for Computational Discovery and Engineering, University of Michigan

This talk presents MIST, a family of molecular foundation models with an order-of-magnitude more parameters and training data than prior works. MIST models predict more than 400 structure-property relationships and demonstrate state-of-the-art performance across diverse benchmarks spanning from physiology to electrochemistry. It will cover MIST's capacity to solve real-world problems across chemical space, from multiobjective electrolyte screening to olfactory perception mapping, along with a systematic application of mechanistic interpretability methods to uncover generalizable scientific concepts learned by the model, which reveals how models encode chemical knowledge. The talk will introduce innovations in training methodology, including hyperparameter-penalized neural scaling laws that reduce model development computational costs by an order of magnitude. Together, these methods and findings represent significant progress toward accelerating materials discovery using foundation models.

Anoushka Bhutani, PhD Student, Mechanical Engineering and Scientific Computing, University of Michigan

Language Models (LMs) often struggle to generate diverse, human-like creative content, raising concerns about the long-term homogenization of human thought through repeated exposure to similar outputs. Yet scalable methods for evaluating LM output diversity remain limited, especially beyond narrow tasks such as random number or name generation, or beyond repeated sampling from a single model. Infinity-Chat is a large-scale dataset of 26K diverse, real-world, open-ended user queries that admit a wide range of plausible answers with no single ground truth. It is accompanied by the first comprehensive taxonomy for characterizing the full spectrum of open-ended prompts posed to LMs. This talk presents a large-scale study of mode collapse in LMs using Infinity-Chat, revealing a pronounced Artificial Hivemind effect in open-ended LM generation. Overall, Infinity-Chat presents the first large-scale resource for systematically studying real-world open-ended queries to LMs, revealing critical insights to guide future research for mitigating long-term AI safety risks posed by the Artificial Hivemind.

Liwei Jiang, PhD Student, Paul G. Allen School of Computer Science & Engineering, University of Washington

Lunch and Panel Discussion

Tuesday, June 2, 12:30

This panel explores practical lessons from real-world AI collaborations between industry and government, highlighting what drives successful partnerships — and what causes them to fail. Through case studies and firsthand experience, speakers will discuss key success factors, common pitfalls, and strategies for building effective cross-sector collaboration. The session will conclude with forward-looking ideas for future AI partnerships and actionable recommendations for initiating impactful public-private initiatives in an evolving technological and policy landscape. Attendees will walk away with practical, experience-based guidance for building or strengthening AI partnerships between the public and private sectors.

Moderator: Earl Joseph, CEO, Hyperion Research

Hal Finkel, U.S. Department of Energy
Bill Magro, Google Cloud
Molly Presley, Hammerspace
Samantika Sury, HPE

Lunch and Panel Discussion

Wednesday, June 3, 12:30

The real measure of success for TPC is the extent to which it enables breakthroughs in the global scientific community. One of TPC's goals is to “identify and incubate” collaborations. This panel of experts will describe their work and how it is accelerated by TPC participation. More importantly, this panel opens the door to opportunity. As we near the conclusion of TPC26, the most pertinent insights are the ways in which TPC directly benefits your work. This is where the collaboration begins.

Moderator: Addison Snell, Co-Founder & Chief Executive Officer, Intersect360 Research

Arvind Ramanathan, Argonne National Laboratory
Eliott Jacopin, RIKEN R-CCS
Christine Sweeney, Los Alamos National Laboratory
Rio Yokota, Institute of Science Tokyo

Plenary Session 5: TPC Collaborative Initiatives

Wednesday, June 3, 16:00

Scientific hypothesis generation is the most consequential step of the research workflow: every downstream action, from experiment design to simulation, observation, and analysis, inherits its quality from it. Researchers are starting to use frontier LLMs and multi-agent co-scientists as Hypothesis Generation Tools (HGTs) to accelerate scientific discovery, with the potential to exploit multi-disciplinary knowledge more effectively than any individual scientist. In practice, HGTs and humans work at different time scales: while a researcher will spend months to come up with a few hypotheses, HGTs can generate thousands of hypotheses for every single scientific question in a matter of hours or a few days. Although many of the HGT-generated hypotheses can be discarded quickly today, rejecting hypotheses will become harder as HGTs progress. This raises a novel question: what should a practical hypothesis generation process for science look like in the AI context? This talk will discuss the current state-of-the-art in hypothesis generation, available HGTs, an analysis of the generated hypotheses, some positive results, and evaluation outcomes. We will also introduce the following questions that the science community will have to answer: How many hypotheses should an HGT generate per problem? What scale of resources (tokens, GPUs) is needed to serve institutions with thousands of researchers? How can the generated hypotheses be filtered/ranked? Do we have the infrastructure to test tens, hundreds, or thousands of hypotheses per research problem?

Franck Cappello, R&D Lead, Senior Computer Scientist, Argonne National Laboratory

AI capability is increasingly important in academia, for both inference and training. Although options from industry are available, the resources available for AI development in academia are substantially smaller than those of industry. While AI training can be supported relatively simply by the batch-scheduled, GPU-enabled clusters in many HPC centers, inference workflows present new challenges. This talk will cover how inference is being addressed at the Texas Advanced Computing Center (the National Science Foundation’s Leadership-Class Computing Facility), examine the unmet needs we see from the user base, and provide a summary of TPC discussions between TACC and other centers around the world on how inference may be addressed in academic research.

Dan Stanzione, Executive Director, Texas Advanced Computing Center (TACC) | Associate Vice President for Research, UT-Austin

This closing session reflects on the highlights of TPC26 and looks ahead to what's next for the Trillion Parameter Consortium. We'll preview the fall hackathon and the SC26 workshop, which is now accepting 5-page papers through July 10. We'll also showcase collaborations advanced over the past year, including joint work on architecture and data for a large-scale, open frontier-class model; a partnership with CEPI to accelerate vaccine development; and expanding efforts in evaluation, agentic systems, alignment and safety, and MCP strategies.

Charlie Catlett, Executive Director, Trillion Parameter Consortium

Breakout Groups

TPC26 breakout groups are designed to identify, form, and pursue collaborations that will accelerate the development of new AI capabilities and services for scientific discovery. Some sessions are organized by TPC working groups; others are prospective working groups or birds-of-a-feather gatherings. Each session comprises a small set of lightning talks followed by group discussion, and all TPC26 participants are encouraged to submit lightning talk proposals.

The six-way parallel breakout schedule is loosely organized around eight themes: Driving Challenge Applications; AI Software Infrastructure/Frameworks; Infrastructure to Enable Shared Data & Computing; Open Frontier AI Systems; Open Suite for Evaluating Model Skills, Knowledge, Reasoning, and Safety; Open Frontier Models; Training- and Deployment-Level Safety and Alignment; and Expanding and Deepening the AI Workforce.

Driving Challenge Applications (Challenge Applications)

Room: Main Plenary Ballroom

Identify challenge applications for driving and evaluating the Infrastructure to Enable Shared Data & Computing, Open Frontier Models (Model Architecture & Performance Evaluation), and Open Frontier AI Systems tracks. The aim is not to pick winners and losers centrally, but to ask the community to volunteer (and drive) scientific challenge applications, aiming for diversity along multiple axes (including industry applications).

AI for Material Sciences

Session 1

Tuesday, June 2, 14:00

Session 2

Tuesday, June 2, 16:00

Eliu Huerta, Argonne National Laboratory

Xiaoyun Wang, NVIDIA

This session will showcase how AI is revolutionizing materials discovery across quantum science, semiconductors, chemistry, energy, and advanced manufacturing. Bringing together world-class leaders from academia and industry, the session features keynote talks by Ted Sargent (Northwestern), Cameron J. Owen (Lila Sciences), Laura McGorman (Meta), and Arvin Kakekhani (PsiQuantum), alongside a special NVIDIA tutorial by Xiaoyun Wang and lightning talks from rising innovators in the field. Expect bold ideas, frontier AI methods, foundation models for science, autonomous experimentation, and next-generation computational workflows that are redefining the pace of scientific discovery.

Session 1
Human-in-the-Loop and Mixed Acceleration for Next-Generation Catalysis	Edward (Ted) Sargent (Northwestern University)
Discovering Structural Rules in Scattering Amplitudes via Information Lattice Learning	Hazu Yu (Kocree Inc.)
Open Source AI for Innovations in Energy and Materials Sciences	Laura McGorman (Meta)
AIMNet2: A Foundational Machine-Learned Interatomic Potential for General Chemistry, Reactions, and Open-Shell Systems	Shams Mehdi (Carnegie Mellon University)
Toward Scientific Superintelligence: Autonomous Agent Frameworks for Accelerated Materials Discovery	Cameron John Owen (Lila Sciences)
Session 2
Nahual: A Sequence Model for Language and Atoms	Austin Cheng (University of Toronto)
Quantum Computing, Embedding, and Machine Learning for Predictive Chemistry	Arvin Kakekhani (PsiQuantum)
Accelerating Scientific Discovery With NeMo Agent Toolkit	Xiaoyun Wang (NVIDIA)
AI and Experimental Automation	Emma Bouchard (Carnegie Mellon University)

GridAI Model Team

Wednesday, June 3, 8:30

Kibaek Kim, Argonne National Laboratory

Teja Kuruganti, Oak Ridge National Laboratory

This working group is organized around GridAI, a Genesis Mission seed project to scope a scalable AI platform for power grid modeling, analysis, and decision support. This session will introduce GridAI to the TPC community, describe the seed project’s goals and team, and feature invited contributions from participating institutions on grid-relevant AI directions. An open forum will follow to discuss shared challenges in data, modeling, software, and HPC infrastructure, and to explore connections with scientific ML and energy systems modeling. Researchers in AI/ML, scalable algorithms, optimization, and complex systems are invited to join and help shape the GridAI agenda.

Scalable Heterogeneous Graph Learning for Grid AI Foundation Models	Massimiliano Lupo Pasini (Oak Ridge National Laboratory)
GridMind: An Agentic Workflow for Weather-Driven Power Grid Risk Assessment	Hongwei Jin (Argonne National Laboratory)
AI Applications in Transmission Planning and Operations	Anish M. Gaikwad (Electric Power Research Institute)
Scaling Laws for Heterogeneous Graph Learning in Optimal Power Flow Applications	Emon Dey (Argonne National Laboratory)

BOF: Bio-Foundation Models, Agentic Systems, and Biosecurity

Wednesday, June 3, 11:00

Arvind Ramanathan, Argonne National Laboratory

Newton Wahome, CEPI

Sarah Carter, CEPI

Bio-foundation models and agentic AI systems are reshaping biological discovery — from genome-scale language models and protein structure prediction to autonomous laboratory workflows — while simultaneously surfacing critical biosecurity risks that demand urgent community attention. This BOF convenes leading researchers from industry and national laboratories to examine scalable biological AI architectures, agentic orchestration for autonomous science, and governance frameworks for dual-use risks. Topics span model scaling behavior, biosafety-by-design, and policy-aligned deployment. Attendees will gain actionable insight into responsible bio-AI development, preparing the HPC and AI research community for safe, high-impact biological discovery at scale.

Scalable Agentic Reasoning for Designing Biologics Targeting Intrinsically Disordered Proteins	Archit Vasan (Argonne National Laboratory)

AI and Strategic Decision Support

Wednesday, June 3, 14:00

Frank Alexander, Argonne National Laboratory

Manish Parashar, The University of Utah

Building on the consortium’s collaborative development of foundation models for science and engineering, we’ll examine applications in disaster response, supply chain resilience, pandemic management, and other areas. The discussion will connect TPC’s work on scalable architectures, scientific data curation, and exascale optimization to breakthrough capabilities in time-critical decision support.

From a Napkin To a Workflow — Opportunities and Challenges for Workflow Composition	Ewa Deelman (University of Southern California)
Evaluating Epistemic Non-Triviality of Large Language Reasoning Models in Scientific Hypothesis Generation	Tirthankar Ghosal (Oak Ridge National Laboratory)

AI Software Infrastructure/Frameworks (Software Stack)

Room: Baltimore and Columbia

Develop software infrastructure and middleware to support the training, deployment, and integration of complex frontier-scale AI models and systems. Provide the technical backbone for the Open Frontier Models (Model Architecture & Performance Evaluation) and Open Frontier AI Systems tracks, while enabling integration with experimentation platforms, laboratories, instruments, and other real-world scientific environments.

BOF: AI Frameworks for Multimodal Data Access and Use

Tuesday, June 2, 14:00

Ilkay Altintas, San Diego Supercomputer Center

Manish Parashar, The University of Utah

AI is accelerating discovery, redefining the workforce, and transforming society, yet fragmented data and computing limit innovation. Federated, AI-ready frameworks can bridge distributed data repositories and computing resources through interoperable, production-grade services that enable seamless access, integration, and composability of data, models, and workflows across edge, cloud, and HPC. These frameworks support multimodal analysis, simulation, and end-to-end workflows. This BOF will examine architectures, highlight existing efforts, and explore paths toward a cohesive national ecosystem. Talks and discussions will identify needs, share use cases, and provide practical entry points for leveraging national cyberinfrastructure.

Operationalizing Sovereign Data for AI Inference and Physical AI	Molly Presley (Hammerspace)
From Access to Action: The NAIRR Pilot Portal and Its Interactive Sandboxes	Sandra Gesing (San Diego Supercomputer Center)

DWARF: Data, Workflows, Agents, and Reasoning Frameworks

Session 1

Tuesday, June 2, 16:00

Session 2

Wednesday, June 3, 8:30

Robert Underwood, Argonne National Laboratory

Neeraj Kumar, Pacific Northwest National Laboratory

Ian Foster, Argonne National Laboratory

This multi-session track explores emerging systems and strategies for building intelligent, scalable platforms to accelerate scientific discovery. Talks and discussions will cover the design of agent-based architectures, integration of scientific workflows with large language models, scalable data pipelines, and novel reasoning frameworks. The session welcomes both developers of software infrastructure and users of that infrastructure. Participants will engage in dialogue on the future of scientific AI infrastructure and the coordination required to realize a distributed, agent-enabled discovery ecosystem.

Best Practices for Scientific Workflows	Robert Underwood (Argonne National Laboratory)
When More Cores Hurts: The Vector Database Scaling Paradox in HPC	Seth Ockerman (University of Wisconsin)
Agents with Agency	Yadu Babuji (University of Chicago)
Deploying Agentic AI Across the DOE Accelerator Complex: The MOAT Experience	Thorsten Hellert (Lawrence Berkeley National Laboratory)
Unified User Experience Across Heterogeneous GPU Clusters with Diamond	Zhao Zhang (Rutgers University)
Enabling Autonomous Scientific Discovery Through Agentic AI and High-Performance Computing	Murat Keceli (Argonne National Laboratory)
Building Reusable and Trustworthy AI Co-Scientists: Lessons From Multi-Domain Scientific Deployments	Chandrachur Bhattachar (Argonne National Laboratory)
The Semiconductor Engineering Inflection	Srikanth Gubbala (Applied Materials)

AI Software Infrastructure Frameworks

Tuesday, June 2, 16:00

Mohamed Wahib, RIKEN R-CCS

Rio Yokota, Institute of Science Tokyo

This working group session will discuss the software infrastructure and middleware required to support frontier-scale AI for science, including frameworks for training, deployment, orchestration, and integration with HPC systems, scientific workflows, laboratories, instruments, and experimentation platforms. The session aims to identify common software challenges and collaboration opportunities needed to make large-scale AI systems usable, reliable, and interoperable across scientific environments.

AI and CI Services From NSF-AI Institute ICICLE	D.K. Panda (The Ohio State University)
Planned Reproducible Orchestration of Tools for LLM-Based Agents	Eliott Jacopin (Center for Biosystems Dynamics Research, RIKEN R-CCS)
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models	Murali Emani (Argonne National Laboratory)
NAPA Project: AI Testbed Deployment in Japan	Jason Haga (National Institute of Advanced Industrial Science and Technology (AIST))
Scaling MoE Inference with vLLM on Aurora	Padma Apparao (Intel)
ElMerFold: Exascale Distillation Workflows for Protein Structure Prediction on El Capitan	Nikoli Dryden (Lawrence Livermore National Laboratory)

BOF: Large Language Models for Scientific Software

Wednesday, June 3, 11:00

Mohammad Alaul Haque Monil, Oak Ridge National Laboratory

Keita Teranishi, Oak Ridge National Laboratory

Agentic AI is redefining HPC research by introducing intelligent, autonomous capabilities. This BOF explores how large language model (LLM) agents enable code translation, modernization, modeling, and tuning, allowing legacy scientific applications to be efficiently adapted for modern HPC architectures. Beyond code transformation, agentic systems can orchestrate and optimize complex, end-to-end HPC workflows with minimal human intervention. Participants will discuss emerging tools, challenges, and opportunities in deploying LLM-driven agents for scalable, reproducible, and adaptive research pipelines. The session aims to foster collaboration and share insights on advancing autonomous HPC systems powered by agentic AI technologies.

Applications of Deep Knowledge Graph	Geoffrey Fox (University of Virginia)
Curating Agentic Workflows with Knowledge Graphs and Operational Experience	Ana Gainaru (Oak Ridge National Laboratory)
A Tale of Two Agentic Frameworks: Empirical Studies of LangChain/LangGraph and AG2 in Autonomous Scientific Workflows	Meifeng Lin (Brookhaven National Laboratory)
Enabling Autonomous Scientific Discovery Through Agentic AI and HPC	Murat Keceli (Argonne National Laboratory)

BOF: Trillion Parameter Models for the Edge and Computing Continuum

Wednesday, June 3, 11:00

DK Panda, The Ohio State University

Barney Maccabe, University of Arizona

The computing continuum at the edge is emerging as a common environment for many applications: transportation, fire prevention, agriculture, medicine, manufacturing, and more. Typical edge computing environments (IoT devices, drones, etc.) do not have enough computing or storage capacity, which raises the challenge of how to use TPC models in this environment. This BOF will focus on state-of-the-art solutions in this direction as well as future opportunities and challenges.

Catalyzing AI-Enabled Science Across the Digital Continuum for Science	Manish Parashar (University of Utah)
Supporting the Continuum AI Ecosystem in the National Data Platform	Ilkay Altintas (San Diego Supercomputer Center)
Bridging the Computing Continuum	Sean Shahkarami (Northwestern University)

Infrastructure to Enable Shared Data & Computing (Shared Infrastructure)

Room: Annapolis and Columbia

Collectively build scientific training data resources and shared computing infrastructure for model training and further fine-tuning for general-purpose and domain-specific settings. Establish scalable and sustainable capabilities that serve as the foundation for the rest of the tracks.

BOF: Trustworthy Privacy-Preserved Federated Learning for Science

Tuesday, June 2, 14:00

Olivera Kolevska, Oak Ridge National Laboratory

Kibaek Kim, Argonne National Laboratory

Ravi Madduri, Argonne National Laboratory

Federated learning offers a promising approach for enabling collaborative scientific discovery while preserving the privacy of sensitive data across institutions. This BOF will bring together researchers and practitioners to discuss trustworthy, privacy-preserving federated learning frameworks tailored for scientific workloads. The discussion will focus on challenges such as secure model aggregation, data confidentiality, system scalability, and integration with distributed research infrastructures. Objectives include identifying common requirements, sharing emerging techniques, and fostering collaborations within the TPC community. The session is particularly relevant to TPC participants interested in distributed computing, secure data sharing, and scalable AI methods that support cross-institutional scientific research.

NeuroFL: OBI's Intelligence Network for Brain Health	Francis Jeanson (Ontario Brain Institute)
OmniFed: Towards Configurable Cross-Silo Federated Learning	Sahil Tyagi (Oak Ridge National Laboratory)
Differentially Private Federated Averaging with James-Stein Estimator	Minseok Ryu (Arizona State University)
Socio-Technical Infrastructure: Operationalizing FL Systems	Mohammed Manzari (Deloitte)
Are You Ready for Production Federated Learning?	Holger Roth (NVIDIA)
Federated LLM Training Across NNSA Labs	Max Carlson (Sandia National Laboratories)
Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers	Yijiang Li (Argonne National Laboratory)
Towards Trustworthy Federated AI: Privacy, Ownership Protection, and Model Editing	Olivera Kolevska (Oak Ridge National Laboratory)
The Next Frontier: Federated AI with Flower	William Lindskog-Munzing (Flower Labs)

BOF: 50-State AI Plan: A Grassroots Approach to Building a US AI Continent

Tuesday, June 2, 16:00

Barr von Oehsen, Pittsburgh Supercomputing Center

Jack Wells, NVIDIA

This BOF explores the emerging fabric of state and regional initiatives that will empower American leadership in AI and quantum computing. Using Pennsylvania, Tennessee, Utah, Massachusetts, New York, New Jersey, and California as case studies, we examine how state-led initiatives are aligning regional assets with America's AI Action Plan, the Genesis Mission, and the National AI Research Resource (NAIRR) in building a US AI Continent. Crucially, we discuss how these and other federal initiatives can leverage state-level "factories" to advance research, workforce development, and economic growth by embracing grassroots innovation tailored to local and regional strengths. We highlight essential partnerships between academia, state governments, industry, philanthropy, and federal agencies to drive these efforts. Participants will discuss strategies for democratizing access to high-performance computing resources, streamlining technical deployment, and building the AI workforce pipeline. By fostering inclusive, state-led hubs, we can attract investment, train thousands of workers, and ensure US technological sovereignty in AI.

Catalyzing the Utah Responsible AI Innovation Ecosystem	Manish Parashar (The University of Utah)
Experimental Design for Foundation Models: From Uncertainty to Risk	Yu Wen (Stony Brook University)
The Pittsburgh Supercomputing Center, An Integrated National Resource for AI for Science	Paola Buitrago (Pittsburgh Supercomputing Center)
From Vision to Momentum: The AI Tennessee Blueprint	Tabitha Samuel (University of Tennessee)
The Keystone AI + Quantum Factory	Barr von Oehsen (Pittsburgh Supercomputing Center)

BOF: Self-Driving Labs for Accelerating Scientific Discovery at Scale

Wednesday, June 3, 8:30

Arvind Ramanathan, Argonne National Laboratory

Rio Yokota, Institute of Science Tokyo

Self-driving laboratories (SDLs) are transforming scientific discovery by integrating AI-driven hypothesis generation, robotic experimentation, and closed-loop active learning into fully autonomous workflows. This workshop convenes leading researchers from national laboratories, academia, and industry to examine the computational foundations of SDLs — spanning foundation models for experimental design, agentic orchestration across heterogeneous instruments, high-throughput data pipelines, and HPC integration at scale. Applications span drug discovery, materials design, enzyme engineering, and critical minerals extraction. Attendees will gain actionable insight into deploying autonomous discovery platforms, benchmarking SDL performance, and building the open infrastructure needed to accelerate science at exascale.

BOF: Bold New World of Heterogeneous AI Computing

Session 1

Wednesday, June 3, 11:00

Session 2

Wednesday, June 3, 14:00

Satyam Srivastava, d-Matrix

Tom St. John, Gimlet Labs

As AI workloads diversify, no single accelerator wins on every axis of cost, power, and performance. Consequently, traditional monolithic architectures are hitting critical bottlenecks in scaling and efficiency. This BOF brings together perspectives from researchers, architects, and hardware vendors on building production systems that combine GPUs, ASICs, and more. Speakers will share insights spanning architecture and integration, software stacks and portability, workload scheduling across heterogeneous fabrics, and real-world performance benchmarks. Attendees will gain a strong grasp of the trade-offs, tooling gaps, and emerging best practices for designing AI infrastructure that treats heterogeneity as a foundational feature.

Session 1
Macroheterogeneity: Enabling Hybrid HPC and AI Workflows	Samantika Sury (HPE)
Inference-first AI Design	Tim Clarke (SambaNova)
AI, HPC, precision and the uncertain future	Robert Robey (AMD)
Off the Leash: Device-Native Heterogeneous Execution	Jason Saliya Ekanayake (d-Matrix)
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models	Murali Emani (Argonne)
MoE at Scale: From GPUs to Wafer-Scale AGI	Daria Soboleva (Cerebras)
Session 2
Efficient and Scalable Agentic AI with Heterogeneous Systems	Zain Asgar (Gimlet)
Prefill Here, Decode There: Disaggregated LLM Serving Across GPUs and LPUs	Vineeth Gutta (NVIDIA)
One Abstraction, Many Transports: Streaming Communication for Disaggregated Inference	Sai Rahul (d-Matrix)
The NAPA Project: Inference Systems	Jason Haga (AIST)
Optimizing Inference at Wafer Scale	Natalia Vassilieva (Cerebras)

BOF: Energy-Efficient and Sustainable AI

Wednesday, June 3, 14:00

Siddhartha Jana, Intel

Natalie Bates, Lawrence Berkeley National Laboratory

Shaohui Liu, Massachusetts Institute of Technology

This BOF will focus on the sustainability challenges of large-scale AI, aligned with the mission of advancing responsible and scalable AI for science. As trillion-parameter models demand unprecedented compute, energy, and data resources, critical questions arise around carbon footprint, infrastructure efficiency, and equitable access. The forum will convene researchers, industry practitioners, and infrastructure providers to examine trade-offs between performance and sustainability, share best practices in energy-efficient model design and deployment, and identify collaborative pathways for greener AI. By fostering cross-sector dialogue, this session aims to shape actionable strategies for sustainable AI at extreme scale.

Open Frontier AI Systems (AI Systems)

Room: Frederick

Develop frontier AI systems for science that incorporate reasoning models (starting with state-of-the-art closed models and eventually including open frontier models), and develop domain foundation models, knowledge graphs, agentic systems and orchestrations, simulators, and experiments.

BOF: AI Agents as Scientific Collaborators: Building Human-Agent Research Teams

Tuesday, June 2, 14:00

Rick Stevens, Argonne National Laboratory

Arvind Ramanathan, Argonne National Laboratory

Thomas Brettin, Argonne National Laboratory

Scientific AI agents are moving from tools to participants — attending conferences, contributing to research, and coordinating across institutions. This BOF features lightning talks delivered by AI agents alongside their human collaborators, showcasing live experiments in agentic research workflows. We then open the floor to explore a proposed international collaboration: a multi-institutional human-agent team working together to accelerate scientific discovery, reduce duplicated effort, and raise the quality of science.

Agent-Enabled Paper to Code Generation for ML Reproducibility	Zhao Zhang (Rutgers University)
Towards Intelligent CFD Workflow in the Era of Large Language Models	Shaowu Pan (Rensselaer Polytechnic Institute)
An AI Research Assistant for Automating the Computational Catalysis Pipeline	Ruchika Mahajan (Stanford University)
Model Capabilities Driving New Paradigms of Agentic Patterns	Matt Baughaman (Princeton Plasma Physics Laboratory)
Toward Agentic Closed-Loop AI for Battery Science: From SpectraQuery to Multimodal Experimental Agents	Sreya Vangara (Stanford University)

Toward Scientific AI Platforms: Inference, Agents, and AI Services at HPC Facilities

Wednesday, June 3, 8:30

Venkatram Vishwanath, Argonne National Laboratory

Ilkay Altintas, San Diego Supercomputer Center

Building on last year’s session on inference-for-science services, this session expands its focus to the broader landscape of scientific AI platforms at HPC facilities. As foundation models, domain-specific AI systems, and agentic workflows gain traction, HPC centers are actively developing infrastructure for scalable inference, AI agents, AI-ready data services, model gateways, and facility-scale AI services. This session will convene members from international HPC centers, application teams, vendors, and the open source community to share emerging best practices for deploying reliable, reproducible, and secure AI services for science. Discussion topics will span simulation and workflow integration, heterogeneous architectures, orchestration, agent skills, observability, sustainability, and workforce development. The session will gather use cases, identify shared technical and operational gaps, and define next steps for continued collaboration across the TPC community.

The NAPA Project: Inference Systems	Jason Haga (National Institute of Advanced Industrial Science and Technology (AIST))
Agentic AI-Driven Inference Service for Scientific Applications	Shaojia Fan (National Center for High-Performance Computing)
DragonHPC: A High-Performance Distributed Runtime for AI‑Coupled HPC Workflows	Samantika Sury (HPE)
ALCF Inference Service: Facility-Scale AI Services for Science	Venkatram Vishwanath (Argonne National Laboratory)
Optimizing Inference at Wafer Scale?	Daria Soboleva (Cerebras)

BOF: Human-AI Collaboration

Wednesday, June 3, 14:00

Anurag Acharya, Pacific Northwest National Laboratory

Patrick Emami, National Renewable Energy Laboratory

This BOF centers on frameworks for Human-AI co-intelligence in scientific discovery, positioning collaboration, not autonomy, as the primary design goal. Building on our concept of Human-AI Virtual Laboratories, we will explore how mixed-initiative interaction, role differentiation, and coordinated workflows could enable scientists and AI systems to function as true teammates. The discussion aims to surface key design principles spanning agency, communication, and coordination, alongside open challenges in building such systems in practice. We will also briefly discuss evaluation as a supporting concern, focusing on how to assess collaborative effectiveness in these new paradigms.

Building for Human-AI Scientific Co-Intelligence	Patrick Emami (National Laboratory of the Rockies)
Evaluating Memory Condensation Strategies for Coding Agents in Data-Driven Scientific Discovery	Sameera Horawalavithana (Pacific Northwest National Laboratory)

Open Suite for Evaluating Model Skills, Knowledge, Reasoning, & Safety (Evaluation)

Room: Chesapeake A

Develop an open suite of tools, methods, and benchmarks for evaluating the scientific skills, knowledge, reasoning, agentic capabilities, and safety/security of frontier models and AI systems.

Evaluation of LLMs for Science

Session 1

Tuesday, June 2, 14:00

Session 2

Tuesday, June 2, 16:00

Franck Cappello, Argonne National Laboratory

Sandeep Madireddy, Argonne National Laboratory

One of the main drivers of the rapid evolution of LLMs is the availability of benchmarks that assess their skills and trustworthiness. Not only do benchmarks enable rigorous evaluation against accepted metrics, but they also generate competition between LLM developers. While several frameworks and benchmarks have emerged as de facto standards for evaluating general-purpose LLMs (Eleuther AI Harness and HELM for skills, DecodingTrust for trustworthiness), very few relate specifically to science. In this track, we will discuss the challenges of developing methods to evaluate the skills, trustworthiness, and safety of large foundation models for science. The track will include multiple sessions focused on different facets of model evaluation.

Session 1: Evaluation of Code Generation, Translation, and Parallelization
Reliable Evaluation of LLMs for Scientific Parallel Code Translation	Le Chen (Argonne National Laboratory)
Benchmarking LLM-Generated Parallel Code for Task-Based Workflow Programming Models	Eduardo Iraola de Aceve (Barcelona Supercomputing Center)
CelloAI Benchmarks: Evaluating LLMs for HEP/HPC Software	Kriti Chopra (Brookhaven National Laboratory)
Session 2: Principled Approaches to AI Scaling, Reasoning, and Benchmarking
Why Neural Scaling Laws Require an Information-Theoretic Foundation	Sujata Sinha (Virginia Polytechnic Institute and State University)
Building Reusable and Trustworthy AI Co-Scientists: Lessons from Multi-Domain Scientific Deployments	Chandrachur Bhattacharya (Argonne National Laboratory)
Difficulty-Oriented Reasoning Effort Modeling and Systematic Evaluation of Science Problems for Large Language Models	Martin Foltin (HPE)
Pier: Efficient Large Language Model Pretraining with Relaxed Global Communication	Zhao Zhang (Rutgers University)

Agentic Reasoning with Scientific Foundation Models

Wednesday, June 3, 8:30

Ayan Biswas, Los Alamos National Laboratory

Christine Sweeney, Los Alamos National Laboratory

Scientific AI is rapidly shifting from predictive models toward agents that can plan, use tools, run simulations, analyze data, and automate parts of the research workflow. This session asks what “reasoning” means when scientific AI systems act, not just answer. Scientific agents must satisfy constraints that general-purpose LLM agents often do not: physical consistency, numerical validity, uncertainty quantification, provenance, reproducibility, security, and appropriate human oversight. We will discuss how agentic automation changes evaluation for SciML and AI for science. Which tasks should agents automate? How should they decide when to invoke solvers, query data, generate hypotheses, or ask for human input? How do we distinguish scientific reasoning from brittle tool use, brute-force search, or persuasive but invalid explanations? The session will touch upon topics such as community needs for benchmarks, testbeds, safety practices, provenance standards, and evaluation frameworks for reliable, interpretable, and useful scientific agents.

Accelerating Scientific Discovery with NeMo Agent Toolkit	Xiaoyun Wang (NVIDIA)
Simulation Surrogates Using Gaussian Splatting and Diffusion	Han-Wei Shen (The Ohio State University)
Evaluating Agents Without Ground Truth	Kirsten Hofmockel (Pacific Northwest National Laboratory)

Open Frontier Models (Open Models)

Room: Chesapeake A

Build frontier-scale, open AI models using shared data and computing infrastructure (from the Infrastructure to Enable Shared Data and Computing track), harnessing distributed resources across TPC partner institutions. Ensure that all core components are openly available to enable transparency, reuse, and scientific progress.

Model Architectures and Performance Evaluation

Session 1

Wednesday, June 3, 11:00

Session 2

Wednesday, June 3, 14:00

Rio Yokota, Institute of Science Tokyo

Murali Emani, Argonne National Laboratory

Architectures for AI models are evolving rapidly, with frequent innovations in transformer variants that reduce the cost of attention and the KV cache in long-context and agentic reasoning, as well as in framework support, parallelism strategies, and system-level optimizations. Identifying the optimal architecture and framework for training foundation models on scientific data is vital to unlocking the next generation of AI for science. Equally crucial is efficient inference, which enables the practical use of pre-trained models in downstream scientific applications. This multi-session track will bring together researchers and practitioners to discuss cutting-edge strategies for large-scale training, inference, and agentic scaling, alongside robust workflows to integrate them.

Scaling MoE Models to Exascale With Open Software on Aurora	Padma Apparao (Intel)
Olmo Hybrid: From Theory to Practice and Back	William Merrill (Toyota Technological Institute at Chicago)
RingX: Scalable and Efficient Long-Context Learning for Scientific Foundation Models on HPC	Junqi Yin (Oak Ridge National Laboratory)
AIMNet2: A Foundational Machine-Learned Interatomic Potential for General Chemistry, Reactions, and Open-Shell Systems	Shams Mehdi (Carnegie Mellon University)
Closing the Loop: Federated Learning for Distributed Scientific Systems	Chandreyee Bhowmick (Oak Ridge National Laboratory)

Training- and Deployment-Level Safety and Alignment (Safety & Alignment)

Room: Frederick

Develop methods to embed safety and alignment into the training and deployment of frontier-scale models and AI systems. Focus on system-level mechanisms that maintain alignment with scientific objectives and constraints, and with broader societal values, at extreme scale and in high-impact settings.

BOF: Safety and Alignment in Agentic Systems

Tuesday, June 2, 16:00

Rio Yokota, Institute of Science Tokyo

Agentic systems are being built or proposed for a diverse range of applications, from literature review to laboratory automation. Each domain brings common as well as unique safety requirements and alignment/containment strategies. This BOF seeks to find common approaches that might be applied across different application domains, ideally identifying architectural and design approaches that are broadly applicable to developing operational agentic systems for science and engineering.

A Comprehensive Multilingual Jailbreak Evaluation of Open-Source Large Language Models	Kashyap Manjusha (University of Illinois Urbana-Champaign)
A Full-Stack Approach to Frontier Model Safety: Red-Teaming, Interpretability, Unlearning, and Formal Verification	Sumit Kumar Jha (University of Florida)
The Need for New Safety Measures for LLMs in Scientific Applications	Saket Chaturvec (Argonne National Laboratory)

Expanding and Deepening the AI Workforce (Workforce)

Room: Columbia

Identify and report on progress in developing the workforce required to achieve the goals of the other tracks, with particular attention to emerging and evolving roles across the frontier AI stack. Examine needs across all career stages and share recent experiences and lessons learned to inform sustainable talent development.

BOF: Building the AI-Ready Science and Engineering Workforce: Skills, Systems, and Collaboration Required For Success

Tuesday, June 2, 14:00

Doug Norton, The HPC-AI Society

As AI transforms scientific discovery and engineering design, the greatest challenge is not just computational scale but human scale. This BOF features a panel that explores how research institutions, industry, and consortia can cultivate and incentivize the next generation of scientists and engineers to harness AI-driven workflows. We will explore the evolving skill sets required for hybrid computing environments that integrate HPC, AI, and emerging quantum systems; strategies for interdisciplinary training that bridges data science, physics, and engineering; and frameworks for workforce development that ensure equitable access to these transformative tools. Panelists will share insights from active programs and collaborations, including CMU/PSC ByteBoost and the Argonne/University of Alaska Wildfire Analysis project, which are building the talent pipeline for AI-enabled science at trillion-parameter scale.

Igniting Alaska’s AI Future: Developing Alaska’s AI Talent Through Wildfire Analysis	Arghya Kusum Das (University of Alaska Fairbanks)
ByteBoost: Building the AI Workforce for Advanced Computing Systems	Paola A. Buitrago (Carnegie Mellon University | Pittsburgh Supercomputing Center)

BOF: Who Builds the Future? Workforce Challenges in Trillion-Parameter Scientific Computing

Wednesday, June 3, 8:30

Lois Curfman McInnes, Argonne National Laboratory

Arfon Smith, Schmidt Sciences

Michelle Barker, Research Software Alliance

Anshu Dubey, RIKEN R-CCS

Daniel S. Katz, University of Illinois Urbana-Champaign

The computational science and research software engineering (RSE) communities are at a turning point as trillion-parameter-scale AI systems reshape code generation, simulation, and scientific workflows. Yet software ecosystems, team structures, expected roles, and collaboration practices were designed for a pre-AI era. Addressing this shift requires more than technical integration; it demands rethinking how we organize, incentivize, and sustain research software at scale. While progress has focused on models and infrastructure, critical workforce issues — including cross-disciplinary collaboration, evolving human and AI contributions, team dynamics, and incentive structures — remain underexplored. This BOF invites community input on these emerging challenges and opportunities.

Iterative Co-Design, Co-Development, and Co-Delivery: Accelerating S&T Productivity	Mary Ann Leung (Sustainable Horizons Institute)
Enabling Research Software Engineers to Leverage AI at APL: Policies, Tools, Training, and Use Cases	Jon Vandegriff (Johns Hopkins University)
Lessons Learned from Hosting a Frontier AI and LLM Tutorial Series for a Mixed Audience	Meifeng Lin (Brookhaven National Laboratory)
From Writing Software to Ensuring It's Written Well: The Evolving RSE Role	Arfon Smith (Schmidt Sciences)
How Fast Can We Evolve the Workforce?	Roscoe Giles (Boston University)
Designing Scientific Computing Ecosystems for the AI Era	Lois Curfman McInnes (Argonne National Laboratory)

Please email info@tpc26.org with any questions.
