2026 Workshop: Database to Discovery in CEDAR
Katherine Cariglia
CEDAR science increasingly depends on combining data and metadata across multiple repositories, instruments, and communities, but researchers and database teams still face recurring barriers in discovery, access, interpretation, interoperability, and reproducible analysis. This session will bring together database administrators, cyberinfrastructure developers, and research users from Madrigal, SuperDARN, SuperMAG, OMNIWeb/CDAWeb, GIRO, AMISR, and related resources to discuss practical advances in geospace data infrastructure. Topics will include interoperable APIs and metadata (for example HAPI and SPASE aligned workflows), AI assisted data navigation and interpretation, community repositories of analysis tools and notebooks, scientific visualization, and strategies to assess how database infrastructures improve research productivity and usability for the CEDAR community.
10:00-10:05 Introduction
10:05-10:15 Pablo Reyes The AMISR database and REproducible Software Environment
10:15-10:25 Alex Chartier Ensuring data access, interoperability and reproducibility for dynamic NSF research facilities
10:25-10:35 Giorgio Picanco GIRO/DIDBase: Current Capabilities, Challenges, and Future Directions for Data Products and Ionospheric Services
10:35-10:45 James Lewis From Metadata to Meaning: Lessons from Building Cross-Mission Analysis Tools
10:45-10:55 Buffer
10:55-11:05 Jesper Gjerloev The physics view: Extracting relevant information from large repositories to understand processes.
11:05-11:15 Katherine Cariglia CEDAR Madrigal Database Updates 2026
11:15-11:25 Jennifer Knuth Data Access in Heliophysics: HAPI and LaTiS from a User’s Perspective
11:25-11:35 Enrique Rojas From Data Discovery to Idea Discovery: Papers, Workflows, and Agents as a New Layer for Madrigal
11:35-12:00 Open discussion
This session is motivated by a shared CEDAR need: frontier atmosphere-ionosphere-magnetosphere studies now require multi-source data synthesis, but the effort required to discover, access, align, interpret, and reuse data remains high and uneven across repositories. The proposed session is designed to be useful for both database teams and research scientists. For database administrators, it provides a venue to compare implementation strategies for metadata, APIs, provenance, discoverability, and user support. For researchers, it focuses on practical workflows that reduce time to science, improve reproducibility, and make cross-database studies easier to execute. We specifically want to invite perspectives from various database teams so the discussion is centered on common design patterns, pain points, and opportunities for coordinated progress across CEDAR and neighboring heliophysics data ecosystems. A useful outcome would be a short community roadmap identifying near-term pilot collaborations, shared priorities for metadata/access improvements, and candidate evaluation metrics for user-facing tools.