Sept. 18, 2025

San Francisco

Modern Data Infrastructure Summit

Build infrastructure that can handle AI, ML, and real-time workloads at scale

Join engineers and architects at MDI Summit to learn the architectures and technologies powering today's most demanding workloads.

About

Shaping the Future of Data-Driven Innovation

Who's it for?

If you're building, running, or modernizing large-scale data systems, MDI Summit is for you. AI, ML, and real-time workloads are putting unprecedented strain on data infrastructure. This event brings together the engineers, architects, and infrastructure leaders who are solving those challenges today.

What you'll learn

From real-time ML feature stores to globally distributed databases to LLMs serving millions, you'll hear how leading teams are designing scalable, resilient, and cost-efficient systems for the AI era. Walk away with strategies, tools, and connections to keep your infrastructure ahead of what's next.

Schedule

9:30 - 10:00

Check-in & Breakfast

10:00 - 10:15

Opening Remarks

10:15 - 13:10

Morning Sessions

Lunch

14:15 - 16:30

Afternoon Sessions

16:30 - 18:30

Happy Hour

Talks

ChatGPT Ain't Got $%@& On Me! The Future of Automated Database Tuning

LLMs are coming for your query planner, but can they really out-tune decades of handcrafted algorithms? This talk dives into cutting-edge research on autonomous agents that tune PostgreSQL end-to-end, covering knobs, indexes, and query plans. Learn what works, what doesn't, and what's next on the road to self-driving databases.

Andy Pavlo

Associate Professor of Databaseology, Carnegie Mellon University

The Past, Present, and Future of In-Memory Datastores

In-memory datastores have been an essential component of the internet's data infrastructure for the past 30 years. From static websites to Web 2.0 to applications powered by ML and AI, their capabilities, use cases, and architectural designs are an evolving story that mirrors the growth of the internet itself. Join us on a trip through the past, present, and future of this foundational data technology.

Oded Poncz

Co-Founder & CEO, Dragonfly

From Consistent Hashing to… Elasticity!

Learn how ScyllaDB reimagined consistent hashing with tablets to unlock truly elastic, topology-agnostic scaling without painful rebalancing or overcommitted nodes.

Felipe Cardeneti Mendes

Technical Director, ScyllaDB

Observability Data Lake: Motivation and Methodology

Big data meets observability. This session explores how to build scalable, cost-effective observability infrastructure using open formats and protocols.

Ning Sun

CTO & Co-founder, Greptime

Ruihang Xia

SWE, Greptime

Deploying Agentic AI safely and scalably in the modern enterprise: What we can learn from two decades of streaming practice across the industry

Agentic AI raises the stakes. Learn how decades of streaming system best practices offer a stable foundation for deploying autonomous agents in production.

Tyler Akidau

CTO, Redpanda

Ducks on a Lake: Scaling Data Lakes to Warehouse Performance

Explore DuckLake, a new open table format from DuckDB's creators, designed to bring warehouse-grade performance and metadata control to data lakes.

Ryan Boyd

Co-founder, MotherDuck

Scaling Without Burn: Cloud Cost Efficiency at Billion-User Scale

What happens when you hit hypergrowth and your cloud bill hits back? Learn how ShareChat cut cloud spend from $150M to <$20M without slowing down, along with some hard lessons on culture, architecture, and building cost-efficient infra that can actually keep up.

Arya Ketan

Distinguished Engineer, ShareChat

Frequently Asked Questions

Have more questions? Please reach out via our contact form.

Reserve your tickets to the summit today.

Get tickets

This premier event brings together industry leaders, technologists, and innovators to explore how modern architectures and advanced technologies are transforming the way we collect, store, and leverage data.

From cloud-native solutions to AI-driven analytics, gain the insights you need to build scalable, resilient, and forward-thinking data systems. Don't miss this opportunity to connect, learn, and lead in the data-driven era.