Modern Data Infrastructure Summit
Build infrastructure that can handle AI, ML, and real-time workloads at scale
Join engineers and architects at MDI Summit to learn the architectures and technologies powering today's most demanding workloads.
Register now
About
Shaping the Future of Data-Driven Innovation
Who's it for?
If you're building, running, or modernizing large-scale data systems, MDI Summit is for you. AI, ML, and real-time workloads are putting unprecedented strain on data infrastructure. This event brings together the engineers, architects, and infrastructure leaders who are solving those challenges today.
What you'll learn
From real-time ML feature stores to globally distributed databases to LLMs serving millions, you'll hear how leading teams are designing scalable, resilient, and cost-efficient systems for the AI era. Walk away with strategies, tools, and connections to keep your infrastructure ahead of what's next.
Schedule
Talks
ChatGPT Ain't Got $%@& On Me! The Future of Automated Database Tuning
LLMs are coming for your query planner, but can they really out-tune decades of handcrafted algorithms? This talk dives into cutting-edge research on autonomous agents that tune PostgreSQL end-to-end, covering knobs, indexes, and query plans. Learn what works, what doesn't, and what's next on the road to self-driving databases.

Andy Pavlo
Associate Professor of Databaseology, Carnegie Mellon University
The Past Present and Future of In-Memory Datastores
In-memory datastores have been an essential component of the internet's data infrastructure for the past 30 years. From static websites to Web 2.0 to applications powered by ML and AI, their capabilities, use cases, and architectural designs are an evolving story that mirrors the growth of the internet itself. Join us on a trip through the past, present, and future of this foundational data technology.

Oded Poncz
Co-Founder & CEO, Dragonfly
From Consistent Hashing to… Elasticity!
Learn how ScyllaDB reimagined consistent hashing with tablets to unlock truly elastic, topology-agnostic scaling without painful rebalancing or overcommitted nodes.

Felipe Cardeneti Mendes
Technical Director, ScyllaDB
Observability Data Lake: Motivation and Methodology
Big data meets observability. This session explores how to build scalable, cost-effective observability infrastructure using open formats and protocols.

Ning Sun
CTO & Co-founder, Greptime

Ruihang Xia
SWE, Greptime
Deploying Agentic AI safely and scalably in the modern enterprise: What we can learn from two decades of streaming practice across the industry
Agentic AI raises the stakes. Learn how decades of streaming system best practices offer a stable foundation for deploying autonomous agents in production.

Tyler Akidau
CTO, Redpanda
Ducks on a Lake: Scaling Data Lakes to Warehouse Performance
Explore DuckLake, a new open table format from DuckDB's creators, designed to bring warehouse-grade performance and metadata control to data lakes.

Ryan Boyd
Co-founder, MotherDuck
Scaling Without Burn: Cloud Cost Efficiency at Billion-User Scale
What happens when you hit hypergrowth and your cloud bill hits back? Learn how ShareChat cut cloud spend from $150M to <$20M without slowing down, along with some hard lessons on culture, architecture, and building cost-efficient infra that can actually keep up.

Arya Ketan
Distinguished Engineer, ShareChat
Frequently Asked Questions
Have more questions? Please reach out via our contact form
This premier event brings together industry leaders, technologists, and innovators to explore how modern architectures and advanced technologies are transforming the way we collect, store, and leverage data.
From cloud-native solutions to AI-driven analytics, gain the insights you need to build scalable, resilient, and forward-thinking data systems. Don't miss this opportunity to connect, learn, and lead in the data-driven era.