Presto and Apache Iceberg - Building out Modern Open Data Lakes

  6,052 views

Presto Foundation

1 day ago

Apache Iceberg is an open table format for huge analytic datasets. Many companies, including Twitter, use it widely to improve the performance of interactive querying on data lakes. At Twitter, engineers built the integrations between Presto and Iceberg to bring the high performance and efficiency of Iceberg to the Presto ecosystem. During this session, Daniel will present an introduction to Apache Iceberg, and Chunxu will discuss the Presto-Iceberg integration and share what they've learned during the development and usage of these next-generation projects.
Speakers:
Chunxu Tang, Sr. Software Engineer at Twitter
Chunxu is a software engineer on Twitter's Interactive Query team, where he works on developing and maintaining Presto and Druid services. He received his doctoral degree from Syracuse University, where he did research on machine learning and distributed collaboration systems.
Daniel Weeks, Co-Founder, Tabular
Daniel Weeks is the co-creator of Apache Iceberg and a co-founder of Tabular. He led the Big Data Compute team at Netflix, which focuses on building out big data processing engines such as Spark, Presto, and Druid in the cloud. He has spent the last 18+ years designing and developing large-scale distributed systems with a focus on data processing and open source technologies.
