

Talk. Date: 14.09 / Start: 00:00 – Finish: 00:00

Fast data processing in Data Lake with Trino

Database Internals

In Russian


The speaker will cover the implementation and practical use of the key optimizations that let Trino and related commercial products quickly crunch the data in your lake: using Parquet and ORC metadata to reduce the amount of data read (projection, filter, and aggregate pushdown), dynamic filtering (also known as runtime filtering), late materialization of columns, and as many as three local caches: a metadata cache, a data cache, and an intermediate query result cache.
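As a rough illustration of the first item, filter pushdown based on file metadata, the sketch below skips whole row groups whose stored min/max statistics prove that no row can match the predicate. It is a toy model and not Trino's implementation: `RowGroup` and `scan_greater_than` are hypothetical names, and the statistics are plain Python fields standing in for the per-row-group statistics that Parquet and ORC keep in their metadata.

```python
from dataclasses import dataclass


@dataclass
class RowGroup:
    """Hypothetical stand-in for one row group of a single integer column."""
    min_value: int      # statistic that a real file would keep in its metadata
    max_value: int      # statistic that a real file would keep in its metadata
    values: list[int]   # the column data, which is expensive to read


def scan_greater_than(row_groups, threshold):
    """Evaluate `col > threshold`; return (matching values, row groups read)."""
    matches = []
    groups_read = 0
    for group in row_groups:
        # The statistics alone prove that no row can match: skip the group unread.
        if group.max_value <= threshold:
            continue
        groups_read += 1
        matches.extend(v for v in group.values if v > threshold)
    return matches, groups_read


row_groups = [
    RowGroup(1, 9, [1, 5, 9]),
    RowGroup(10, 19, [10, 14, 19]),
    RowGroup(20, 30, [20, 25, 30]),
]
matches, groups_read = scan_greater_than(row_groups, 18)
print(matches, groups_read)  # [19, 20, 25, 30] 2
```

Only two of the three row groups are read here; the first is eliminated from its statistics alone, which is the effect the metadata-based optimizations in the abstract aim for.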

#trino
#cedrusdata
#optimization

Speakers

Vladimir Ozerov
Querify Labs

Invited experts

Artem Aliev

Other talks on «Database Internals»
- Speed-up queries: How to Cook ClickHouse Well-done
  Kuzma Leshakov
  Yandex Cloud
  Room 1, in Russian
- ACID Transactions in Apache Cassandra 5.0
  Aleksandr Volochnev
  Datastax
  In Russian
- How We Adapted Dynamic YTsaurus Tables to Store Blobs
  Maksim Babenko
  Yandex
  Room 1, in Russian
- Compression, encryption and more: changing the behavior and guarantees of a distributed database
  Anton Vinogradov
  Apache Software Foundation
  Room 1, in Russian
- Scheduling a Billion of Tasks per Day
  Ignat Kolesnichenko
  YTsaurus
  In Russian
- Moving Towards Universality: A Hybrid OLTP Database with OLAP Query Support
  Aleksei Dmitriev
  Yandex
  Room 2, in Russian
- A distributed SQL query engine for data analytics
  Alexey Ozeritskii
  Yandex
  Room 2, in Russian
- Predictive Analysis of Parasitic Load on GreenPlum Clusters
  Mark Lebedev
  GlowByte Consulting
  Pavel Ternyuk
  Data Sapience
  Room 2, in Russian
- Application of TLA+ for Efficient Testing of Distributed Systems
  Nikita Siniachenko
  VK
  Evgenii Chernatskiy
  VK
  Room 3, in Russian
- What it takes to achieve linearizability in a distributed system
  Sergey Petrenko
  Tarantool
  Room 3, in Russian
- Deep Dive Into Query Performance
  Peter Zaitsev
  Percona
  In Russian