
Aleksandr Tokarev
Yandex
Apache Spark is an advanced framework for processing large volumes of unstructured data. Among its advantages are the ability to work with almost any distributed data storage system and the ability to run under virtually any cluster resource manager.
I will explain how we integrated Apache Spark with the YTsaurus resource scheduler.
