Aleksandr Tokarev
Company: Yandex
Apache Spark is an advanced framework for processing large amounts of unstructured data. One of its advantages is its ability to work with almost any distributed data storage system, as well as the ability to run with any cluster resource management system.
I will tell you how we integrated Apache Spark to work with the YTsaurus resource scheduler.
Company: Yandex
Company: Ozon