Talk type: Talk

Writing Your Own Cluster Manager for Apache Spark

  • Talk in Russian

Apache Spark is an advanced framework for processing large amounts of unstructured data. One of its advantages is its ability to work with almost any distributed data storage system, as well as the ability to run with any cluster resource management system.

I will tell you how we integrated Apache Spark to work with the YTsaurus resource scheduler.