Schedule

The program has not been finalized yet, so changes are still possible.

Click on the title to read the talk description. All talks with their short descriptions are here.


Day 1. December 09

Time UTC+03:00  & Track
Lecture
Track 1
Track 2
Track 3
Track 4


17:00 - 17:15

Conference opening

17:15 - 18:30
17:15 Track 1
17:15 Track 2
17:15 Track 3
17:15 Track 4
Highly Normalized Hybrid Model, or how we implemented the storage model
Evgeny Ermakov
Yandex.Taxi
Nikolay Grebenshchikov
Yandex.Taxi
#dwh  #anchor  #datavault  #architecture  #datamodeling 
18:30 - 19:00

Break

19:00 - 20:15
19:00 Track 1
19:00 Track 2
19:00 Track 3
19:00 Track 4
20:15 - 20:30

Conference closing

Day 2. December 10


10:30 - 10:45

Conference opening

10:45 - 12:00
10:45 Track 1
10:45 Track 2
10:45 Track 3
NeoFS: Storing object data according to your rules
Stanislav Bogatyrev
NEO Saint Petersburg Competence Center
#objectstorage  #datastorage 
10:45 Track 4
Segmentation: A single window of knowledge about a user
Olga Makarova
ivi
Maria Nosareva
ivi
#streaming  #architecture  #tooling  #analytics 
12:00 - 12:30

Break

12:30 - 13:45
12:30 Track 1
Working with data at a low level
Nikolay Markov
Aligned Research Group
#python  #blas  #optimization 
12:30 Track 2
Digitizing a worker in real-time
Alexey Konyaev
CROC
#stream_analytics  #architecture  #iot 
12:30 Track 3
CI/CD for ML models and datasets
Mikhail Maryfich
Mail.Ru Group
#mlops  #orchestration  #tooling 
12:30 Track 4
13:45 - 14:00

Conference closing

Day 3. December 11


17:00 - 17:15

Conference opening

17:15 - 18:30
17:15 Track 1
Advanced usage patterns of Scala UDF in PySpark
Andrey Titov
NVIDIA
#python  #optimization  #spark  #scala 
17:15 Track 2
17:15 Track 3
17:15 Track 4
18:30 - 19:00

Break

19:00 - 20:15
17:15 Track 1
Advanced usage patterns of Scala UDF in PySpark
Andrey Titov
NVIDIA
#python  #optimization  #spark  #scala 
19:00 Track 2
19:00 Track 3
20:15 - 20:30

Conference closing

Day 4. December 12


10:30 - 10:45

Conference opening

10:45 - 12:00
10:45 Track 2
12:00 - 12:30

Break

12:30 - 13:45
12:30 Track 1
Our repository for web analytics
Artur Hachuyan
Tazeros
#architecture  #analytics 
12:30 Track 2
13:45 - 14:00

Conference closing

Maksim Statsenko

"If artificial intelligence is our future, then big data is the coal of the locomotive that will bring us into it".

Maksim has been working with data for 10 years. He has built ETL pipelines and data storages, analyzed data, and worked on visualization at government organizations (RCOI), energy companies (MOEK, GAZPROM), banks (BRC, VTB24), and IT companies (Yandex, Mail.Ru). Big Data is his wife and mistress. He's always ready to talk about it.

Stanislav Bogatyrev

Co-founder and CIO of NEO Saint Petersburg Competence Center, where he leads NeoFS development.

Before that, for over 15 years he worked in infrastructure and storage systems at Samsung Research, Clodo.ru, and DellEMC.

Nikolay Averin

For the last 3 years, Nikolay has been working for Miro. He migrates the service's data from Redis to PostgreSQL, implements the multi-tenant storage architecture at the application layer, and works on the scaling and fault tolerance of databases. Half a backend engineer, half a DBA.

Nikolay Grebenshchikov

TBD

Pavel Yakunin

Lead developer and the team lead for the big data team at Deutsche Bank Investment division.

Pavel joined Deutsche in 2014. Before that, he earned a Ph.D. in quantum optics, spent some time as a developer at a small hedge fund, and then worked at Yandex. Pavel has been building Big Data at DB with his team for almost four years and would be happy to share his experience.

Evgeny Legky

Evgeny is the founder and CEO of Retable, a powerful cloud platform for visually exploring, cleaning, and preparing data for Data Scientists and Data Engineers.

Evgeny is also a strategy consultant for high-growth Silicon Valley startups, helping them build scalable data stacks and data-oriented products.

Before that, he was the founder and CEO of Segmento, one of the biggest RTB startups in Russia (acquired by Sberbank), and a co-founder of the Hintlab AI laboratory. He has also worked as a software developer at JetBrains and LG.

Evgeny Rizhik

Evgeny has been with Microsoft since 2006 and is currently a Partner Software Engineer. Six years ago, Evgeny's team started an innovative new project in the field of analytics over big data, where he was one of the founders and lead developers. The project has grown into a large cloud service, Azure Data Explorer (Kusto).

Roman Korobeynikov

Roman's whole life has gone with the flow: first the Faculty of Mechanics and Mathematics at Rostov University, then the IT industry. Roman has worked in banks, integrators, game development, and startups, first as a developer and then as a CTO. Currently, Roman is a Java architect at VirtualHealth.

Dmitry Bugaychenko

Dmitry graduated from St. Petersburg State University in 2004 and received a PhD in formal logical methods in 2007. He spent almost 9 years in outsourcing without losing contact with the university and research community. Big data analysis at Odnoklassniki gave Dmitry a unique chance to combine theoretical knowledge and a scientific foundation with the development of real, popular products, and he gladly took it, joining the company in 2011. He joined the Sberbank team in 2019.

Artur Hachuyan

TBD

An infrastructure engineer with almost 10 years of software development experience across various programming languages and platforms. About 8 years of Python programming experience, around 3 years of Go, and a good knowledge of web technologies.

Teaching, mentoring, writing and translating articles on Python, Linux, Big Data, clouds, networking and algorithms. Expertise includes distributed and high-loaded systems, networking, algorithms, concurrency/parallelism, capacity planning and basic statistical data analysis. DevOps and CI/CD enthusiast.

Andrey Titov

Data Engineering, Spark Structured Streaming, Python, Scala.

Aleksandr Ermakov

TBD

Alexey Konyaev

Lead developer at CROC. Over 15 years of experience in IT. While working at CROC, Alexey took part in a project for the Federal State Statistics Service to automate the processing of statistical data from all over Russia. Now he's developing a system based on indoor navigation technologies using Apache Kafka, ClickHouse, and Esper (a complex event processing system).

Maria Nosareva

Big Data developer at ivi.

Andrey Zhukov

TBD

Olga Makarova

Product analyst at ivi and Yandex. ivi Big Data team manager.

Neville is a software engineer at Spotify who works mainly on data infrastructure for machine learning and advanced analytics. In the past few years, he has been driving the adoption of Scala and new data tools for music recommendation, including Scalding, Spark, Storm, and Parquet. Before that, he worked on search quality at Yahoo! and on old-school distributed systems like MPI.

Vladislav Shishkov

TBD

Andrey Kuznetsov

Seven years of full-time teaching in IT, information security, and mathematical statistics. Holds a Ph.D. Currently works on developing recommender systems at ok.ru.

Valentin Azanov

TBD

Machine Learning Engineer at Mail.Ru Group, specializing in Deep Learning. Mikhail has been engaged in machine learning for over 4 years and solves end-to-end problems, from problem formulation to production rollout and ongoing system support. In his professional career, he values the reproducibility of results and good development processes above all.

Bronislav Zhitnikov

Bronislav has been working in IT for over 17 years, nearly 15 of them in software development. For the last 2 years, he has been developing the "Raw data to DataWarehouse" project at Tinkoff as an architect and product owner, building an internal product based on Apache NiFi. Bronislav is also an administrator and active member of the NiFi Users community in Russia (@nifiusers).

TBD

Aleksandr Sloutsky

Alexander Sloutsky is the Principal Engineering Manager of Microsoft's leading Big Data technology, code-named Kusto. He is one of the four founders of this innovative project, which has been powering Microsoft's digital transformation in recent years. Before starting his work on Kusto, Alexander and his team worked on SQL Server Analysis Services, moving the technology into the Azure cloud.

TBD

Jacek Laskowski is a seasoned IT professional specializing in Apache Spark, Delta Lake, Apache Kafka, and Kafka Streams (with brief forays into the wider data engineering space). Jacek offers software development and consultancy services, with very hands-on, in-depth workshops and mentoring. He is best known for his online books, available free of charge at https://books.japila.pl/.