Submit a talk Buy a ticket

About the conference

Data Engineering conference SmartData 2021 will take place in October 11-14.

Streaming

  • Flink
  • Spark
  • Kafka
  • Beam
  • Pulsar

DBMS and big data storage

Using classic relational, columnar, NoSQL, SMP/MPP storages to build DWH:

  • Hive, Impala, Presto, Vertica, ClickHouse, Cassandra
  • Teradata, Redshift, GreenPlum, Exadata
  • MSSQL, PostgreSQL
  • MongoDB, DynamoDB
  • S3, ADLS, GCS, HDFS

DWH architecture

  • Data modeling
  • Examples of building corporate data warehouses
  • Operational analytics
  • Ad-hoc reporting
  • Hadoop
  • Iceberg, DeltaLake

Data governance

  • Data security
  • Data quality
  • Metadata and catalog management
  • Master data management
  • Data migration

ETL building technologies

  • Spark
  • Hadoop MapReduce
  • Sqoop
  • NiFi
  • Performance analysis and optimization

Orchestration and MLOps

  • Airflow, Luigi, Oozie
  • MLflow
  • Dagster

Other

  • Data engineering not for data engineers
  • CI/CD for data pipelines
  • Testing

Cloud solutions

  • Snowflake
  • Databricks
  • AWS, GCP, Azure

So, if you are interested in data engineering, if you want to be the first one to learn about the emerging technologies — join us!

Conference features

  • Virtual platform and networking
  • 4K video
  • Livestream and recordings
  • We stand by what we do. If you are not satisfied with your experience, we'll give you your money back
  • Contests and talks from the partners
  • Online discussion zones

Speakers

Andy Pavlo
Andy Pavlo Carnegie Mellon University
Associate Professor of Databaseology in the Computer Science Department at Carnegie Mellon University. His research interest...
Andy Pavlo

Associate Professor of Databaseology in the Computer Science Department at Carnegie Mellon University. His research interest is in database management systems, specifically main memory systems, self-driving/autonomous architectures, transaction processing systems, and large-scale data analytics. At CMU, Andy is a member of the Database Group and the Parallel Data Laboratory. He's the co-founder and CEO of OtterTune.

Evgeny Ermakov
Evgeny Ermakov Yandex Go
More than 10 years of experience in IT. Architect of data warehouses and analysis systems at...

More than 10 years of experience in IT. Architect of data warehouses and analysis systems at Mail.ru Group and Yandex.Go. Candidate of Technical Sciences, author of more than 10 papers in data analysis, co-author of a monograph on the theory and practice of parallel database analysis.

Nikolay Grebenshchikov
Nikolay Grebenshchikov Yandex.Go
Over 15 years of experience in the IT field. For the last 1.5 years, Nikolay has...
Nikolay Grebenshchikov

Over 15 years of experience in the IT field. For the last 1.5 years, Nikolay has been developing data storage at Yandex.Go. Specializes in MPP Greenplum DBMS.

Kirill Rybachuk
Kirill Rybachuk Cherry Labs
8 years in the machine learning industry, 4 years in developing computer vision systems at Cherry...
Kirill Rybachuk

8 years in the machine learning industry, 4 years in developing computer vision systems at Cherry Labs. Interested in building ML pipelines, optimizing models, making stuff automated and flexible, for the needs of both production and research.

Nikolay Golov
Nikolay Golov ManyChat
Nikolay is the Head of Data Engineering of ManyChat (SaaS startup), responsible for the implementation and...
Nikolay Golov

Nikolay is the Head of Data Engineering of ManyChat (SaaS startup), responsible for the implementation and growth of its Data Platform (AWS+Redis+Snowflake+Tableau). Previously, from 2013 till 2019 he's headed the Data Platform of Avito, Craigslist of Russia, which grew to a multi-billion dollar company from a small startup. In Avito he was responsible for analytical databases (Vertica, ClickHouse), OLTP engines (PostgreSQL, Redis, MongoDB), and data buses (Kafka) for analytics and micro-services integration. In parallel with those jobs, Nikolay is a researcher of Higher School Economics in Moscow, Russia, having few international publications about data warehousing (Anchor Modeling) and aspects of big data processing.

Partners

We would not be able to hold SmartData on a regular basis without the tremendous support of our partners. Our conference is growing and evolving thanks to their efforts.

Information partners

If you want to become a partner of our conference, please contact us via email: partners@cppconf.ru.