Buy a ticket Schedule

About the conference

Data Engineering conference SmartData 2021 will take place in October 11-14.

Streaming

  • Flink
  • Spark
  • Kafka
  • Beam
  • Pulsar

DBMS and big data storage

Using classic relational, columnar, NoSQL, SMP/MPP storages to build DWH:

  • Hive, Impala, Presto, Vertica, ClickHouse, Cassandra
  • Teradata, Redshift, GreenPlum, Exadata
  • MSSQL, PostgreSQL
  • MongoDB, DynamoDB
  • S3, ADLS, GCS, HDFS

DWH architecture

  • Data modeling
  • Examples of building corporate data warehouses
  • Operational analytics
  • Ad-hoc reporting
  • Hadoop
  • Iceberg, DeltaLake

Data governance

  • Data security
  • Data quality
  • Metadata and catalog management
  • Master data management
  • Data migration

ETL building technologies

  • Spark
  • Hadoop MapReduce
  • Sqoop
  • NiFi
  • Performance analysis and optimization

Orchestration and MLOps

  • Airflow, Luigi, Oozie
  • MLflow
  • Dagster

Other

  • Data engineering not for data engineers
  • CI/CD for data pipelines
  • Testing

Cloud solutions

  • Snowflake
  • Databricks
  • AWS, GCP, Azure

So, if you are interested in data engineering, if you want to be the first one to learn about the emerging technologies — join us!

Conference features

  • 4K video
  • Livestream and recordings
  • We stand by what we do. If you are not satisfied with your experience, we'll give you your money back
  • Contests and talks from the partners
  • Online discussion zones

Speakers

Jacek Laskowski
Jacek Laskowski
Jacek is an IT freelancer specializing in Apache Spark, Delta Lake, Apache Kafka and Kafka Streams...

Jacek is an IT freelancer specializing in Apache Spark, Delta Lake, Apache Kafka and Kafka Streams (with brief forays into a wider data engineering space, e.g. Presto). Jacek offers software development and consultancy services with very hands-on in-depth workshops and mentoring. He is best known by his online books available free of charge at https://books.japila.pl/.

Andy Pavlo
Andy Pavlo Carnegie Mellon University
Andy Pavlo is an Associate Professor of Databaseology in the Computer Science Department at Carnegie Mellon...

Andy Pavlo is an Associate Professor of Databaseology in the Computer Science Department at Carnegie Mellon University. He is also the co-founder of OtterTune.

Ash Berlin-Taylor
Ash Berlin-Taylor Astronomer.io
Ash has been a contributor to Airflow for almost four years and is a member of...

Ash has been a contributor to Airflow for almost four years and is a member of the Project Management Committee (a.k.a. the Core team) for almost as long. He was the Release Manager for much of the 1.10 release series and he also re-wrote much of the Scheduler internals to be highly-available and increase performance by an order of magnitude (AIP-15).

Outside of Airflow he is the Director of Airflow Engineering at Astronomer.io where he runs the team of developers contribute to the open source Airflow project.

Aleksandr Volochnev
Aleksandr Volochnev Datastax
After many years in Software Development as a developer, technical lead, DevOps engineer, and architect, Aleks...

After many years in Software Development as a developer, technical lead, DevOps engineer, and architect, Aleks focused on cloud computing and distributed systems. Professional Cloud Architect and Developer Advocate, he shares his knowledge and expertise in the field of high-performant and disaster tolerant systems.

Sabir Akhadov
Sabir Akhadov Databricks Inc
Sabir is a software engineer at Databricks working on optimizing physical data layouts for the best...
Sabir Akhadov

Sabir is a software engineer at Databricks working on optimizing physical data layouts for the best performance. Before that, he worked in Databricks performance engineering and benchmarking team.

Sabir was born in Kazakhstan and since then has lived in 4 different countries. He's interested in learning new languages, technologies, and sports, mostly powerlifting and Russian kettlebells.

Tejas Chopra
Tejas Chopra Netflix
Tejas Chopra is a Senior Software Engineer, working in the Data Storage Platform team at Netflix,...
Tejas Chopra

Tejas Chopra is a Senior Software Engineer, working in the Data Storage Platform team at Netflix, where he is responsible for architecting storage solutions to support Netflix Studios and Netflix Streaming Platform. Tejas has worked on distributed file systems & backend architectures, both in on-premise and cloud environments as part of several startups in his career. Tejas is an International Keynote Speaker and periodically conducts seminars on Micro services, NFTs, Software Development & Cloud Computing and has a Masters Degree in Electrical & Computer Engineering from Carnegie Mellon University, with a specialization in Computer Systems.

Vladimir Ozerov
Vladimir Ozerov Querify Labs
Vladimir Ozerov is the founder of Querify Labs, where he manages the research and development of...

Vladimir Ozerov is the founder of Querify Labs, where he manages the research and development of innovative data management products for technology companies. Before that, Vladimir worked on in-memory data platforms Apache Ignite and Hazelcast for more than eight years, focusing on distributed data processing. Vladimir is a committer to Apache Calcite and Apache Ignite projects.

Dmitry Bugaychenko
Dmitry Bugaychenko Sber
Graduated from St. Petersburg State University in 2004, got a PhD degree in the field of...
Dmitry Bugaychenko

Graduated from St. Petersburg State University in 2004, got a PhD degree in the field of the formal logical methods in 2007. Spent almost 9 years in outsourcing without losing contact with the university and research community. Big data analysis at Odnoklassniki became for Dmitry an unique chance to combine theoretical knowledge and scientific foundation with the development of real and popular products. And this chance he gladly took advantage of by coming there in 2011. Joined Sberbank team in 2019.

Nikolay Golov
Nikolay Golov ManyChat
Nikolay is the Head of Data Engineering of ManyChat (SaaS startup), responsible for the implementation and...
Nikolay Golov

Nikolay is the Head of Data Engineering of ManyChat (SaaS startup), responsible for the implementation and growth of its Data Platform (AWS+Redis+Snowflake+Tableau). Previously, from 2013 till 2019 he's headed the Data Platform of Avito, Craigslist of Russia, which grew to a multi-billion dollar company from a small startup. In Avito he was responsible for analytical databases (Vertica, ClickHouse), OLTP engines (PostgreSQL, Redis, MongoDB), and data buses (Kafka) for analytics and micro-services integration. In parallel with those jobs, Nikolay is a researcher of Higher School Economics in Moscow, Russia, having few international publications about data warehousing (Anchor Modeling) and aspects of big data processing.

Partners

We would not be able to hold SmartData on a regular basis without the tremendous support of our partners. Our conference is growing and evolving thanks to their efforts.

Platinum partner

Gold partners

Silver partners

Information partners

If you want to become a partner of our conference, please contact us via email: partners@cppconf.ru.