

Talk

Use Cases

Date: 04.09 / Start: 00:00 – Finish: 00:00

ML Inference Neural Network Services in Yandex Advertising

In Russian


How to build efficient neural network inference services at the scale of tens of thousands of cores and hundreds of GPUs for a dozen customers.

The talk is aimed at those who: work in MLOps or ML inference; are interested in what inference services look like in Yandex Advertising; have built large systems of services that are constrained by CPU and memory; like to develop their services in C++ and invest in efficiency and optimisation.

Speakers

Dmitrii Ulianin
Yandex

Invited experts

Egor Shestopalov
T-Bank

Other talks on «Use Cases»
- dbt in Action: Real-life Case Studies and Lifehacks
  Antony Aleksandrov
  Detsky Mir
  Room 2, in Russian
- From Manual Labour to Automatic Generation of Data Quality Checks
  Aleksandr Madumarov
  Innovation Centre "Bezopasnyj transport" GKU CODD
  Room 3, in Russian
- How We Reduced the TTM of Dashboard Creation
  Anar Baghirov
  Avito
  Room 2, in Russian
- Using Probabilistic Data Structures to Optimise ETL Processes
  Dmitrii Vertlib
  Chestnyj znak
  In Russian
- From Hype to Production: Data Mesh on Airflow + dbt
  Nikita Yurasov
  Toloka
  Leonid Kozhinov
  Toloka
  In Russian
- How to Build a RAG Pipeline Using LlamaIndex
  Alsu Nurutdinova
  Positive Technologies
  Alina Kocheva
  Positive Technologies
  In Russian
- Every Byte Is Worth Its Weight in Gold. Experience of Building a DMP in Yandex Advertising
  Aleksei Stytsenko
  Yandex
  Room 1, in Russian
- The State of Data, RU Edition
  Oleg Kochergin
  Positive Technologies
  Room 2, in Russian
- How We Tested 5 Ways to Upload Data to Greenplum and What Was the Outcome
  Tatiana Didova
  AERO
  Room 2, in Russian
- VKontakte Serialiser Optimisations
  Ilia Kokorin
  VK
  Ilia Asadullin
  VK
  Room 3, in Russian