YTsaurus: The open-source analysis platform for 42 million monthly active users in Yandex Go

Maxim Pchelin, product team lead for Yandex Go’s Data Management Platform (DMP), on how to use YTsaurus for managing massive data flows

Yandex Go, an integrated service within the broader Yandex ecosystem, poses a data management challenge as tough as you will likely encounter in any industry. It’s a single app that brings multiple essential services together. Users can request rides, rent scooters, and order food, groceries, and other items for delivery. From a data management perspective, this translates to 42 million MAU (monthly active users) to handle and to process. So how do we manage it?

I’m Maxim Pchelin. I led the Product team for Yandex Go’s Data Management Platform (DMP). I co-authored this article with Vladimir Verstov, the Head of DMP development at Yandex Go. Our goal is to illustrate our process for managing massive data flows and serve as a guide for those looking to do the same. We’ll discuss our data architecture and tech stack and introduce you to YTsaurus — the data management core that made this impossible task possible for Yandex Go. We’ll explore how we used YTsaurus, what advantages it brought to our work, and how it can benefit yours, regardless of the scale of your operation.

Read more on Medium.

Sign in to save this post