Selected projects

Benchmarks for Billion-Scale Similarity Search
To encourage future developments of scalable similarity search algorithms, we release two billion-scale datasets that can serve as representative benchmarks for researchers from the machine learning and algorithmic communities interested in efficient similarity search.
April 26, 2021
A library to train large neural networks across the internet. Imagine training one huge transformer on thousands of computers from universities, companies, and volunteers.
September 1, 2020
Deep image prior
Image restoration with neural networks but without learning.
October 26, 2018
CatBoost is a fast, scalable, high performance gradient boosting on decision trees library. Used for ranking, classification, regression and other ML tasks.
October 17, 2018