Christopher Olston

2papers

2 Papers

DCDec 17, 2017Code
TensorFlow-Serving: Flexible, High-Performance ML Serving

Christopher Olston, Noah Fiedel, Kiril Gorovoy et al.

We describe TensorFlow-Serving, a system to serve machine learning models inside Google which is also available in the cloud and via open-source. It is extremely flexible in terms of the types of ML platforms it supports, and ways to integrate with systems that convey new models and updated versions from training to serving. At the same time, the core code paths around model lookup and inference have been carefully optimized to avoid performance pitfalls observed in naive implementations. Google uses it in many production deployments, including a multi-tenant model hosting service called TFS^2.

DBDec 23, 2020
Learned Indexes for a Google-scale Disk-based Database

Hussam Abu-Libdeh, Deniz Altınbüken, Alex Beutel et al.

There is great excitement about learned index structures, but understandable skepticism about the practicality of a new method uprooting decades of research on B-Trees. In this paper, we work to remove some of that uncertainty by demonstrating how a learned index can be integrated in a distributed, disk-based database system: Google's Bigtable. We detail several design decisions we made to integrate learned indexes in Bigtable. Our results show that integrating learned index significantly improves the end-to-end read latency and throughput for Bigtable.