Adaptive Distributed Caching for Scalable Machine Learning Services