ASPLOS'24: International Conference on Architectural Support for Programming Languages and Operating Systems
Lightning Talks - Session 2D: ML Inference Systems
Paper Title: Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
Authors: Sohaib Ahmad and Hui Guan (University of Massachusetts Amherst); Brian D. Friedman and Thomas Williams (Nokia Bell Labs); Ramesh K. Sitaraman (University of Massachusetts Amherst); Thomas Woo (Nokia Bell Labs)