Cited By
View all- Ahmad SGuan HSitaraman RMencagli GDazzi PLowenthal DBadia R(2024)Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy ScalingProceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing10.1145/3625549.3658688(267-280)Online publication date: 3-Jun-2024
- Ahmad SGuan HFriedman BWilliams TSitaraman RWoo TTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)Proteus: A High-Throughput Inference-Serving System with Accuracy ScalingProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3617232.3624849(318-334)Online publication date: 27-Apr-2024
- Cao BSharma AO’Gorman LCoss MJain S(2024)A Lightweight Measure of Classification Difficulty from Application Dataset CharacteristicsPattern Recognition10.1007/978-3-031-78169-8_29(439-455)Online publication date: 30-Nov-2024
- Show More Cited By