- research-article, December 2024
QoS-Diff: Adaptive Auto-tuning Framework for Low-latency Diffusion Model Inference
MMAsia '24: Proceedings of the 6th ACM International Conference on Multimedia in Asia, Article No.: 113, Pages 1–7, https://doi.org/10.1145/3696409.3700277
Diffusion models are pivotal for generating high-quality images, yet they encounter latency and throughput challenges in data center environments, particularly in meeting stringent service level objectives (SLOs). This paper introduces the Quality of ...
- short-paper, December 2024
Using Isoefficiency as a Metric to Assess Disaggregated Memory Systems for High Performance Computing
MEMSYS '24: Proceedings of the International Symposium on Memory Systems, Pages 192–197, https://doi.org/10.1145/3695794.3695812
Memory disaggregation is an approach to decouple compute and memory to minimize the total cost of ownership. However, analytical methods to study the impact of this approach are not readily available for high performance computing use cases. In this ...
- book, November 2012
Java Microarchitectures
Java is an exciting new object-oriented technology. Hardware for supporting objects and other features of Java such as multithreading, dynamic linking and loading is the focus of this book. The impact of Java's features on micro-architectural resources ...