Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
Prompt sketching for large language models
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 147, Pages 3674–3706Many recent prompting strategies for large language models (LLMs) query the model multiple times sequentially - first to produce intermediate results and then the final answer. However, using these methods, both decoder and model are unaware of potential ...
- research-articleJanuary 2025
Guiding LLMs the right way: fast, non-invasive constrained generation
ICML'24: Proceedings of the 41st International Conference on Machine LearningArticle No.: 146, Pages 3658–3673To ensure that text generated by large language models (LLMs) is in an expected format, constrained decoding proposes to enforce strict formal language constraints during generation. However, as we show in this work, not only do such methods incur ...
- research-articleMay 2024
Connecting certified and adversarial training
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 3211, Pages 73422–73440Training certifiably robust neural networks remains a notoriously hard problem. While adversarial training optimizes under-approximations of the worst-case loss, which leads to insufficient regularization for certification, sound certified training ...
- research-articleMay 2024
Automated classification of model errors on ImageNet
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 1603, Pages 36826–36885While the ImageNet dataset has been driving computer vision research over the past decade, significant label noise and ambiguity have made top-1 accuracy an insufficient measure of further progress. To address this, new label-sets and evaluation ...
- research-articleNovember 2023
The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of Cores
- Maciej Besta,
- Robert Gerstenberger,
- Marc Fischer,
- Michal Podstawski,
- Nils Blach,
- Berke Egeli,
- Georgy Mitenkov,
- Wojciech Chlapek,
- Marek Michalewicz,
- Hubert Niewiadomski,
- Juergen Mueller,
- Torsten Hoefler
SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 22, Pages 1–18https://doi.org/10.1145/3581784.3607068Graph databases (GDBs) are crucial in academic and industry applications. The key challenges in developing GDBs are achieving high performance, scalability, programmability, and portability. To tackle these challenges, we harness established practices ...
-
- research-articleOctober 2023
The Impact of Occlusion on Depth Perception at Arm's Length
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 29, Issue 11Pages 4494–4502https://doi.org/10.1109/TVCG.2023.3320239This paper investigates the accuracy of Augmented Reality (AR) technologies, particularly commercially available optical see-through displays, in depicting virtual content inside the human body for surgical planning. Their inherent limitations result in ...
- surveySeptember 2023
Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries
- Maciej Besta,
- Robert Gerstenberger,
- Emanuel Peter,
- Marc Fischer,
- Michał Podstawski,
- Claude Barthels,
- Gustavo Alonso,
- Torsten Hoefler
ACM Computing Surveys (CSUR), Volume 56, Issue 2Article No.: 31, Pages 1–40https://doi.org/10.1145/3604932Numerous irregular graph datasets, for example social networks or web graphs, may contain even trillions of edges. Often, their structure changes over time and they have domain-specific rich data associated with vertices and edges. Graph database systems ...
Prompting Is Programming: A Query Language for Large Language Models
Proceedings of the ACM on Programming Languages (PACMPL), Volume 7, Issue PLDIArticle No.: 186, Pages 1946–1969https://doi.org/10.1145/3591300Large language models have demonstrated outstanding performance on a wide range of tasks such as question answering and code generation. On a high level, given an input, a language model can be used to automatically complete the sequence in a ...
Abstract Interpretation of Fixpoint Iterators with Applications to Neural Networks
Proceedings of the ACM on Programming Languages (PACMPL), Volume 7, Issue PLDIArticle No.: 138, Pages 786–810https://doi.org/10.1145/3591252We present a new abstract interpretation framework for the precise over-approximation of numerical fixpoint iterators. Our key observation is that unlike in standard abstract interpretation (AI), typically used to over-approximate all reachable ...
- research-articleJune 2023
Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 34, Issue 6Pages 1860–1876https://doi.org/10.1109/TPDS.2021.3131677Graph processing has become an important part of various areas of computing, including machine learning, medical applications, social network analysis, computational sciences, and others. A growing amount of the associated graph processing workloads are <...
- research-articleApril 2024
(De-)Randomized smoothing for decision stump ensembles
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 222, Pages 3066–3081Tree-based models are used in many high-stakes application domains such as finance and medicine, where robustness and interpretability are of utmost importance. Yet, methods for improving and certifying their robustness are severely under-explored, in ...
- research-articleNovember 2022
Private and Reliable Neural Network Inference
CCS '22: Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications SecurityPages 1663–1677https://doi.org/10.1145/3548606.3560709Reliable neural networks (NNs) provide important inference-time reliability guarantees such as fairness and robustness. Complementarily, privacy-preserving NN inference protects the privacy of client data. So far these two emerging areas have been ...
- ArticleAugust 2022
Shared Certificates for Neural Network Verification
AbstractExisting neural network verifiers compute a proof that each input is handled correctly under a given perturbation by propagating a symbolic abstraction of reachable values at each layer. This process is repeated from scratch independently for each ...
- ArticleJune 2022
Comprehensive Analysis of Software-Based Fault Tolerance with Arithmetic Coding for Performant Encoding of Integer Calculations
AbstractSafety-critical systems are becoming more complex with use cases like autonomous driving or human-robot collaboration. Therefore, the performance impact of software-based fault-tolerance methods is challenging. Using software-based fault tolerance ...
- research-articleSeptember 2021
Reinforcement learning methods based on GPU accelerated industrial control hardware
Neural Computing and Applications (NCAA), Volume 33, Issue 18Pages 12191–12207https://doi.org/10.1007/s00521-021-05848-4AbstractReinforcement learning is a promising approach for manufacturing processes. Process knowledge can be gained automatically, and autonomous tuning of control is possible. However, the use of reinforcement learning in a production environment imposes ...
- research-articleDecember 2020
Certified defense to image transformations via randomized smoothing
NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 704, Pages 8404–8417We extend randomized smoothing to cover parameterized transformations (e.g., rotations, translations) and certify robustness in the parameter space (e.g., rotation angle). This is particularly challenging as interpolation and rounding effects mean that ...
- research-articleDecember 2020
Learning certified individually fair representations
NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 636, Pages 7584–7596Fair representation learning provides an effective way of enforcing fairness constraints without compromising utility for downstream users. A desirable family of such fairness constraints, each requiring similar treatment for similar individuals, is ...
- research-articleApril 2020
Substream-Centric Maximum Matchings on FPGA
ACM Transactions on Reconfigurable Technology and Systems (TRETS), Volume 13, Issue 2Article No.: 8, Pages 1–33https://doi.org/10.1145/3377871Developing high-performance and energy-efficient algorithms for maximum matchings is becoming increasingly important in social network analysis, computational sciences, scheduling, and others. In this work, we propose the first maximum matching ...
- research-articleMarch 2019
Practice Prize Paper–Managing Advertising Campaigns for New Product Launches: An Application at Mercedes-Benz
This paper introduces a new decision support tool to optimize advertising campaigns for new product launches based on learnings from an ex post analysis of prior campaigns.
The launch of a new product is one of the most critical activities that product and brand managers are faced with. It requires a substantial communications budget to introduce the new product to the market. As the number of media channels proliferates, ...
- research-articleFebruary 2019
Substream-Centric Maximum Matchings on FPGA
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysPages 152–161https://doi.org/10.1145/3289602.3293916Developing high-performance and energy-efficient algorithms for maximum matchings is becoming increasingly important in social network analysis, computational sciences, scheduling, and others. In this work, we propose the first maximum matching ...