[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

How Much Data Is Sufficient to Learn High-Performing Algorithms?

Published: 05 October 2024 Publication History

Abstract

Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case approximation ratio or runtime bound. Worst-case instances, however, may be rare or nonexistent in practice. A growing body of research has demonstrated that a data-driven approach to parameter tuning can lead to significant improvements in performance. This approach uses a training set of problem instances sampled from an unknown, application-specific distribution and returns a parameter setting with strong average performance on the training set.
We provide techniques for deriving generalization guarantees that bound the difference between the algorithm’s average performance over the training set and its expected performance on the unknown distribution. Our results apply no matter how the parameters are tuned, be it via an automated or manual approach. The challenge is that for many types of algorithms, performance is a volatile function of the parameters: slightly perturbing the parameters can cause a large change in behavior. Prior research [e.g., 12, 16, 20, 62] has proved generalization bounds by employing case-by-case analyses of greedy algorithms, clustering algorithms, integer programming algorithms, and selling mechanisms. We streamline these analyses with a general theorem that applies whenever an algorithm’s performance is a piecewise-constant, piecewise-linear, or—more generally—piecewise-structured function of its parameters. Our results, which are tight up to logarithmic factors in the worst case, also imply novel bounds for configuring dynamic programming algorithms from computational biology.

References

[1]
Tobias Achterberg. 2009. SCIP: Solving constraint integer programs. Mathematical Programming Computation 1, 1 (2009), 1–41.
[2]
Saba Ahmadi, Hedyeh Beyhaghi, Avrim Blum, and Keziah Naggita. 2022. Setting fair incentives to maximize improvement. arXiv:2203.00134. Retrieved from https://arxiv.org/abs/2203.00134
[3]
Noga Alon, Moshe Babaioff, Yannai A. Gonczarowski, Yishay Mansour, Shay Moran, and Amir Yehudayoff. 2017. Submultiplicative Glivenko-Cantelli and uniform convergence of revenues. Proceedings of the Annual Conference on Neural Information Processing Systems.
[4]
Martin Anthony and Peter Bartlett. 2009. Neural Network Learning: Theoretical Foundations. Cambridge University Press.
[5]
Patrick Assouad. 1983. Densité et dimension. Annales de l’Institut Fourier 33, 3 (1983), 233–282.
[6]
Maria-Florina Balcan, Travis Dick, and Manuel Lang. 2020. Learning to link. In Proceedings of the International Conference on Learning Representations.
[7]
Maria-Florina Balcan, Travis Dick, and Wesley Pegden. 2020. Semi-bandit optimization in the dispersed setting. In Proceedings of the Conference on Uncertainty in Artificial Intelligence.
[8]
Maria-Florina Balcan. 2020. Data-driven algorithm design. In Proceedings of the Beyond Worst Case Analysis of Algorithms, Tim Roughgarden (Ed.). Cambridge University Press.
[9]
Maria-Florina Balcan, Avrim Blum, Jason D. Hartline, and Yishay Mansour. 2005. Mechanism design via machine learning. In Proceedings of the Annual Symposium on Foundations of Computer Science. 605–614.
[10]
Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, and Ellen Vitercik. 2019. How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design. arXiv:1908.02894. Retrieved from https://arxiv.org/abs/1908.02894
[11]
Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, and Ellen Vitercik. 2021. How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design. In Proceedings of the Annual Symposium on Theory of Computing.
[12]
Maria-Florina Balcan, Travis Dick, Tuomas Sandholm, and Ellen Vitercik. 2018. Learning to branch. International Conference on Machine Learning (2018).
[13]
Maria-Florina Balcan, Travis Dick, and Ellen Vitercik. 2018. Dispersion for data-driven algorithm design, online learning, and private optimization. In Proceedings of the Annual Symposium on Foundations of Computer Science .
[14]
Maria-Florina Balcan, Travis Dick, and Colin White. 2018. Data-driven clustering via parameterized Lloyd’s families. In Proceedings of the Annual Conference on Neural Information Processing Systems. 10641–10651.
[15]
Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, and Ameet Talwalkar. 2022. Provably tuning the ElasticNet across instances. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[16]
Maria-Florina Balcan, Vaishnavh Nagarajan, Ellen Vitercik, and Colin White. 2017. Learning-theoretic foundations of algorithm configuration for combinatorial partitioning problems. Conference on Learning Theory (2017).
[17]
Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, and Ellen Vitercik. 2021. Sample complexity of tree search configuration: Cutting planes and beyond. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[18]
Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, and Ellen Vitercik. 2022. Structural analysis of branch-and-cut and the learnability of gomory mixed integer cuts. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[19]
Maria-Florina Balcan, Tuomas Sandholm, and Ellen Vitercik. [n.d.]. Generalization guarantees for multi-item profit maximization: Pricing, auctions, and randomized mechanisms. Operations Research ([n. d.]), (to appear).
[20]
Maria-Florina Balcan, Tuomas Sandholm, and Ellen Vitercik. 2018. A general theory of sample complexity for multi-item profit maximization. In Proceedings of the ACM Conference on Economics and Computation. Extended abstract.
[21]
Maria-Florina Balcan, Tuomas Sandholm, and Ellen Vitercik. 2020. Learning to optimize computational resources: Frugal training with generalization guarantees. AAAI Conference on Artificial Intelligence (2020).
[22]
Maria-Florina Balcan, Tuomas Sandholm, and Ellen Vitercik. 2021. Generalization in portfolio-based algorithm selection. In Proceedings of the AAAI Conference on Artificial Intelligence.
[23]
Maria-Florina Balcan and Dravyansh Sharma. 2021. Data driven semi-supervised learning. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[24]
Peter Bartlett, Piotr Indyk, and Tal Wagner. 2022. Generalization bounds for data-driven numerical linear algebra. In Proceedings of the Conference on Learning Theory.
[25]
Jon Louis Bentley, David S. Johnson, Frank Thomson Leighton, Catherine C. McGeoch, and Lyle A. McGeoch. 1984. Some unexpected expected behavior results for bin packing. In Proceedings of the Annual Symposium on Theory of Computing. 279–288.
[26]
Avrim Blum, Chen Dan, and Saeed Seddighin. 2021. Learning complexity of simulated annealing. In Proceedings of the International Conference on Artificial Intelligence and Statistics.
[27]
Sébastien Bubeck, Nikhil R. Devanur, Zhiyi Huang, and Rad Niazadeh. 2017. Online auctions and multi-scale online learning. Proceedings of the ACM Conference on Economics and Computation (2017).
[28]
Robert Creighton Buck. 1943. Partition of space. The American Mathematical Monthly 50 (1943), 541–544.
[29]
Yang Cai and Constantinos Daskalakis. 2017. Learning multi-item auctions with (or without) samples. In Proceedings of the Annual Symposium on Foundations of Computer Science.
[30]
Shuchi Chawla, Evangelia Gergatsouli, Yifeng Teng, Christos Tzamos, and Ruimin Zhang. 2020. Pandora’s box with correlations: Learning and approximation. In Proceedings of the Annual Symposium on Foundations of Computer Science.
[31]
Antonia Chmiela, Elias B. Khalil, Ambros Gleixner, Andrea Lodi, and Sebastian Pokutta. 2021. Learning to schedule heuristics in branch-and-bound. arXiv:2103.10294. Retrieved from https://arxiv.org/abs/2103.10294
[32]
Ed H. Clarke. 1971. Multipart pricing of public goods. Public Choice 11 (1971), 17–33.
[33]
Vincent Cohen-Addad and Varun Kanade. 2017. Online optimization of smoothed piecewise constant functions. In Proceedings of the International Conference on Artificial Intelligence and Statistics.
[34]
Richard Cole and Tim Roughgarden. 2014. The sample complexity of revenue maximization. In Proceedings of the Annual Symposium on Theory of Computing.
[35]
Dan DeBlasio and John D. Kececioglu. 2018. Parameter Advising for Multiple Sequence Alignment. Springer.
[36]
Nikhil R. Devanur, Zhiyi Huang, and Christos-Alexandros Psomas. 2016. The sample complexity of auctions with side information. In Proceedings of the Annual Symposium on Theory of Computing.
[37]
Paul Dütting, Silvio Lattanzi, Renato Paes Leme, and Sergei Vassilvitskii. 2020. Secretaries with advice. arXiv:2011.06726. Retrieved from https://arxiv.org/abs/2011.06726
[38]
Talya Eden, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, and Tal Wagner. 2021. Learning-based support estimation in sublinear time. In Proceedings of the International Conference on Learning Representations.
[39]
Robert C. Edgar. 2010. Quality measures for protein alignment benchmarks. Nucleic Acids Research 38, 7 (2010), 2145–2153.
[40]
Edith Elkind. 2007. Designing and learning optimal finite support auctions. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms.
[41]
Marc Etheve, Zacharie Alès, Côme Bissuel, Olivier Juan, and Safia Kedad-Sidhoum. 2020. Reinforcement learning for variable selection in a branch and bound algorithm. Springer, 176–185.
[42]
Étienne Bamas, Andreas Maggiori, Lars Rohwedder, and Ola Svensson. 2020. Learning augmented energy minimization via speed scaling. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[43]
Étienne Bamas, Andreas Maggiori, and Ola Svensson. 2020. The primal-dual method for learning augmented algorithms. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[44]
Uriel Feige and Michael Langberg. 2006. The RPR\(^2\) rounding technique for semidefinite programs. Journal of Algorithms 60, 1 (2006), 1–23.
[45]
Da-Fei Feng and Russell F. Doolittle. 1987. Progressive sequence alignment as a prerequisite to correct phylogenetic trees. Journal of Molecular Evolution 25, 4 (1987), 351–360.
[46]
Aaron Ferber, Bryan Wilder, Bistra Dilkina, and Milind Tambe. 2020. MIPaaL: Mixed integer program as a layer. In Proceedings of the AAAI Conference on Artificial Intelligence. 1504–1511.
[47]
David Fernández-Baca, Timo Seppäläinen, and Giora Slutzki. 2004. Parametric multiple sequence alignment and phylogeny construction. Journal of Discrete Algorithms 2, 2 (2004), 271–287.
[48]
Darya Filippova, Rob Patro, Geet Duggal, and Carl Kingsford. 2014. Identification of alternative topological domains in chromatin. Algorithms for Molecular Biology 9, 1 (2014), 14.
[49]
Nikolaus Furian, Michael O’sullivan, Cameron Walker, and Eranda Çela. 2021. A machine learning-based branch and price algorithm for a sampled vehicle routing problem. OR Spectrum (2021), 1–40.
[50]
Vikas Garg and Adam Kalai. 2018. Supervising unsupervised learning. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[51]
Andrew Gilpin and Tuomas Sandholm. 2011. Information-theoretic approaches to branching in search. Discrete Optimization 8, 2 (2011), 147–159. Early version in IJCAI-07.
[52]
Ashish Goel, Anilesh K. Krishnaswamy, Sukolsak Sakshuwong, and Tanja Aitamurto. 2019. Knapsack voting for participatory budgeting. ACM Transactions on Economics and Computation 7, 2 (2019), 1–27.
[53]
Michel X. Goemans and David P. Williamson. 1995. Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. Journal of the ACM 42, 6 (1995), 1115–1145.
[54]
Ken Goldberg, Theresa Roeder, Dhruv Gupta, and Chris Perkins. 2001. Eigentaste: A constant time collaborative filtering algorithm. Information Retrieval 4, 2 (2001), 133–151.
[55]
Paul Goldberg and Mark Jerrum. 1993. Bounding the Vapnik-Chervonenkis dimension of concept classes parameterized by real numbers. In Proceedings of the Conference on Learning Theory.
[56]
Yannai A. Gonczarowski and Noam Nisan. 2017. Efficient empirical revenue maximization in single-parameter auction environments. In Proceedings of the Annual Symposium on Theory of Computing. 856–868.
[57]
Yannai A. Gonczarowski and S. Matthew Weinberg. 2018. The sample complexity of up-to-\(\varepsilon\) multi-dimensional revenue maximization. In Proceedings of the Annual Symposium on Foundations of Computer Science.
[58]
Yannai A. Gonczarowski and S. Matthew Weinberg. 2021. The sample complexity of up-to-\(\varepsilon\) multi-dimensional revenue maximization. Journal of the ACM 68, 3 (2021), 1–28.
[59]
Osamu Gotoh. 1982. An improved algorithm for matching biological sequences. Journal of Molecular Biology 162, 3 (1982), 705–708.
[60]
Theodore Groves. 1973. Incentives in teams. Econometrica 41 (1973), 617–631.
[61]
Chenghao Guo, Zhiyi Huang, and Xinzhi Zhang. 2019. Settling the sample complexity of single-parameter revenue maximization. Proceedings of the Annual Symposium on Theory of Computing (2019).
[62]
Rishi Gupta and Tim Roughgarden. 2017. A PAC approach to application-specific algorithm selection. SIAM Journal on Computing 46, 3 (2017), 992–1017.
[63]
Dan Gusfield, Krishnan Balasubramanian, and Dalit Naor. 1994. Parametric optimization of sequence alignment. Algorithmica 12, 4-5 (1994), 312–326.
[64]
Dan Gusfield and Paul Stelling. 1996. Parametric and inverse-parametric sequence alignment with XPARAL. In Proceedings of the Methods in Enzymology. Elsevier, 481–494.
[65]
Jason Hartline and Samuel Taggart. 2016. Non-revelation mechanism design. arXiv:1608.01875. Retrieved from https://arxiv.org/abs/1608.01875
[66]
Desmond G. Higgins and Paul M. Sharp. 1988. CLUSTAL: A package for performing multiple sequence alignment on a microcomputer. Gene 73, 1 (1988), 237–244.
[67]
Robert W. Holley, Jean Apgar, George A. Everett, James T. Madison, Mark Marquisee, Susan H. Merrill, John Robert Penswick, and Ada Zamir. 1965. Structure of a ribonucleic acid. Science 147, 3664 (1965), 1462–1465.
[68]
Eric Horvitz, Yongshao Ruan, Carla Gomez, Henry Kautz, Bart Selman, and Max Chickering. 2001. A Bayesian approach to tackling hard computational problems. In Proceedings of the Conference on Uncertainty in Artificial Intelligence.
[69]
Chen-Yu Hsu, Piotr Indyk, Dina Katabi, and Ali Vakilian. 2019. Learning-based frequency estimation algorithms. In Proceedings of the International Conference on Learning Representations.
[70]
Zhiyi Huang, Yishay Mansour, and Tim Roughgarden. 2015. Making the most of your samples. In Proceedings of the ACM Conference on Economics and Computation.
[71]
Frank Hutter, Holger Hoos, Kevin Leyton-Brown, and Thomas Stützle. 2009. ParamILS: An automatic algorithm configuration framework. Journal of Artificial Intelligence Research 36, 1 (2009), 267–306.
[72]
Piotr Indyk, Frederik Mallmann-Trenn, Slobodan Mitrović, and Ronitt Rubinfeld. 2020. Online page migration with ML advice. arXiv:2006.05028. Retrieved from https://arxiv.org/abs/2006.05028
[73]
Raj Iyer, David Karger, Hariharan Rahul, and Mikkel Thorup. 2002. An experimental study of polylogarithmic, fully dynamic, connectivity algorithms. ACM Journal of Experimental Algorithmics 6(2002), 4–es.
[74]
Serdar Kadioglu, Yuri Malitsky, Meinolf Sellmann, and Kevin Tierney. 2010. ISAC-instance-specific algorithm configuration. In Proceedings of the European Conference on Artificial Intelligence.
[75]
John D Kececioglu and Dean Starrett. 2004. Aligning alignments exactly. In Proceedings of the Annual International Conference on Computational Molecular Biology. 85–96.
[76]
Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar, and Sergei Vassilvitskii. 2022. Learning predictions for algorithms with predictions. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[77]
Eagu Kim and John Kececioglu. 2007. Inverse sequence alignment from partial examples. Proceedings of the International Workshop on Algorithms in Bioinformatics (2007), 359–370.
[78]
Robert Kleinberg, Kevin Leyton-Brown, and Brendan Lucier. 2017. Efficiency through procrastination: Approximately optimal algorithm configuration with runtime guarantees. In Proceedings of the International Joint Conference on Artificial Intelligence.
[79]
Robert Kleinberg, Kevin Leyton-Brown, Brendan Lucier, and Devon Graham. 2019. Procrastinating with confidence: Near-optimal, anytime, adaptive algorithm configuration. Proceedings of the Annual Conference on Neural Information Processing Systems (2019).
[80]
James Kotary, Ferdinando Fioretto, Pascal Van Hentenryck, and Bryan Wilder. 2021. End-to-end constrained optimization learning: A survey. arXiv:2103.16378. Retrieved from https://arxiv.org/abs/2103.16378
[81]
Ailsa H. Land and Alison G. Doig. 1960. An automatic method of solving discrete programming problems. Econometrica: Journal of the Econometric Society (1960), 497–520.
[82]
Thomas Lavastida, Benjamin Moseley, R. Ravi, and Chenyang Xu. 2020. Learnable and instance-robust predictions for online matching, flows and load balancing. arXiv:2011.11743. Retrieved from https://arxiv.org/abs/2011.11743
[83]
Kevin Leyton-Brown, Eugene Nudelman, and Yoav Shoham. 2009. Empirical hardness models: Methodology and a case study on combinatorial auctions. Journal of the ACM 56, 4 (2009), 1–52.
[84]
Erez Lieberman-Aiden, Nynke L. van Berkum, Louise Williams, Maxim Imakaev, Tobias Ragoczy, Agnes Telling, Ido Amit, Bryan R. Lajoie, Peter J. Sabo, Michael O. Dorschner, Richard Sandstrom, Bradley Bernstein, M. A. Bender, Mark Groudine, Andreas Gnirke, John Stamatoyannopoulos, Leonid A. Mirny, Eric S. Lander, and Job Dekker. 2009. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 5950 (2009), 289–293. DOI:DOI:
[85]
Anton Likhodedov and Tuomas Sandholm. 2004. Methods for boosting revenue in combinatorial auctions. In Proceedings of the National Conference on Artificial Intelligence. San Jose, CA, 232–237.
[86]
Anton Likhodedov and Tuomas Sandholm. 2005. Approximating revenue-maximizing combinatorial auctions. In Proceedings of the National Conference on Artificial Intelligence. Pittsburgh, PA.
[87]
Jeff Linderoth and Martin Savelsbergh. 1999. A computational study of search strategies for mixed integer programming. INFORMS Journal of Computing 11 (1999), 173–187.
[88]
Darío G. Lupiáñez, Malte Spielmann, and Stefan Mundlos. 2016. Breaking TADs: How alterations of chromatin domains result in disease. Trends in Genetics 32, 4 (2016), 225–237.
[89]
Thodoris Lykouris and Sergei Vassilvitskii. 2018. Competitive caching with machine learned advice. In Proceedings of the International Conference on Machine Learning.
[90]
Catherine C. McGeoch. 2012. A Guide to Experimental Algorithmics. Cambridge University Press.
[91]
Debasis Mishra and Arunava Sen. 2012. Roberts’ theorem with neutrality: A social welfare ordering approach. Games and Economic Behavior 75, 1 (2012), 283–298.
[92]
Michael Mitzenmacher. 2018. A model for learned bloom filters and optimizing by sandwiching. In Proceedings of the Annual Conference on Neural Information Processing Systems. 464–473.
[93]
Mehryar Mohri and Andrés Muñoz. 2014. Learning theory and algorithms for revenue optimization in second price auctions with reserve. In Proceedings of the International Conference on Machine Learning.
[94]
Jamie Morgenstern and Tim Roughgarden. 2015. On the pseudo-dimension of nearly optimal auctions. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[95]
Jamie Morgenstern and Tim Roughgarden. 2016. Learning simple auctions. In Proceedings of the Conference on Learning Theory.
[96]
Andrés Muñoz Medina and Sergei Vassilvitskii. 2017. Revenue optimization with approximate bid predictions. Proceedings of the Annual Conference on Neural Information Processing Systems (2017).
[97]
Swaprava Nath and Tuomas Sandholm. 2019. Efficiency and budget balance in general quasi-linear domains. Games and Economic Behavior 113 (2019), 673–693.
[98]
Saket Navlakha, James White, Niranjan Nagarajan, Mihai Pop, and Carl Kingsford. 2009. Finding biologically accurate clusterings in hierarchical tree decompositions using the variation of information. In Proceedings of the Annual International Conference on Research in Computational Molecular Biology. Springer, 400–417.
[99]
Noam Nisan, Tim Roughgarden, Eva Tardos, and Vijay V. Vazirani. 2007. Algorithmic Game Theory. Cambridge University press.
[100]
Ruth Nussinov and Ann B. Jacobson. 1980. Fast algorithm for predicting the secondary structure of single-stranded RNA. Proceedings of the National Academy of Sciences 77, 11 (1980), 6309–6313.
[101]
Lior Pachter and Bernd Sturmfels. 2004. Parametric inference for biological sequence analysis. Proceedings of the National Academy of Sciences 101, 46 (2004), 16138–16143. DOI:DOI:
[102]
Lior Pachter and Bernd Sturmfels. 2004. Tropical geometry of statistical models. Proceedings of the National Academy of Sciences 101, 46 (2004), 16132–16137. DOI:DOI:
[103]
David Pollard. 1984. Convergence of Stochastic Processes. Springer.
[104]
Antoine Prouvost, Justin Dumouchelle, Lara Scavuzzo, Maxime Gasse, Didier Chételat, and Andrea Lodi. 2020. Ecole: A gym-like library for machine learning in combinatorial optimization solvers. arXiv:2011.06069. Retrieved from https://arxiv.org/abs/2011.06069
[105]
Manish Purohit, Zoya Svitkina, and Ravi Kumar. 2018. Improving online algorithms via ML predictions. In Proceedings of the Annual Conference on Neural Information Processing Systems. 9661–9670.
[106]
Kevin Roberts. 1979. The characterization of implementable social choice rules. In Proceedings of the Aggregation and Revelation of Preferences, J.-J. Laffont (Ed.). North-Holland Publishing Company.
[107]
Tim Roughgarden and Okke Schrijvers. 2016. Ironing in the dark. In Proceedings of the ACM Conference on Economics and Computation.
[108]
Shinsaku Sakaue and Taihei Oki. 2022. Sample complexity of learning heuristic functions for greedy-best-first and A* search. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[109]
Tuomas Sandholm. 2013. Very-large-scale generalized combinatorial multi-attribute auctions: Lessons from conducting $60 Billion of sourcing. In Proceedings of the Handbook of Market Design, Zvika Neeman, Alvin Roth, and Nir Vulkan (Eds.). Oxford University Press.
[110]
Tuomas Sandholm and Anton Likhodedov. 2015. Automated design of revenue-maximizing combinatorial auctions. Operations Research 63, 5 (2015), 1000–1025. Special issue on Computational Economics. Subsumes and extends over a AAAI-05 paper and a AAAI-04 paper.
[111]
J. Michael Sauder, Jonathan W. Arthur, and Roland L. Dunbrack Jr.2000. Large-scale comparison of protein sequence alignment algorithms with structure alignments. Proteins: Structure, Function, and Bioinformatics 40, 1 (2000), 6–22.
[112]
Norbert Sauer. 1972. On the density of families of sets. Journal of Combinatorial Theory, Series A 13, 1 (1972), 145–147.
[113]
Daniel Selsam and Nikolaj Bjørner. 2019. Guiding high-performance SAT solvers with unsat-core predictions. In Proceedings of the International Conference on Theory and Applications of Satisfiability Testing. Springer, 336–353.
[114]
Shai Shalev-Shwartz and Shai Ben-David. 2014. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press.
[115]
Yifei Shen, Yuanming Shi, Jun Zhang, and Khaled B. Letaief. 2019. LORM: Learning to optimize for resource management in wireless networks with few training samples. IEEE Transactions on Wireless Communications 19, 1 (2019), 665–679.
[116]
Jialin Song, Ravi Lanka, Yisong Yue, and Bistra Dilkina. 2020. A general large neighborhood search framework for solving integer programs. In Proceedings of the Annual Conference on Neural Information Processing Systems .
[117]
Vasilis Syrgkanis. 2017. A sample complexity measure with applications to learning optimal auctions. Proceedings of the Annual Conference on Neural Information Processing Systems (2017).
[118]
Yunhao Tang, Shipra Agrawal, and Yuri Faenza. 2020. Reinforcement learning for integer programming: Learning to cut. In Proceedings of the International Conference on Machine Learning.
[119]
Timo Tossavainen. 2006. On the zeros of finite sums of exponential functions. Australian Mathematical Society Gazette 33, 1 (2006), 47–50.
[120]
Vladimir Vapnik and Alexey Chervonenkis. 1971. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications 16, 2 (1971), 264–280.
[121]
William Vickrey. 1961. Counterspeculation, auctions, and competitive sealed tenders. Journal of Finance 16 (1961), 8–37.
[122]
Lusheng Wang and Tao Jiang. 1994. On the complexity of multiple sequence alignment. Journal of Computational Biology 1, 4 (1994), 337–348.
[123]
Michael S. Waterman, Temple F. Smith, and William A. Beyer. 1976. Some biological sequence metrics. Advances in Mathematics 20, 3 (1976), 367–387.
[124]
Hugues Wattez, Frédéric Koriche, Christophe Lecoutre, Anastasia Paparrizou, and Sébastien Tabary. 2020. Learning variable ordering heuristics with multi-armed bandits and restarts.
[125]
Alexander Wei and Fred Zhang. 2020. Optimal robustness-consistency tradeoffs for learning-augmented online algorithms. In Proceedings of the Annual Conference on Neural Information Processing Systems.
[126]
Gellért Weisz, András György, and Csaba Szepesvári. 2018. LeapsAndBounds: A method for approximately optimal algorithm configuration. In Proceedings of the International Conference on Machine Learning.
[127]
Gellért Weisz, András György, and Csaba Szepesvári. 2019. CapsAndRuns: An improved method for approximately optimal algorithm configuration. In Proceedings of the International Conference on Machine Learning.
[128]
Travis J. Wheeler and John D. Kececioglu. 2007. Multiple alignment by aligning alignments. Bioinformatics 23, 13 (072007), i559–i568.
[129]
Lin Xu, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2008. SATzilla: Portfolio-based algorithm selection for SAT. Journal of Artificial Intelligence Research 32, 1 (2008), 565–606.
[130]
Lin Xu, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2011. Hydra-MIP: Automated algorithm configuration and selection for mixed integer programming. In Proceedings of the RCRA Workshop on Experimental Evaluation of Algorithms for Solving Problems with Combinatorial Explosion at the International Joint Conference on Artificial Intelligence.
[131]
Giulia Zarpellon, Jason Jo, Andrea Lodi, and Yoshua Bengio. 2021. Parameterizing branch-and-bound search trees to learn branching policies. In Proceedings of the AAAI Conference on Artificial Intelligence.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Journal of the ACM
Journal of the ACM  Volume 71, Issue 5
October 2024
294 pages
EISSN:1557-735X
DOI:10.1145/3613651
  • Editor:
  • Venkatesan Guruswami
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 October 2024
Online AM: 29 July 2024
Accepted: 25 June 2024
Revised: 15 January 2024
Received: 21 April 2021
Published in JACM Volume 71, Issue 5

Check for updates

Author Tags

  1. Automated algorithm design
  2. data-driven algorithm design
  3. automated algorithm configuration
  4. machine learning theory
  5. computational biology

Qualifiers

  • Research-article

Funding Sources

  • Gordon and Betty Moore Foundation’s Data-Driven Discovery Initiative
  • US National Institutes of Health
  • US National Science Foundation
  • US Army Research Office
  • Vannevar Bush Faculty Fellowship to T.S., the Office of Naval Research
  • Defense Advanced Research Projects Agency under cooperative agreement

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 508
    Total Downloads
  • Downloads (Last 12 months)508
  • Downloads (Last 6 weeks)145
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media