Abstract
In the classic k-center problem, we are given a metric graph, and the objective is to select k nodes as centers such that the maximum distance from any vertex to its closest center is minimized. In this paper, we consider two important generalizations of k-center, the matroid center problem and the knapsack center problem. Both problems are motivated by recent content distribution network applications. Our contributions can be summarized as follows: (1) We consider the matroid center problem in which the centers are required to form an independent set of a given matroid. We show this problem is NP-hard even on a line. We present a 3-approximation algorithm for the problem on general metrics. We also consider the outlier version of the problem where a given number of vertices can be excluded as outliers from the solution. We present a 7-approximation for the outlier version. (2) We consider the (multi-)knapsack center problem in which the centers are required to satisfy one (or more) knapsack constraint(s). It is known that the knapsack center problem with a single knapsack constraint admits a 3-approximation. However, when there are at least two knapsack constraints, we show this problem is not approximable at all. To complement the hardness result, we present a polynomial time algorithm that gives a 3-approximate solution such that one knapsack constraint is satisfied and the others may be violated by at most a factor of \(1+\epsilon \). We also obtain a 3-approximation for the outlier version that may violate the knapsack constraint by \(1+\epsilon \).
Similar content being viewed by others
Notes
Let \(B_1,B_2,\ldots , B_b\) be a collection of disjoint subsets of V and \(d_i\) be integers such that \(1\le d_i\le |B_i|\) for all \(1\le i\le b\). We say a set \(I\subseteq V\) is independent if \(|I\cap B_i|\le d_i\) for \(1\le i\le b\). All such independent sets form a partition matroid.
We note that Golovin et al. [13] claimed (without a proof) that, in our notations, most existing approximation algorithms for k-median achieve an \(O(\lambda )\)-approximation on spaces satisfying \((\lambda ,2)\)-relaxed TI. By a scrutiny of the existing k-median algorithms, we are not able to reproduce the same result and the correct approximation ratio should be roughly \(O(\lambda ^{c_0})\). However, the results of [13] are not affected in any essential way since this only changes the constant hidden in the big-oh notation.
References
Aggarwal, G., Panigrahy, R., Feder, T., Thomas, D., Kenthapadi, K., Khuller, S., Zhu, A.: Achieving Anonymity via Clustering, vol. 6, p. 49. ACM (2010)
Charikar, M., Khuller, S., Mount, D.M., Narasimhan, G.: Algorithms for facility location problems with outliers. In: Proceedings of the Twelfth Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 642–651. Society for Industrial and Applied Mathematics (2001)
Charikar, M., Li, S.: A dependent lp-rounding approach for the k-median problem. In: International Colloquium on Automata, Languages, and Programming, pp. 194–205. Springer, Berlin (2012)
Chechik, S., Peleg, D.: The fault tolerant capacitated k-center problem. In: Structural Information and Communication Complexity, pp. 13–24. Springer, Berlin (2012)
Chen, D.Z., Wang, H.: Efficient algorithms for the weighted k-center problem on a real line. In: International Symposium on Algorithm and Computation, pp. 584–593. Springer, Berlin (2011)
Chen, K.: A constant factor approximation algorithm for k-median clustering with outliers. In: Proceedings of the Nineteenth Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 826–835. Society for Industrial and Applied Mathematics (2008)
Chuzhoy, J., Guha, S., Halperin, E., Khanna, S., Kortsarz, G., Krauthgamer, R., Naor, J.: Asymmetric \(k\)-center is \(\log ^{*}n\)-hard to approximate. J. ACM 52(4), 538–551 (2005)
Cole, R.: Slowing down sorting networks to obtain faster sorting algorithms. J. ACM 34(1), 200–208 (1987)
Cygan, M., Hajiaghayi, M., Khuller, S.: LP rounding for \(k\)-centers with non-uniform hard capacities. In: IEEE 53rd Annual Symposium on Foundations of Computer Science, pp. 273–282 (2012)
Edmonds, J., Ray Fulkerson, D.: Bottleneck extrema. J. Comb. Theory 8(3), 299–306 (1970)
Frederickson, G.N.: Parametric search and locating supply centers in trees. In: Algorithms and Data Structures, pp. 299–319. Springer, Berlin (1991)
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman, San Francisco (1979)
Golovin, D., Gupta, A., Kumar, A., Tangwongsan, K., Hariharan, R., Mukund, M., Vinay, V.: All-norms and all-l_p-norms approximation algorithms. In: IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, vol. 2, pp. 199–210. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik
Gonzalez, T.F.: Clustering to minimize the maximum intercluster distance. Theor. Comput. Sci. 38, 293–306 (1985)
Grandoni, F., Zenklusen, R.: Approximation schemes for multi-budgeted independence systems. In: European Symposium on Algorithms, pp. 536–548 (2010)
Hajiaghayi, M., Khandekar, R., Kortsarz, G.: Budgeted red–blue median and its generalizations. In: European Symposium on Algorithms, pp. 314–325 (2011)
Hochbaum, D., Shmoys, D.: A unified approach to approximation algorithms for bottleneck problems. J. ACM 33(3), 533–550 (1986)
Hochbaum, D.S., Shmoys, D.B.: A best possible heuristic for the k-center problem. Math. Oper. Res. 10(2), 180–184 (1985)
Kellerer, H., Pferschy, U., Pisinger, D.: Knapsack Problems. Springer, Berlin (2004)
Khuller, S., Pless, R., Sussmann, Y.J.: Fault tolerant k-center problems. Theor. Comput. Sci. 242(1), 237–245 (2000)
Khuller, S., Saha, B., Sarpatwar, K.K.: New approximation results for resource replication problems. In: Approximation, Randomization, and Combinatorial Optimization, volume 7408 of LNCS, pp. 218–230. Springer, Berlin (2012)
Krishnaswamy, R., Kumar, A., Nagarajan, V., Sabharwal, Y., Saha, B.: The matroid median problem. In: Proceedings of the Twenty-Second Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 1117–1130. SIAM (2011)
Kumar, A.: Constant factor approximation algorithm for the knapsack median problem. In: Proceedings of the Twenty-Third Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 824–832. SIAM (2012)
Lee, J., Mirrokni, V.S., Nagarajan, V., Sviridenko, M.: Non-monotone submodular maximization under matroid and knapsack constraints. In: Proceedings of the Forty-first Annual ACM Symposium on Theory of Computing, pp. 323–332. ACM (2009)
Li, J., Yi, K., Zhang, Q.: Clustering with diversity. In: International Colloquium on Automata, Languages and Programming, pp. 188–200. Springer, Berlin (2010)
McCutchen, R.M., Khuller, S.: Streaming algorithms for k-center clustering with outliers and with anonymity. In: Approximation, Randomization and Combinatorial Optimization, pp. 165–178. Springer, Berlin (2008)
Papadimitriou, C.H., Yannakakis, M.: On the approximability of trade-offs and optimal access of web sources. In: Proceedings of the 41st Annual Symposium on Foundations of Computer Science, pp. 86–92. IEEE (2000)
Pisinger, D.: Algorithms for Knapsack Problems. PhD thesis, Department of Computer Science, University of Copenhagen (1995)
Schrijver, A.: Combinatorial Optimization: Polyhedra and Efficiency. Springer, Berlin (2003)
Swamy, C.: Improved approximation algorithms for matroid and knapsack median problems and applications. In: Approximation, Randomization and Combinatorial Optimization
Vondrák, J., Chekuri, C., Zenklusen, R.: Submodular function maximization via the multilinear relaxation and contention resolution schemes. In: Proceedings of the Forty-third Annual ACM Symposium on Theory of Computing, pp. 783–792. ACM (2011)
Zarrabi-Zadeh, H., Mukhopadhyay, A.: Streaming 1-center with outliers in high dimensions. In: Canadian Conference on Computational Geometry, pp. 83–86 (2009)
Zenklusen, R.: Matroidal degree-bounded minimum spanning trees. In: Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1512–1521. SIAM (2012)
Acknowledgments
We would like to thank the anonymous reviewers for their insightful and detailed comments, which helped us to improve the presentation significantly.
Author information
Authors and Affiliations
Corresponding author
Additional information
A preliminary version of the paper appeared in the Proceedings of The 16th Conference on Integer Programming and Combinatorial Optimization (IPCO), 2013.
The research of D. Z. Chen was supported in part by NSF under Grant CCF-1217906.
Jian Li: Research supported in part by the National Basic Research Program of China Grants 2015CB358700, 2011CBA00300, 2011CBA00301, the National Natural Science Foundation of China Grants 61202009, 61033001, 61361136003.
This work was done when H. Liang was a student at Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, China.
H. Wang’s research was supported in part by NSF under Grant CCF-1317143.
Appendix: A Constant Approximation for MatCenter
Appendix: A Constant Approximation for MatCenter
We say a space V with a distance function d satisfies the \((\lambda ,c)\) -relaxed triangle inequality (TI) for some \(\lambda \) and c, if \(d(a_0,a_c)\le \lambda \sum _{i=1}^{c}d(a_{i-1},a_i)\) for all \(a_0,a_1,\ldots ,a_c \in V\). (Thus a metric space satisfies the (1, c)-relaxed TI for all \(c\ge 1\).) By examining the algorithms in [3, 22] for the matroid median problem, we notice that they can actually achieve an approximation factor of \((\mu \lambda )\) for matroid median where \(\mu \) is some universal constant, if the underlying space satisfies the \((\lambda ,c_0)\)-relaxed TI for some algorithm-dependent \(c_0\).Footnote 3 (Roughly speaking, \(c_0\) is the maximum number of times that the triangle inequality is used for bounding the distance between a client and a facility.) Now, given an instance of MatCenter with metric space (V, d), we define a new distance function \(d'\) as \(d'(a,b)=(d(a,b))^p\) for all \(a,b\in V\), where \(p>2\) is a parameter whose value will be specified later. By the convexity of the function \(f(x)=x^{p}\) when \(p\ge 2\), for all \(c\ge 1\) and \(a_0,a_1,\ldots ,a_c \in V\), we have \((\sum _{i=1}^{c}d(a_{i-1},a_i)/c)^p \le \sum _{i=1}^{c}d(a_{i-1},a_i)^p/c\), and thus
Therefore \((V, d')\) satisfies the \((c^{p-1},c)\)-relaxed TI for all \(c \ge 1\). In particular, it satisfies the \((c_0^{p-1},c_0)\)-relaxed TI where \(c_0\) is the algorithm-dependent parameter mentioned before. We now solve the matroid median problem on the instance with the new distance function \(d'\). Let \(\mathsf {OPT}\) denote the optimal objective value of MatCenter on the original instance. Then it is clear that the optimal cost of matroid median on the new instance is at most \(|V|\cdot \mathsf {OPT}^{p}\). By our previous observation, the algorithms of [3, 22] give a solution of cost at most \(\mu c_0^{p-1}|V|\mathsf {OPT}^{p}\). Transforming the distance function back to d, the maximum service cost of any client is at most \((\mu c_0^{p-1}|V|\mathsf {OPT}^{p})^{1/p}=c_0^{1-1/p}(\mu |V|)^{1/p}\mathsf {OPT}\). By choosing \(p=\Omega (|V|)\), this can produce a \((c_0+\epsilon )\)-approximation for MatCenter for any fixed \(\epsilon >0\). Using the algorithm of [3] this roughly gives a 9-approximation.
Rights and permissions
About this article
Cite this article
Chen, D.Z., Li, J., Liang, H. et al. Matroid and Knapsack Center Problems. Algorithmica 75, 27–52 (2016). https://doi.org/10.1007/s00453-015-0010-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00453-015-0010-1