[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3635637.3663057acmconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

New Algorithms for Distributed Fair k-Center Clustering: Almost Accurate as Sequential Algorithms

Published: 06 May 2024 Publication History

Abstract

Fair clustering problems have been paid lots of attention recently. In this paper, we study the k-Center problem under the group fairness and data summarization fairness constraints, denoted as Group Fair k-Center (GFkC) and Data Summarization Fair k-Center (DSFkC), respectively, in the massively parallel computational (MPC) distributed model. The previous best results for the above two problems in the MPC model are a 9-approximation with violation 7 (WWW 2022) and a (17+ε)-approximation without fairness violation (ICML 2020), respectively. In this paper, we obtain a (3+ε)-approximation with violation 1 for the GFkC problem in the MPC model, which is almost as accurate as the best known approximation ratio 3 with violation 1 for the sequential algorithm of the GFkC problem. Moreover, for the DSFkC problem in the MPC model, we obtain a (4+ε)-approximation without fairness violation, which is very close to the best known approximation ratio 3 for the sequential algorithm of the DSFkC problem. Empirical experiments show that our distributed algorithms perform better than existing state-of-the-art distributed methods for the above two problems.

References

[1]
Gagan Aggarwal, Rina Panigrahy, Tomás Feder, Dilys Thomas, Krishnaram Kenthapadi, Samir Khuller, and An Zhu. 2010. Achieving Anonymity via Clustering. ACM Transactions on Algorithms, Vol. 6, 3 (2010), 49:1--49:19.
[2]
Sara Ahmadian, Alessandro Epasto, Ravi Kumar, and Mohammad Mahdian. 2019. Clustering without Over-Representation. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 267--275.
[3]
Sara Ahmadian and Chaitanya Swamy. 2016. Approximation Algorithms for Clustering Problems with Lower Bounds and Outliers. In Proceedings of the 43rd International Colloquium on Automata, Languages, and Programming. 69:1--69:15.
[4]
Hyung-Chan An, Aditya Bhaskara, Chandra Chekuri, Shalmoli Gupta, Vivek Madan, and Ola Svensson. 2015. Centrality of Trees for Capacitated k-Center. Mathematical Programming, Vol. 154, 1--2 (2015), 29--53.
[5]
Mikhail Belkin. 2004. Problems of Learning on Manifolds. The University of Chicago (2004).
[6]
Suman Kalyan Bera, Deeparnab Chakrabarty, Nicolas Flores, and Maryam Negahbani. 2019. Fair Algorithms for Clustering. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 4955--4966.
[7]
Suman K Bera, Syamantak Das, Sainyam Galhotra, and Sagar Sudhir Kale. 2022. Fair k-Center Clustering in MapReduce and Streaming Settings. In Proceedings of the ACM Web Conference 2022. 1414--1422.
[8]
Ioana Oriana Bercea, Martin Groß, Samir Khuller, Aounon Kumar, Clemens Rösner, Daniel R. Schmidt, and Melanie Schmidt. 2019. On the Cost of Essentially Fair Clusterings. In Proceedings of the 22nd International Conference on Approximation Algorithms for Combinatorial Optimization Problems and 23rd International Conference on Randomization and Computation. 18:1--18:22.
[9]
Matteo Ceccarello, Andrea Pietracaprina, and Geppino Pucci. 2019. Solving k-Center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially. Proceedings of the VLDB Endowment, Vol. 12, 7 (2019), 766--778.
[10]
Deeparnab Chakrabarty, Prachi Goyal, and Ravishankar Krishnaswamy. 2016. The Non-Uniform k-Center Problem. In Proceedings of the 43rd International Colloquium on Automata, Languages, and Programming. 67:1--67:15.
[11]
Moses Charikar, Samir Khuller, David M. Mount, and Giri Narasimhan. 2001. Algorithms for Facility Location Problems with Outliers. In Proceedings of the 12th Annual ACM-SIAM Symposium on Discrete Algorithms. 642--651.
[12]
Shiri Chechik and David Peleg. 2015. The Fault-Tolerant Capacitated K-center Problem. Theoretical Computer Science, Vol. 566 (2015), 12--25.
[13]
Danny Z. Chen, Jian Li, Hongyu Liang, and Haitao Wang. 2016. Matroid and Knapsack Center Problems. Algorithmica, Vol. 75, 1 (2016), 27--52.
[14]
Xingyu Chen, Brandon Fain, Liang Lyu, and Kamesh Munagala. 2019. Proportionally Fair Clustering. In Proceedings of the 36th International Conference on Machine Learning. 1032--1041.
[15]
Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, and Sergei Vassilvitskii. 2017. Fair Clustering Through Fairlets. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 5029--5037.
[16]
Ashish Chiplunkar, Sagar Kale, and Sivaramakrishnan Natarajan Ramamoorthy. 2020. How to Solve Fair k-center in Massive Data Models. In Proceedings of the 37th International Conference on Machine Learning. 1877--1886.
[17]
Marek Cygan, MohammadTaghi Hajiaghayi, and Samir Khuller. 2012. LP Rounding for k-Centers with Non-uniform Hard Capacities. In Proceedings of the 53rd Annual Symposium on Foundations of Computer Science. 273--282.
[18]
Hu Ding, Haikuo Yu, and Zixiu Wang. 2019. Greedy Strategy Works for k-Center Clustering with Outliers and Coreset Construction. In Proceedings of the 27th Annual European Symposium on Algorithms. 40:1--40:16.
[19]
Cristina G. Fernandes, Samuel P. de Paula, and Lehilton L. C. Pedrosa. 2018. Improved Approximation Algorithms for Capacitated Fault-Tolerant k-Center. Algorithmica, Vol. 80, 3 (2018), 1041--1072.
[20]
M. R. Garey and David S. Johnson. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. WH Freeman.
[21]
Teofilo F. Gonzalez. 1985. Clustering to Minimize the Maximum Intercluster Distance. Theoretical Computer Science, Vol. 38 (1985), 293--306.
[22]
Elfarouk Harb and Ho Shan Lam. 2020. KFC: A Scalable Approximation Algorithm for k-center Fair Clustering. In Proceedings of the 34th International Conference on Neural Information Processing Systems. 14509--14519.
[23]
Dorit S. Hochbaum and David B. Shmoys. 1985. A Best Possible Heuristic for the k-Center Problem. Mathematics of Operations Research, Vol. 10, 2 (1985), 180--184.
[24]
Matthew Jones, Huy Lê Nguyên, and Thy Nguyen. 2020. Fair k-centers via Maximum Matching. In Proceedings of the 37th International Conference on Machine Learning. 4940--4949.
[25]
Matthew Kay, Cynthia Matuszek, and Sean A Munson. 2015. Unequal Representation and Gender Stereotypes in Image Search Results for Occupations. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 3819--3828.
[26]
Samir Khuller and Yoram J. Sussmann. 2000. The Capacitated K-Center Problem. SIAM Journal on Discrete Mathematics, Vol. 13, 3 (2000), 403--418.
[27]
Matth"a us Kleindessner, Pranjal Awasthi, and Jamie Morgenstern. 2019. Fair k-Center Clustering for Data Summarization. In Proceeding of the 36th International Conference on Machine Learning. 3448--3457.
[28]
Bo Li, Lijun Li, Ankang Sun, Chenhao Wang, and Yingfan Wang. 2021. Approximate Group Fairness for Clustering. In Proceedings of the 38th International Conference on Machine Learning. 6381--6391.
[29]
Sepideh Mahabadi and Ali Vakilian. 2020. Individual Fairness for k-Clustering. In Proceedings of the 37th International Conference on Machine Learning. 6586--6596.
[30]
Evi Micha and Nisarg Shah. 2020. Proportionally Fair Clustering Revisited. In Proceedings of the 47th International Colloquium on Automata, Languages, and Programming. 85:1--85:16.
[31]
Maryam Negahbani and Deeparnab Chakrabarty. 2021. Better Algorithms for Individually Fair k-Clustering. In Proceedings of the 35th International Conference on Neural Information Processing Systems. 13340--13351.

Cited By

View all
  • (2024)Coresets for Deletion-Robust k-Center ClusteringProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679890(3877-3881)Online publication date: 21-Oct-2024

Index Terms

  1. New Algorithms for Distributed Fair k-Center Clustering: Almost Accurate as Sequential Algorithms

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems
    May 2024
    2898 pages
    ISBN:9798400704864

    Sponsors

    Publisher

    International Foundation for Autonomous Agents and Multiagent Systems

    Richland, SC

    Publication History

    Published: 06 May 2024

    Check for updates

    Author Tags

    1. clustering
    2. fairness
    3. machine learning

    Qualifiers

    • Research-article

    Funding Sources

    • Central South University Research Programme of Advanced Interdisciplinary Studies
    • Open Project of Xiangjiang Laboratory
    • National Natural Science Foundation of China

    Conference

    AAMAS '24
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)43
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 18 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Coresets for Deletion-Robust k-Center ClusteringProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679890(3877-3881)Online publication date: 21-Oct-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media