Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble

This repository accompanies the paper "Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble", accepted at ACM CCS 2025. It is a cleaned-up version of our MIAE framework repository that contains only the scripts essential for reproducing the results in the paper. The main scripts are in the experiment/mia_comp directory. The paper is available at paper.pdf.

Note that throughout this repo, we refer to coverage and stability (two notions defined in the paper) as union and intersection, respectively. We also use "instance" and "seed" interchangeably, since each instance is prepared with a different seed.
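To make the union/intersection naming concrete, here is a minimal sketch, not code from this repo. It assumes each instance's output has already been reduced to a binary member/non-member prediction per sample; the function name `coverage_and_stability` and the toy predictions are hypothetical.

```python
def coverage_and_stability(predictions):
    """Return (coverage, stability) as sets of sample indices.

    predictions: one list per instance (seed); 1 means "predicted member".
    Coverage is the union of the flagged samples across instances,
    stability is their intersection.
    """
    member_sets = [
        {i for i, p in enumerate(instance) if p} for instance in predictions
    ]
    coverage = set.union(*member_sets)
    stability = set.intersection(*member_sets)
    return coverage, stability


# Three instances of the same attack, four samples (toy data).
preds = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
]
coverage, stability = coverage_and_stability(preds)
print(sorted(coverage))   # [0, 1, 2]
print(sorted(stability))  # [0]
```

Sample 0 is flagged by every instance, so it is the only one in the intersection; samples 1 and 2 are flagged by at least one instance, so they only appear in the union.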

Abstract

Membership inference attacks (MIAs) pose a significant threat to the privacy of machine learning models and are widely used as tools for privacy assessment, auditing, and machine unlearning. While prior MIA research has primarily focused on performance metrics such as AUC, accuracy, and TPR@low FPR—either by developing new methods to enhance these metrics or using them to evaluate privacy solutions—we found that it overlooks the disparities among different attacks. These disparities, both between distinct attack methods and between multiple instantiations of the same method, have crucial implications for the reliability and completeness of MIAs as privacy evaluation tools. In this paper, we systematically investigate these disparities through a novel framework based on coverage and stability analysis. Extensive experiments reveal significant disparities in MIAs, their potential causes, and their broader implications for privacy evaluation. To address these challenges, we propose an ensemble framework with three distinct strategies to harness the strengths of state-of-the-art MIAs while accounting for their disparities. This framework not only enables the construction of more powerful attacks but also provides a more robust and comprehensive methodology for privacy evaluation.

⚠️ NOTE: To set up the directory correctly, replace DATA_DIR in all the scripts with the path to the directory where you want to store the attack predictions and results. We also recommend running all scripts (especially the bash scripts) from the miae/experiment/mia_comp directory.

⚠️ NOTE: Most bash scripts offer several configurations, which are selected by commenting and uncommenting lines. For example, in experiment_scripts/obtain_venn.sh, you can set the configuration of the Venn diagram this way. The same applies to the other bash scripts.

Set up the environment

conda env create -f miae_env.yml
conda activate miae

Quick fix for relative imports

Go to the root directory of the repo and run:

pip install -e miae

Preparing Predictions of Multi-instances MIAs

`obtain_pred.py`

Initialize the specified target model and the target dataset.
Split the target dataset into target dataset and auxiliary dataset.
Train the target model on the target dataset.
Prepare the specified MIAs using the auxiliary dataset and black-box access to the target model.
Save the prediction on the target dataset.
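The dataset split in the second step can be sketched as a seeded shuffle of sample indices. This is an illustration only: the function name `split_target_aux`, the 50/50 ratio, and the use of the seed for the split are assumptions, not the behavior of `obtain_pred.py`.

```python
import random


def split_target_aux(n_samples, target_fraction=0.5, seed=0):
    """Split sample indices into disjoint target and auxiliary index lists."""
    indices = list(range(n_samples))
    # A dedicated Random instance keeps the split reproducible per seed
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    cut = int(n_samples * target_fraction)
    return indices[:cut], indices[cut:]


target_idx, aux_idx = split_target_aux(10, target_fraction=0.5, seed=1)
print(len(target_idx), len(aux_idx))  # 5 5
```

The two index lists never overlap, so the attacks are prepared on data the target model was not trained on.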

Workflow diagram: see obtain_pred_fpr_workflow.png.

This file is called by experiment_scripts/prepare_target.sh to prepare and save the target model and datasets, and by experiment_scripts/obtain_pred.sh to run the experiments.

Usage:

Prepare the target datasets and target models
```
bash experiment_scripts/prepare_target.sh
```
Train attacks on the target models and target datasets, then save predictions
```
bash experiment_scripts/obtain_pred.sh [seed]
```
where [seed] is the seed of that instance.

To launch multiple instances at once, run `bash run_multi_seed.sh {0..5}`, where `{0..5}` is the range of seeds you want to run.
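If you prefer to drive the per-seed runs from Python, the commands can be built as below. This is a sketch under an assumption: that a multi-seed launch amounts to calling `experiment_scripts/obtain_pred.sh` once per seed. The helper name `seed_commands` is hypothetical, and the snippet only prints the commands.

```python
import shlex


def seed_commands(seeds, script="experiment_scripts/obtain_pred.sh"):
    """Build one argv list per seed, e.g. ['bash', script, '0']."""
    return [["bash", script, str(seed)] for seed in seeds]


# Bash's {0..5} is inclusive on both ends, so it maps to range(0, 6).
for cmd in seed_commands(range(0, 6)):
    print(shlex.join(cmd))
```

Each argv list can be passed to `subprocess.run(cmd, check=True)` from the directory that contains `experiment_scripts`.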

Comparing MIAs

`obtain_graph.py`

The obtain_graph.py script is designed to load data, generate various plots, and evaluate metrics. The code is divided into three primary categories: Data Loading, Plot Diagram, and Evaluation.

Data Loading
- load_and_create_predictions: Loads data and creates the prediction objects used in later evaluations and plotting.
- load_diff_distribution: Loads data and creates the prediction objects for the same-attack, different-distribution case.
Plot Diagram
- plot_venn: Plots a Venn diagram for comparisons between attacks.
```
bash experiment_scripts/obtain_venn.sh 
```
- plot_auc: Plots an AUC (Area Under the Curve) diagram for different models or attacks.
```
bash experiment_scripts/obtain_auc.sh
```
- multi_seed_convergence: Visualizes the convergence across multiple seeds for the model or experiment.
```
bash experiment_scripts/obtain_multi_seed_conv.sh
```
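For reference, the AUC of an attack's membership scores can be computed without a plotting library. The sketch below uses the rank (Mann-Whitney) formulation; the function name `roc_auc` and the toy scores are illustrative and are not taken from `obtain_graph.py`.

```python
def roc_auc(scores, labels):
    """AUC as the probability that a member outscores a non-member.

    scores: higher means "more likely a member"; labels: 1 = member, 0 = non-member.
    Ties between a member and a non-member count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


scores = [0.9, 0.8, 0.4, 0.3]
labels = [1, 0, 1, 0]
print(roc_auc(scores, labels))  # 0.75
```

This pairwise version is quadratic in the number of samples, which is fine for a sanity check but too slow for full datasets.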

`obtain_jaccard.py`

`obtain_jaccard.py` saves the Jaccard similarity between different MIAs and plots a heatmap to visualize the Jaccard similarity matrix. Before running this shell script, make sure you already have the pair-wise Jaccard similarity, which can be calculated by running obtain_venn.sh.
```bash
bash experiment_scripts/obtain_jaccard.sh
```
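As a reminder of what the heatmap shows, the Jaccard similarity of two attacks is the size of the intersection of their flagged samples divided by the size of the union. The following is a minimal sketch, not the code in `obtain_jaccard.py`; the attack names and member sets are made up for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """|a & b| / |a | b|; two empty sets are treated as identical."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def pairwise_jaccard(member_sets):
    """Symmetric similarity matrix as a nested dict keyed by attack name."""
    names = sorted(member_sets)
    # Start from all ones so the diagonal is correct, then fill each pair.
    matrix = {x: {y: 1.0 for y in names} for x in names}
    for x, y in combinations(names, 2):
        matrix[x][y] = matrix[y][x] = jaccard(member_sets[x], member_sets[y])
    return matrix


# Hypothetical attacks and the sample indices each one flags as members.
flagged = {"attack_a": {1, 2, 3}, "attack_b": {2, 3, 4}, "attack_c": {7}}
matrix = pairwise_jaccard(flagged)
print(matrix["attack_a"]["attack_b"])  # 0.5
print(matrix["attack_a"]["attack_c"])  # 0.0
```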

`process_CINIC10.ipynb`

`process_CINIC10.ipynb` processes the CINIC10 dataset (30,000 ImageNet samples and 30,000 CIFAR10 samples). Run it to make sure cinic10 is available in the `DATA_DIR`.

`/ensemble` Directory

This directory contains the code for the ensemble strategies proposed in the paper: Coverage Ensemble and Stability Ensemble.

`max_ensemble_low_fpr.ipynb`

This notebook performs Coverage Ensemble and Stability Ensemble. It first thresholds the predictions of the base instances at the same low FPR, then ensembles the predictions following the two-step ensemble approach defined in our paper.
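The thresholding step can be sketched as follows. This is an illustration under stated assumptions, not the notebook's code: thresholds are calibrated on the non-member scores of the evaluated samples, and Coverage Ensemble and Stability Ensemble are mapped to a plain union and intersection as in the naming note at the top of this README. The full two-step procedure is defined in the paper and is not reproduced here; all function names and scores are hypothetical.

```python
def threshold_at_fpr(nonmember_scores, target_fpr):
    """Threshold t such that at most target_fpr of non-members score above t."""
    ranked = sorted(nonmember_scores, reverse=True)
    k = int(target_fpr * len(ranked))  # number of false positives allowed
    # Samples are flagged when score > t, so t is the (k+1)-th largest score.
    return ranked[k] if k < len(ranked) else float("-inf")


def flagged_members(scores, is_member, target_fpr):
    """Indices one instance flags as members at the given FPR."""
    nonmember_scores = [s for s, y in zip(scores, is_member) if not y]
    thr = threshold_at_fpr(nonmember_scores, target_fpr)
    return {i for i, s in enumerate(scores) if s > thr}


is_member = [1, 0, 1, 0, 1, 0]
scores_by_instance = [
    [0.9, 0.2, 0.8, 0.1, 0.7, 0.3],  # instance with seed 0 (toy scores)
    [0.6, 0.5, 0.9, 0.2, 0.4, 0.3],  # instance with seed 1 (toy scores)
]
flagged = [flagged_members(s, is_member, target_fpr=0.1) for s in scores_by_instance]
print(sorted(set.union(*flagged)))         # [0, 2, 4]
print(sorted(set.intersection(*flagged)))  # [0, 2]
```

Thresholding every instance at the same FPR puts their predictions on a common footing before they are combined.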

`ensemble_roc.py`

`ensemble_roc.py` samples n thresholds, one per target FPR, for each base instance. Each attack then goes through the ensemble steps in `max_ensemble_low_fpr.ipynb` n times, once per threshold, to obtain n samples of the ensemble's TPR@FPR. It also calculates the AUC, ACC, and TPR@low FPR for each ensemble.
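The sweep described above can be sketched as a loop over target FPRs. This is a simplified illustration, not the logic of `ensemble_roc.py`: it assumes thresholds are calibrated on the non-member scores of the evaluated samples and combines instances with a plain union (`any`) or intersection (`all`). All names and scores are hypothetical.

```python
def threshold_at_fpr(nonmember_scores, target_fpr):
    """Threshold t such that at most target_fpr of non-members score above t."""
    ranked = sorted(nonmember_scores, reverse=True)
    k = int(target_fpr * len(ranked))
    return ranked[k] if k < len(ranked) else float("-inf")


def ensemble_roc_points(scores_by_instance, is_member, fprs, combine=any):
    """One (FPR, TPR) point of the ensemble per target FPR of the base instances."""
    n_pos = sum(is_member)
    n_neg = len(is_member) - n_pos
    points = []
    for fpr in fprs:
        votes = []
        for scores in scores_by_instance:
            neg = [s for s, y in zip(scores, is_member) if not y]
            thr = threshold_at_fpr(neg, fpr)
            votes.append([s > thr for s in scores])
        # combine=any gives a union over instances, combine=all an intersection.
        ensemble = [combine(col) for col in zip(*votes)]
        tp = sum(1 for e, y in zip(ensemble, is_member) if e and y)
        fp = sum(1 for e, y in zip(ensemble, is_member) if e and not y)
        points.append((fp / n_neg, tp / n_pos))
    return points


is_member = [1, 0, 1, 0, 1, 0]
scores_by_instance = [
    [0.9, 0.2, 0.8, 0.1, 0.7, 0.3],
    [0.6, 0.5, 0.9, 0.2, 0.4, 0.3],
]
print(ensemble_roc_points(scores_by_instance, is_member, [0.0, 1.0], combine=any))
print(ensemble_roc_points(scores_by_instance, is_member, [0.0, 1.0], combine=all))
```

The ensemble's FPR is measured again after combining, because a union of instances that each sit at a given FPR can exceed that FPR.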

`ensemble_performance.ipynb`

This notebook compares the performance of the ensemble strategies proposed in the paper. It organizes the performance results (TPR@low FPR, AUC, ACC) with respect to the number of instances used in the ensemble.

`same_attack_different_signal/same_attack_different_signal.ipynb`

This notebook compares the performance of the same attack on different signals. It corresponds to "Attack Signals of A-covered Samples" in the paper.

`disparity_empirical_analysis.ipynb`

This notebook analyzes the disparity of MIAs in the empirical study. It corresponds to "Output Distribution of A-Unique Samples" in the paper.
