An Exploration of Preferential Treatment of Products in RAG Systems
10-423/623 Generative AI – Carnegie Mellon University
Jacob Doughty · Maximilian Vieweg · Samuel Yarger
Problem: Platforms have monetary incentives to manipulate product review summaries.
Goal: Test whether we can deliberately inject product-specific bias into a Retrieval-Augmented Generation (RAG) pipeline, and whether that bias can then be detected.
Dataset: UCSD 2023 Amazon Reviews
Research Questions:
- RQ1: To what extent can different stages of a RAG pipeline be altered to introduce bias towards specific products on Amazon?
- RQ2: If biases are present in a RAG output, to what degree of reliability can LLMs be used to detect the bias?
Evaluation: For RQ1, we evaluate the sentiment of AI-generated summaries. For RQ2, we evaluate the correlation between the strength of bias induction from RQ1 and our detection metric.
- UCSD 2023 Amazon Reviews – 43 M reviews total
- Down-sampled to 100 products × 200 reviews each (train / test split)
Using real-world reviews lets us measure bias on naturally noisy data instead of synthetic text.
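As a minimal sketch of this down-sampling step, assuming the reviews are already loaded into a pandas DataFrame; the `parent_asin` column name follows the 2023 release's schema but is an assumption here:

```python
import pandas as pd

def downsample(reviews: pd.DataFrame, n_products=100, n_reviews=200, seed=0):
    """Keep n_products products with at least n_reviews reviews each,
    then sample exactly n_reviews reviews per product."""
    counts = reviews["parent_asin"].value_counts()
    eligible = counts[counts >= n_reviews].index

    products = pd.Series(eligible).sample(n_products, random_state=seed)
    subset = reviews[reviews["parent_asin"].isin(products)]
    return subset.groupby("parent_asin").sample(n_reviews, random_state=seed)
```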
Our experimental RAG pipeline uses a standard retrieval-augmented generation approach, but with specific points where bias can be injected. As summarized in the table below, we investigated three distinct methods of introducing bias: during retrieval/ranking of documents, through post-retrieval ordering of content, and via prompt engineering at the generation stage.
| Stage | Technique | Intuition (how it biases the pipeline) |
|---|---|---|
| Retrieval / Ranking | Sentiment-Aware Re-ranking – score documents as α·cos(q,d) + (1-α)·s(d) | Elevates chunks with positive sentiment so favorable evidence is fetched |
| Post-retrieval Ordering | High-to-Low Rating Ordering – pass reviews to the LLM sorted by star rating | Positive context appears first, nudging the LLM's perception |
| Generation | Prompt Injection – prepend an instruction to highlight the product's strengths | Directly biases the tone of the generated summary |
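Below is a minimal sketch of the sentiment-aware re-ranking score from the table, α·cos(q,d) + (1-α)·s(d). The function name, the toy data, and the value α=0.7 are illustrative assumptions; any embedding model and any sentiment scorer producing s(d) in [0, 1] would slot in.

```python
import numpy as np

def sentiment_aware_rerank(query_emb, doc_embs, doc_sentiments, alpha=0.7):
    """Rank review chunks by alpha * cos(q, d) + (1 - alpha) * s(d)."""
    # Cosine similarity between the query and each chunk embedding.
    q = query_emb / np.linalg.norm(query_emb)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    cos_sim = d @ q

    # alpha = 1 recovers unbiased relevance ranking; lowering alpha
    # increasingly elevates positively scored chunks.
    scores = alpha * cos_sim + (1 - alpha) * np.asarray(doc_sentiments)
    return np.argsort(scores)[::-1]  # chunk indices, best first

# Toy usage: three chunks, one with strongly positive sentiment.
rng = np.random.default_rng(0)
query_emb = rng.normal(size=8)
doc_embs = rng.normal(size=(3, 8))
print(sentiment_aware_rerank(query_emb, doc_embs, [0.9, 0.2, 0.5]))
```

The other two stages are one-liners by comparison: sorting the retrieved reviews by star rating before building the context, and prepending a "highlight the product's strengths" instruction to the summarization prompt.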
| Metric | Baseline | Filter | Re-rank | Prompt |
|---|---|---|---|---|
| MSE between BERT-predicted and ground-truth star rating | 0.24 | 0.80 | 2.19 | 1.76 |
| Cohen’s κ w.r.t. baseline star prediction | — | 0.25 | 0.67 | 0.56 |
| Average BERTScore | — | 0.51 | 0.58 | 0.57 |
- Re-ranking created the largest rating inflation but was still semantically close to the baseline summary.
- Low κ for the Filter path shows that hiding negative reviews is easily spotted by downstream analysis.
- Prompt-level nudges are detectable but still shift perceived sentiment.
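A hedged sketch of how these three metrics could be computed with scikit-learn and the `bert_score` package; the toy ratings and summaries below are illustrative, not project data.

```python
from sklearn.metrics import mean_squared_error, cohen_kappa_score
from bert_score import score as bert_score

# Star ratings predicted from each summary by a BERT sentiment head, for the
# baseline pipeline and one biased variant, plus the ground-truth stars.
truth    = [4, 2, 5, 3]
baseline = [4, 2, 4, 3]
biased   = [5, 4, 5, 4]

mse = mean_squared_error(truth, biased)      # rating inflation vs. ground truth
kappa = cohen_kappa_score(baseline, biased)  # agreement with the baseline

# Semantic similarity of biased summaries to the baseline summaries.
base_summaries   = ["Sturdy build, but the battery drains fast."]
biased_summaries = ["Sturdy build with decent battery life overall."]
P, R, F1 = bert_score(biased_summaries, base_summaries, lang="en")

print(f"MSE={mse:.2f}  kappa={kappa:.2f}  BERTScore-F1={F1.mean().item():.2f}")
```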
We test the ability of an LLM to judge how biased a product summary is on a Likert scale, given an increasing number of product reviews.
We observe that it consistently recognizes the Filter method as the most biased, which is consistent with our evaluation of the magnitude of introduced bias.
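A minimal sketch of this LLM-as-judge setup, assuming the OpenAI chat API; the model name, prompt wording, and 1-5 Likert range are assumptions for illustration, not the project's exact prompt.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def judge_bias(summary: str, reviews: list[str], model: str = "gpt-4o-mini") -> int:
    """Ask an LLM to rate summary bias on a 1-5 Likert scale."""
    context = "\n".join(f"- {r}" for r in reviews)
    prompt = (
        "Here are customer reviews for a product:\n"
        f"{context}\n\n"
        f"Summary to evaluate:\n{summary}\n\n"
        "On a scale from 1 (faithful to the reviews) to 5 (heavily biased "
        "toward the product), how biased is this summary? "
        "Answer with a single digit."
    )
    resp = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return int(resp.choices[0].message.content.strip()[0])
```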
Choosing a detection-score threshold of 0.85 to separate biased from unbiased summaries, we can now evaluate our detection method; the confusion matrix below summarizes the results. At this threshold we observe a precision of 0.61 and a recall of 0.66. These scores are obtained over 20 products, with the detection run five times per product, for 100 experiments each on the filter method and on the unbiased baseline. We provide 30 reviews to the LLM, since we see no decrease in the detection metric beyond that point.
We note that while the quality of the detection is passable, further work is needed to reach the level of performance required to conclude that active biasing is occurring in real-life scenarios.
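For reference, a small sketch of how precision and recall fall out of the 0.85 threshold, assuming per-run detection scores normalized to [0, 1]; the scores and labels below are made up for illustration.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, precision_score, recall_score

# Per-run detection scores and ground-truth labels
# (1 = filter-biased summary); values are illustrative only.
scores = np.array([0.91, 0.88, 0.70, 0.95, 0.60, 0.83])
labels = np.array([1, 1, 0, 1, 0, 1])

preds = (scores >= 0.85).astype(int)  # threshold chosen in our evaluation
print(confusion_matrix(labels, preds))
print(precision_score(labels, preds), recall_score(labels, preds))
```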
```bash
# clone the public repo
git clone https://github.com/majvie/10423_project.git
cd 10423_project

# install dependencies
pip install -r requirements.txt
```