BiasLens: News Bias Detector

Research Question: To what extent do media sources differ in their framing of news, and how can machine learning be used to categorize statements by their id 77B1 eological alignment?

Team

Eddie Tian, Jiahang Luo, Peter Zhang

Dataset

Article-Bias-Prediction dataset curated by Baly et al. (2020) is used in this project. The dataset contains over 30,000 news articles labeled with bias labels: left, center, and right. The dataset is split into training, validation, and test sets.

@inproceedings{baly2020we,
  author      = {Baly, Ramy and Da San Martino, Giovanni and Glass, James and Nakov, Preslav},
  title       = {We Can Detect Your Bias: Predicting the Political Ideology of News Articles},
  booktitle   = {Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  series      = {EMNLP~'20},
  NOmonth     = {November},
  year        = {2020}
  pages       = {4982--4991},
  NOpublisher = {Association for Computational Linguistics}
}

Models

Logistic Regression
Random Forest
LlaMA: See the README for more details

File Structure

llama: contains the LlaMA model and the evaluation of the model
    ├── README.md
    ├── llama_train.py: QLoRA finetuning script for the LlaMA model
    ├── llama_distill.py: Script to further finetune the LlaMA model on DeepSeek labels
    ├── Llama.ipynb
    ├── match_articles.py: Script to detect test data leakage
Random_forest.ipynb: contains the random forest model and the evaluation of the model
RQ_EDA_Preprocessing_logistic_stcking_Evaluation.ipynb: contains the EDA, preprocessing, logistic regression, and stacking models
Sentence_transformer_visualization.ipynb: contains the preliminary exploration and visualization with sentence transformer models including BGE-m3
requirements.txt: contains the required libraries to run the code

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BiasLens: News Bias Detector

Team

Dataset

Models

File Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
llama		llama
.DS_Store		.DS_Store
.gitignore		.gitignore
P3_MACS30100.pdf		P3_MACS30100.pdf
README.md		README.md
RQ_EDA_Preprocessing_logistic_stcking_Evaluation.ipynb		RQ_EDA_Preprocessing_logistic_stcking_Evaluation.ipynb
Random_forest.ipynb		Random_forest.ipynb
Sentence_transformer_visualization.ipynb		Sentence_transformer_visualization.ipynb
requirements.txt		requirements.txt

UChicago-ML-DL/30100-final

Folders and files

Latest commit

History

Repository files navigation

BiasLens: News Bias Detector

Team

Dataset

Models

File Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages