8000 GitHub - crasgaitis/llm_moral_plasticity: by "The Saints" @ UW | Andrew, Sanjana, Catherine | NLP Capstone CSE 481M
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

crasgaitis/llm_moral_plasticity

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

llm_moral_plasticity

Research Question:

Are the “moral values” of LLMs robust to morally irrelevant situational distractors? We investigate this question as part of our CSE 481M Natural Language Processing Capstone project. This repository serves as a platform to reproduce our results.

image

Useful Commands

Run evaluation with specified experiment, dataset, model, and question types:

CUDA_VISIBLE_DEVICES=0 python -m src.evaluate \
  --experiment-name "moraltest" \
  --dataset "moralchoice_high_ambiguity" \
  --model "google/flan-t5-small" \
  --question-types "ab" \
  --eval-nb-samples 5 \
  --eval-max-tokens 1

Run result collection on experiment and specified dataset:

python -m src.collect \
  --experiment-name "moraltest" \
  --dataset "moralchoice_high_ambiguity"

About

by "The Saints" @ UW | Andrew, Sanjana, Catherine | NLP Capstone CSE 481M

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.1%
  • Python 0.9%
0