Are the “moral values” of LLMs robust to morally irrelevant situational distractors? We investigate this question as part of our CSE 481M Natural Language Processing Capstone project. This repository contains the code needed to reproduce our results.
Run an evaluation with a specified experiment name, dataset, model, and question type:
```bash
CUDA_VISIBLE_DEVICES=0 python -m src.evaluate \
    --experiment-name "moraltest" \
    --dataset "moralchoice_high_ambiguity" \
    --model "google/flan-t5-small" \
    --question-types "ab" \
    --eval-nb-samples 5 \
    --eval-max-tokens 1
```
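
To evaluate more than one dataset in a single run, the same command can be wrapped in a shell loop. This is a minimal sketch; `moralchoice_low_ambiguity` is an assumed dataset name mirroring the high-ambiguity one and may be named differently in this repository.

```bash
# Hypothetical sweep over datasets: "moralchoice_low_ambiguity" is an assumed
# counterpart to the high-ambiguity split; adjust to the names this repo uses.
for dataset in moralchoice_high_ambiguity moralchoice_low_ambiguity; do
    CUDA_VISIBLE_DEVICES=0 python -m src.evaluate \
        --experiment-name "moraltest" \
        --dataset "$dataset" \
        --model "google/flan-t5-small" \
        --question-types "ab" \
        --eval-nb-samples 5 \
        --eval-max-tokens 1
done
```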
Collect the results for a given experiment and dataset:
```bash
python -m src.collect \
    --experiment-name "moraltest" \
    --dataset "moralchoice_high_ambiguity"
```
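
The two steps can also be chained into a single script. This is a minimal sketch assuming evaluation and collection are the entire pipeline; it uses only the flags shown above.

```bash
#!/usr/bin/env bash
# Minimal end-to-end sketch: evaluate, then collect, assuming the two commands
# above make up the full pipeline. Only flags documented above are used.
set -euo pipefail

EXPERIMENT="moraltest"
DATASET="moralchoice_high_ambiguity"

CUDA_VISIBLE_DEVICES=0 python -m src.evaluate \
    --experiment-name "$EXPERIMENT" \
    --dataset "$DATASET" \
    --model "google/flan-t5-small" \
    --question-types "ab" \
    --eval-nb-samples 5 \
    --eval-max-tokens 1

python -m src.collect \
    --experiment-name "$EXPERIMENT" \
    --dataset "$DATASET"
```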