This is my attempt at generating JUnit tests with LLMs, given either the method under test or a natural-language prompt.
The folders at the top contain the tests generated by t5-train.py, which fine-tunes the t5-small model on the jitx/Methods2Test_java_unit_test_code dataset. The dataset pairs Java methods (inputs) with JUnit tests (outputs). I first fine-tuned using only the focal method that needs to be tested; those results are in java_test_results and java_test_results_bigger_dataset. I also fine-tuned with additional context around the method, so the model has more to work with; those results are in java_test_results_src_fm_fc_ms_ff.
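For reference, here is a minimal sketch of the kind of fine-tuning loop t5-train.py implements, using Hugging Face transformers and datasets. The column names "src_fm" and "target", the 512-token limits, and the training hyperparameters are assumptions for illustration, not necessarily the exact values used in t5-train.py.

```python
# Minimal fine-tuning sketch (assumed column names and hyperparameters).
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSeq2SeqLM, DataCollatorForSeq2Seq,
                          Seq2SeqTrainer, Seq2SeqTrainingArguments)

dataset = load_dataset("jitx/Methods2Test_java_unit_test_code", split="train[:6000]")
tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

def preprocess(batch):
    # Input: the focal method source; label: the reference JUnit test.
    model_inputs = tokenizer(batch["src_fm"], max_length=512, truncation=True)
    labels = tokenizer(text_target=batch["target"], max_length=512, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = dataset.map(preprocess, batched=True, remove_columns=dataset.column_names)

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(output_dir="t5-small-junit",
                                  num_train_epochs=3,
                                  per_device_train_batch_size=8),
    train_dataset=tokenized,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```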
The generated tests do not compile. I intended to modify them slightly so that they would compile, which would have let me measure their coverage and correctness, but the outputs are too far from compilable code. Making them compile would require so much manual work that the result would no longer be comparable to the originally generated tests.
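If the outputs were closer to valid Java, compilability could be checked automatically, for example by running javac on each generated test. A hypothetical helper along those lines (assumes a JDK is on PATH and that JUnit and the class under test are on the classpath; the function name and defaults are made up for illustration):

```python
# Hypothetical compilability check for a generated test.
import pathlib
import subprocess
import tempfile

def compiles(test_source: str, class_name: str = "GeneratedTest", classpath: str = ".") -> bool:
    """Return True if javac compiles the generated test without errors."""
    with tempfile.TemporaryDirectory() as tmp:
        java_file = pathlib.Path(tmp) / f"{class_name}.java"
        java_file.write_text(test_source)
        result = subprocess.run(
            ["javac", "-cp", classpath, str(java_file)],
            capture_output=True, text=True,
        )
        if result.returncode != 0:
            print(result.stderr)  # surface the compiler errors for inspection
        return result.returncode == 0
```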
The main reason the results are poor is the choice of model and its limitations. I used a small model because a larger one would require more compute than my local machine provides. t5-small also restricts the input and output lengths, which limits its capacity to capture complex patterns in the code.
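For illustration, this is roughly how the length limitation shows up in practice, assuming inputs are truncated at 512 tokens as in the sketch above: anything past the limit is silently cut off, so the model never sees part of the method (or of the richer src_fm_fc_ms_ff context) it is supposed to write a test for.

```python
# Illustration of the input-length limitation (512-token truncation assumed).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("t5-small")
long_method = "public int compute(int x) { /* many statements */ return x; } " * 50
full = tokenizer(long_method)                                   # no truncation
truncated = tokenizer(long_method, max_length=512, truncation=True)
print(len(full["input_ids"]), "tokens in the full method")
print(len(truncated["input_ids"]), "tokens actually fed to the model")
```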
The next step would be to fine-tune a larger model on cloud resources and compare the results.
BONUS: The remaining files show other ways I approached the task. I tried different models, such as gpt2, but t5-small gave better results because it is a sequence-to-sequence model. I also used the Java tests from HumanEval-XL, where the input is a prompt (a description of the functionality of the method to be tested) and the output is a JUnit test. Those results were not great either, partly because there is far less data (<100 samples) than in Methods2Test_java_unit_test_code (>60,000 samples, of which I used 6,000), but also because the task itself is different (generating a test from a method vs. generating a test from a prompt); a small sketch of the two framings follows below.
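To make the difference between the two framings concrete, here is a toy sketch; the field names and example values are made up and do not reflect the exact schemas of the two datasets.

```python
# Toy contrast between the two task framings (field names and values are illustrative only).
methods2test_example = {
    # Input: the focal method's source code.
    "src_fm": "public int add(int a, int b) { return a + b; }",
    # Target: a JUnit test for that method.
    "target": "@Test public void testAdd() { assertEquals(5, new Calc().add(2, 3)); }",
}
humaneval_xl_example = {
    # Input: a natural-language description of the desired behaviour.
    "prompt": "Write a method add(int a, int b) that returns the sum of two integers.",
    # Target: again a JUnit test, but the model never sees the implementation.
    "test": "@Test public void testAdd() { assertEquals(5, new Solution().add(2, 3)); }",
}
# Both framings target JUnit tests, but the model learns from very different inputs:
# concrete source code in the first case, a natural-language specification in the second.
```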