De-Hallucinator

Iterative Grounding for LLM-Based Code Completion

This repository contains the scripts and dataset for "De-Hallucinator: Iterative Grounding for LLM-Based Code Completion" paper.

The benchmark directory contains the scripts and configuration files needed to run the evaluations. The coder directory contains the implementation of De-Hallucinator. The scripts in server allow running open source LLMs on a remote server so that De-Hallucinator can use them through an API call.

How to use

Install De-Hallucinator:

pip install -r requirements-client.txt
pip install -e .

De-Hallucinator is currently in experimental phase, so the use is through completion specifications from json files. Samples of such files can be found in benchmark/benchmark_configs/. The files can be generated for any project via benchmark/make_config.py and benchmark/API_config.py scripts.

Steps to run De-Hallucinator from scratch:

Copy the project to benchmark/GitHubProjects/.
Run

python benchmark/API_config.py --project benchmark/GitHubProjects/<your_project> --tests <path_to_tests> --packageName <package_name>

Create the experiments directory:

mkdir benchmark/experiment

Run

python benchmark/run_project.py --model <model_name> --config benchmark/becnhmark_configs/<project_config>.json --mode comment --k 4 --c 20 --noTests

These steps will first generate the completion tasks for your project, and then run De-Hallucinator to generate the completions.

To rerun the experiments from our paper, run

bash benchmark/run_all.sh

to run the baselines and De-Hallucinator with default settings on the 11 projects.
To run the ablation study, run

bash benchmark/run_var.sh

To generate the results from these experiments, run

python benchmark/process_results.py --project <path_to_project_in_experiment> --modes <list_of_modes> --output <output_dir>
python benchmark/further_process_results.py --results <output_from_previous>

Name		Name	Last commit message	Last commit date
Latest commit History 324 Commits
benchmark		benchmark
coder		coder
server		server
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
dataset_report_CodeGen.txt		dataset_report_CodeGen.txt
dataset_report_StarCoderPlus.txt		dataset_report_StarCoderPlus.txt
dataset_report_UniXcoder.txt		dataset_report_UniXcoder.txt
requirements-client.txt		requirements-client.txt
requirements-server.txt		requirements-server.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

De-Hallucinator

How to use

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

AryazE/dehallucinator

Folders and files

Latest commit

History

Repository files navigation

De-Hallucinator

How to use

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages