zarifikram/CrispEdit

CrispEdit

This repository hosts the code and data for the paper: CrispEdit.

https://arxiv.org/abs/2602.15823

Requirements:

  • Environment: requirements.txt (Python 3.9 is required for this repository)

    pip install -r requirements.txt

    If you get a pyarrow error, try:

    pip install --upgrade datasets pyarrow
  • Large Language Models to Edit:

    You have three options for loading the LLMs to edit:

    1. Download the LLMs you want to edit and put them in ./hugging_cache/

    2. Specify the path to your existing LLMs in the configuration files, e.g., ./hparams/FT/llama-7b.yaml:

      model_name: "your/path/to/LLMs"
    3. Provide the model name in the configuration files and the program will automatically call from_pretrained to download and load the model:

      model_name: "meta-llama/Meta-Llama-3-8B-Instruct"
    Whichever option you choose, also copy the .env.example file to create your own .env file and fill it in:

      cp .env.example .env
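The three loading options above come down to which string ends up being handed to from_pretrained: a local directory path or a Hugging Face Hub id. A minimal sketch of that resolution, assuming a hypothetical helper and the `org--name` directory naming convention (neither is part of this repository's actual code):

```python
import os

# Hypothetical helper: prefer a local copy under ./hugging_cache/,
# otherwise forward the configured string unchanged so that
# from_pretrained() fetches it from the Hugging Face Hub.
def resolve_model_path(model_name, cache_dir="./hugging_cache"):
    # Assumption: local copies are stored as "org--name" directories.
    local = os.path.join(cache_dir, model_name.replace("/", "--"))
    return local if os.path.isdir(local) else model_name

# With no local copy, the Hub id is forwarded unchanged:
resolve_model_path("meta-llama/Meta-Llama-3-8B-Instruct")
```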

  • Datasets: The data of ZsRE, COUNTERFACT, and WikiBigEdit are provided in ./data/

  • LLM-As-A-Judge Configuration: To use the LLM-As-A-Judge functionality, you must provide your OpenRouter API key via environment variables.

    1. Create a .env file in the root directory.

    2. Add your key as follows:

    API_KEY=your_open_router_key_here
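The .env file uses simple KEY=value lines. A minimal sketch of how such a line is parsed into an environment variable (in practice a library like python-dotenv would do this; the helper below is purely illustrative):

```python
import os

# Illustrative parser for one KEY=value line of a .env file.
# Returns (key, value), or None for blank lines and comments.
def parse_env_line(line):
    line = line.strip()
    if not line or line.startswith("#") or "=" not in line:
        return None
    key, _, value = line.partition("=")
    return key.strip(), value.strip()

pair = parse_env_line("API_KEY=your_open_router_key_here")
if pair is not None:
    os.environ[pair[0]] = pair[1]
```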

Training

  • CrispEdit

    python run_crispedit.py --model llama3-8b --data_type wiki --cache_sample_num 100 --energy_threshold 0.8 --batch_size 32 --wandb_project CrispEdit

    Note: The default datasets are wiki/zsre/counterfact, each containing 3,000 samples. For larger ZsRE splits, try --data_type zsre10k or --data_type zsre163k.

  • CrispEdit Sequential

    python run_crispedit.py --model llama3-8b --data_type zsre --cache_sample_num 100 --energy_threshold 0.8 --batch_size 32 --wandb_project CrispEdit --sequential_edit --num_edits 100

    Note: It is important to set --num_edits whenever --sequential_edit is enabled to define the edit batch size or sequence limit.

    Note: Set --recalculate_cache to recalculate the cache after large weight changes. Set --recalculate_weight_threshold (e.g., --recalculate_weight_threshold 0.1) to override the default threshold of a 25% change.

    Note: Set --disable_old_loss_check to skip calculating the old loss at every iteration.

    Note: Set --no_crisp to disable gradient projection (which essentially means regular fine-tuning).
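The --recalculate_weight_threshold note above suggests comparing how far the weights have drifted against a relative threshold. A toy sketch, assuming a relative L2 change (a hypothetical helper; the repository may measure the change differently):

```python
import math

# Hypothetical check: trigger a cache recalculation when the relative
# L2 change of the (flattened) weights exceeds the threshold.
def needs_recache(old_weights, new_weights, threshold=0.25):
    diff = math.sqrt(sum((n - o) ** 2 for n, o in zip(new_weights, old_weights)))
    base = math.sqrt(sum(o ** 2 for o in old_weights))
    return diff / base > threshold

needs_recache([1.0, 0.0], [1.0, 0.5])  # 50% relative change > default 25%
```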

  • MEMIT

    python edit.py --model llama3-8b --data_type wiki --editing_method MEMIT --num_edits 32 --eval_every 512 --batch_edit True --wandb_project CrispEdit
  • UltraEdit

    python edit.py --model llama3-8b --data_type wiki --editing_method UltraEdit --num_edits 32 --eval_every 512 --batch_edit True --wandb_project CrispEdit
  • AlphaEdit

    python edit.py --model llama3-8b --data_type wiki --editing_method AlphaEdit --num_edits 100 --eval_every 500 --batch_edit True --wandb_project CrispEdit
  • WISE

    python edit.py --model llama3-8b --data_type wiki --editing_method WISE --num_edits 1 --eval_every 512 --batch_edit False --wandb_project CrispEdit
  • MEND

    python edit.py --model llama3-8b --data_type wiki --editing_method MEND --num_edits 1 --eval_every 512 --batch_edit False --wandb_project CrispEdit
  • LoRA (Sequential Style)

    python edit.py --model llama3-8b --data_type wiki --editing_method LoRA --num_edits 100 --batch_size 32 --eval_every 100 --batch_edit True --wandb_project CrispEdit
  • LoRA (AdaLoRA)

    python run_crispedit.py --model llama3-8b --data_type wiki --batch_size 32 --wandb_project CrispEdit --no_crisp --perform_lora --lora_type adalora
  • Loc-BF-FT

    python locft-bf.py --model llama3-8b --data_type wiki --batch_size 32 --wandb_project CrispEdit
  • Adam-NSCL

    python alphaedit_ft.py --model llama3-8b --data_type wiki --cache_sample_num 10000 --energy_threshold 0.5 --batch_size 32 --wandb_project CrispEdit
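The --energy_threshold and --no_crisp flags above point at two ingredients: keeping only as many cached directions as needed to capture a given fraction of spectral energy, and projecting each gradient update away from those directions so preserved knowledge is not disturbed. A toy, pure-Python sketch of both ideas (hypothetical helpers; CrispEdit's actual update rule is defined in the paper):

```python
# Keep directions until their cumulative squared singular values
# reach the energy threshold (cf. --energy_threshold).
def keep_by_energy(singular_values, threshold):
    total = sum(s * s for s in singular_values)
    kept, acc = 0, 0.0
    for s in singular_values:
        acc += s * s
        kept += 1
        if acc / total >= threshold:
            break
    return kept

# Remove the components of a gradient along each (orthonormal)
# protected direction, leaving only the "safe" part of the update.
def project_out(grad, directions):
    g = list(grad)
    for u in directions:
        dot = sum(gi * ui for gi, ui in zip(g, u))
        g = [gi - dot * ui for gi, ui in zip(g, u)]
    return g

# Toy example: protecting the x-axis removes the gradient's x component.
g = project_out([3.0, 4.0], [[1.0, 0.0]])
```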

Evaluate

  • Base Model (Base Capabilities)
    python run_base_benchmarks.py --edited_model_dir models--meta-llama--Meta-Llama-3-8B-Instruct/snapshots/8afb486c1db24fe5011ec46dfbe5b5dccdb575c2/ --model_name llama3-8b --alg_name Base --data_type wiki --eval_num 200
  • Base Model (Base Capabilities) (Example: resuming an existing wandb run)
    python run_base_benchmarks.py --edited_model_dir models--meta-llama--Meta-Llama-3-8B-Instruct/snapshots/8afb486c1db24fe5011ec46dfbe5b5dccdb575c2/ --model_name llama3-8b --alg_name Base --data_type wiki --eval_num 200 --wandb_run_id your_wandb_run_id
  • Base Model (Edited Capabilities)
    python run_edited_benchmarks.py --edited_model_dir models--meta-llama--Meta-Llama-3-8B-Instruct/snapshots/8afb486c1db24fe5011ec46dfbe5b5dccdb575c2/ --model_name llama3-8b --max_length 40 --context_type qa_inst --alg_name Base --data_type wiki --evaluation_criteria llm_judge --eval_num 3000 
  • UltraEdit
    python run_benchmarks.py --edited_model_dir llama3-8b_UltraEdit_wiki --model_name llama3-8b --max_length 40 --context_type qa_inst --alg_name UltraEdit --data_type wiki --eval_num 30 --evaluation_criteria exact_match
  • CrispEdit
    python run_benchmarks.py --edited_model_dir llama3-8b_CrispEdit_wiki_0.5 --model_name llama3-8b --max_length 40 --context_type qa_inst --alg_name CrispEdit --data_type wiki --eval_num 30 --evaluation_criteria exact_match
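The --evaluation_criteria exact_match option compares the model's output against the reference answer. A minimal sketch, assuming whitespace- and case-insensitive string equality (the repository's normalization may differ):

```python
# Hypothetical exact-match metric: lowercase, strip, and collapse
# whitespace before comparing prediction and reference.
def exact_match(prediction, reference):
    def norm(s):
        return " ".join(s.lower().strip().split())
    return norm(prediction) == norm(reference)

pairs = [("Paris", " paris"), ("Rome", "Milan")]
score = sum(exact_match(p, r) for p, r in pairs) / len(pairs)
```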

About

A scalable, principled second-order constrained framework for editing LLMs.
