DecLimSup: Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher
You can run the code by referring to the script.sh file.
Our code also supports Gaudi HPUs; you can enable them by adding the --use_hpu flag.
You can run the code using the following command:

python inference.py

Below is a description of the key arguments:
- Description: Specifies the benchmark to evaluate the model on.
  Supported benchmarks: gsm8k, strategyqa, multiarith, math, arc_c, arc_e, svamp
- Description: Determines how many tokens will receive knowledge from the teacher model.
- Description: Enables running multiple alpha values in a single execution.
DecLimSup is our work that empirically analyzes contrastive decoding in a scenario where supervision from the teacher LLM is limited. We find that it is essential to adaptively over-trust or disregard the teacher's prediction based on the confidence of the small-scale LLM. Our experiments on a wide range of models and datasets demonstrate that our method consistently improves over conventional decoding strategies.
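The core idea above, gating teacher guidance on the student's own confidence, can be sketched as follows. This is a minimal illustration, not the repository's implementation: the max-probability confidence measure, the threshold tau, and the alpha-weighted contrastive blend are all illustrative assumptions.

```python
import math

def gated_next_token_logits(student_logits, teacher_logits, alpha, tau):
    """Apply teacher guidance only when the student model is uncertain.

    Illustrative sketch: confidence is the student's max softmax
    probability; alpha and tau are hypothetical hyperparameters.
    """
    # Student confidence = max softmax probability (computed stably)
    m = max(student_logits)
    exps = [math.exp(l - m) for l in student_logits]
    confidence = max(exps) / sum(exps)

    if confidence >= tau:
        # Student is confident: disregard the teacher entirely
        return list(student_logits)

    # Student is uncertain: trust the teacher by shifting the student
    # logits toward the teacher's (contrastive-style update)
    return [s + alpha * (t - s) for s, t in zip(student_logits, teacher_logits)]
```

With a sharply peaked student distribution the teacher call is skipped, so teacher supervision is spent only on the uncertain decoding steps.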
If you use this code, please cite the following paper:
@inproceedings{ok-etal-2024-decoding,
title = "Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher",
author = "Ok, Hyunjong and
Ryu, Jegwang and
Lee, Jaeho",
editor = "Al-Onaizan, Yaser and
Bansal, Mohit and
Chen, Yun-Nung",
booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
month = nov,
year = "2024",
address = "Miami, Florida, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.emnlp-main.693",
pages = "12460--12476",
}