---
layout: page
title: Selected Publications
---

For a complete list of my publications, please visit my Google Scholar profile.

2025

  • Group-Level Data Selection for Efficient Pretraining
    Zichun Yu, Fei Peng, Jie Lei, Arnold Overwijk, Wen-tau Yih, Chenyan Xiong
    NeurIPS 2025
    Paper{: .btn} Code{: .btn}

  • Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning
    Xiaochuan Li, Zichun Yu, Chenyan Xiong
    ICLR 2025
    Paper{: .btn} Code{: .btn}

2024

  • MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models
    Zichun Yu, Spandan Das, Chenyan Xiong
    NeurIPS 2024
    Paper{: .btn} Code{: .btn}

2023

  • An In-depth Look at Gemini's Language Abilities
    Syeda Nahida Akter*, Zichun Yu*, Aashiq Muhamed*, Tianyue Ou*, Alex Bäuerle, Ángel Alexander Cabrera, Krish Dholakia, Chenyan Xiong, Graham Neubig
    Preprint
    Paper{: .btn} Code{: .btn}

  • Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In
    Zichun Yu, Chenyan Xiong, Shi Yu, Zhiyuan Liu
    ACL 2023
    Paper{: .btn} Code{: .btn}

2022

  • Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
    Zichun Yu, Tianyu Gao, Zhengyan Zhang, Yankai Lin, Zhiyuan Liu, Maosong Sun, Jie Zhou
    COLING 2022
    Paper{: .btn} Code{: .btn}