
Repository containing implementation for co-training large (e.g., T0) and smaller (e.g., BERT) language models to enhance few-shot performance


Enhanced Co-training for Large Language Models

This repository contains the implementation for our ICML 2022 paper Co-training Improves Prompt-based Learning for Large Language Models and subsequent advancements, including tuning methodologies based on T-Few.

This code can be used for:

  • Enhancing the zero-shot and few-shot performance of large language models
  • Distilling large models such as GPT-3 and T0 into compact, task-specific models
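At a high level, co-training alternates between two models that pseudo-label unlabeled data for each other, keeping only confident labels. The sketch below is illustrative only, assuming toy stand-in models and a hypothetical `cotrain` helper; the actual repository uses prompted LLMs (e.g., T0) and a small fine-tuned student, with a different API.

```python
class ToyModel:
    """Hypothetical stand-in for a prompted LLM or a small fine-tuned model.

    A real model would return a class label and a calibrated confidence;
    here both are deterministic functions of the input for illustration.
    """

    def __init__(self, bias):
        self.bias = bias   # fraction of inputs this model labels as class 1
        self.seen = []     # pseudo-labeled examples absorbed via fit()

    def predict(self, x):
        label = 1 if (x % 10) / 10.0 < self.bias else 0
        conf = 0.95 if x % 2 == 0 else 0.5   # fake confidence score
        return label, conf

    def fit(self, examples):
        # A real student would be fine-tuned; here we just record the data.
        self.seen.extend(examples)


def cotrain(model_a, model_b, unlabeled, rounds=3, threshold=0.9):
    """Minimal co-training loop (a sketch, not this repo's implementation).

    Each round, every model pseudo-labels the unlabeled pool, and the
    labels it is confident about are used to update the *other* model.
    """
    for _ in range(rounds):
        for teacher, student in ((model_a, model_b), (model_b, model_a)):
            confident = []
            for x in unlabeled:
                label, conf = teacher.predict(x)
                if conf >= threshold:
                    confident.append((x, label))
            student.fit(confident)
    return model_a, model_b
```

The confidence threshold is the key knob: only high-confidence pseudo-labels cross between models, which is what keeps the two views from reinforcing each other's mistakes.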

Many parts of this repository are built on top of the outstanding T-Few repository.

If you find this code useful, please consider citing our paper:

@inproceedings{lang2022co,
  title={Co-training improves prompt-based learning for large language models},
  author={Lang, Hunter and Agrawal, Monica N and Kim, Yoon and Sontag, David},
  booktitle={International Conference on Machine Learning},
  pages={11985--12003},
  year={2022},
  organization={PMLR}
}

Setup, usage, model training, reproducing results, and applying the method to your own dataset are covered in detail in the original README.

Note:

This is the project's new home under the new owner lopeve. For references or to contact the authors, please see the original paper and the repository maintained by clinicalml.
