GreaterPrompt: A Python Toolkit for Prompt Optimization

Installation | Quick-Start | Colab-Examples | Input-Format | Optimize-Configs | Features | GreaterPrompt-UI

GreaterPrompt is a python toolkit for prompt optimization which only levarges small models to achieve a better performances by large models. Our toolkit includes 3 different optimizer and supports 2 models family now.

🔧 Installation

To get started with GreaterPormpt, you can simply install with pip:

⚠️ Currently, we only support python 3.11

pip install greaterprompt

Notice: Its strongly recommended to create a new conda environment to install greaterprompt to avoid the overwrite your other python packages.

🚀 Quick Start

Step 1: Build Input Dataloader

We support 2 methods to build dataloader, either load from a jsonl file or custom input

Method 1: load from a jsonl file

from greaterprompt import GreaterDataloader

dataset1 = GreaterDataloader(data_path="./data/boolean_expressions.jsonl")

Method 2: custom input

from greaterprompt import GreaterDataloader

dataset2 = GreaterDataloader(custom_inputs=[
    {
        "question": "((-1 + 2 + 9 * 5) - (-2 + -4 + -4 * -7)) =", 
        "prompt": "Use logical reasoning and think step by step.", 
        "answer": "24"
    },
    {
        "question": "((-9 * -5 - 6 + -2) - (-8 - -6 * -3 * 1)) =",
        "prompt": "Use logical reasoning and think step by step.",
        "answer": "63"
     },
    {
        "question": "((3 * -3 * 6 + -5) - (-2 + -7 - 7 - -7)) =",
        "prompt": "Use logical reasoning and think step by step.",
        "answer": "-50"
    }
])

Step 2: Init the Model and Tokenizer (Only for GreaterPrompt)

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_PATH = "google/gemma-2-9b-it"
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, torch_dtype=torch.bfloat16, device_map='cuda:0')
model.gradient_checkpointing_enable() #to save the cuda memory
tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)

Step 3: Set Optimize Configs

This step is totally Optional. For details about parameters, please refer to Optimize-Configs

from torch.nn import functional as F

# for GreaterOptimizer
greater_optimize_config = {
    "intersect_q": 5,
    "candidates_topk": 10,
    "loss_function": F.cross_entropy,
    "perplexity_loss": True,
    "perplexity_lambda": 0.2,
    "filter": True,
    "generate_config": {
        "do_sample": True,
        "temperature": 0.2,
        "max_new_tokens": 512,
    }
}

# for ApeOptimizer, ApoOptimizer, Pe2Optimizer
optimize_config = {
    "task_model": "openai_gpt35_turbo_instruct",
    "optim_model": "openai_gpt4_turbo",
}

# for TextGradOptimizer
textgrad_optimize_config = {
    "evaluation_engine": "meta-llama/Meta-Llama-3-8B-Instruct",
    "test_engine": "meta-llama/Meta-Llama-3-8B-Instruct",
    "device": "cuda:0"
}

Step 4: Init the Optimizers you wanna use

To use the APO, APE and PE2 optimizer, OPENAI_API_KEY is required. There are two ways to set the OPENAI_API_KEY:

Set the OPENAI_API_KEY in the environment variables

export OPENAI_API_KEY=<your_openai_api_key>

Set the OPENAI_API_KEY in the code

import os
os.environ["OPENAI_API_KEY"] = "<your_openai_api_key>"

from greaterprompt import (
    ApeOptimizer, ApoOptimizer, GreaterOptimizer, GreaterDataloader, Pe2Optimizer, TextGradOptimizer
)

ape_optimizer = ApeOptimizer(
    optimize_config=optimize_config # Optional
)

apo_optimizer = ApoOptimizer(
    optimize_config=optimize_config # Optional
)

greater_optimizer = GreaterOptimizer(
    model=model,
    tokenizer=tokenizer,
    optimize_config=greater_optimize_config # Optional
)

pe2_optimizer = Pe2Optimizer(
    optimize_config=optimize_config # Optional
)

textgrad_optimizer = TextGradOptimizer(
    textgrad_optimize_config # Optional
)

Step 5: Pass the Dataloader to Optimizer and Optimize

ape_result = ape_optimizer.optimize(dataset1, p_init="think step by step")
apo_result = apo_optimizer.optimize(dataset2, p_init="think step by step")
greater_result = greater_optimizer.optimize(
    dataset1,
    p_extractor="\nNext, only give the exact answer, no extract words or any punctuation:",
    rounds=80
)
pe2_result = pe2_optimizer.optimize(dataset2, p_init="think step by step")
textgrad_result = textgrad_optimizer.optimize(dataset1, p_init="think step by step")

# print results
print(f'ape_result: {ape_result}')
print(f'-' * 30)
print(f'apo_result: {apo_result}')
print(f'-' * 30)
print(f'greater_result: {greater_result}')
print(f'-' * 30)
print(f'pe2_result: {pe2_result}')
print(f'-' * 30)
print(f'textgrad_result: {textgrad_result}')

📖 Input Format

If you wanna use jsonl file as the input, please make sure each line contains "question", "prompt" and "answer" three mandatory keys

{"id": "0", "question": "not ( True ) and ( True ) is", "prompt": "Use logical reasoning and think step by step.", "answer": "False"}
{"id": "1", "question": "True and not not ( not False ) is", "prompt": "Use logical reasoning and think step by step.", "answer": "True"}
{"id": "2", "question": "not True or False or ( False ) is", "prompt": "Use logical reasoning and think step by step.", "answer": "False"}
{"id": "3", "question": "False or not ( True ) and False is", "prompt": "Use logical reasoning and think step by step.", "answer": "False"}
{"id": "4", "question": "True or not False and True and False is", "prompt": "Use logical reasoning and think step by step.", "answer": "True"}

🤖 Optimize Configs

ApeOptimizer, ApoOptimizer, Pe2Optimizer Parameters

task_model: str, the LLM that performs the task with a prompt. You can use OpenAI instruct models like openai_gpt35_turbo_instruct or you can also use models supported in vLLM. The code currently supports mistralai/Mistral-7B-Instruct-v0.2, mosaicml/mpt-7b-instruct, 01-ai/Yi-6B.
optim_model: str, the LLM that performs prompt engineering. Currently the code supports openai_gpt35, openai_gpt4, openai_gpt4_turbo, openai_gpt4o, openai_gpt4o_mini

GreaterPrompt Parameters

intersect_q: int, use how many question/prompt inpur pair to build a batch to get candidates.
candidates_topk: int, sample how many candidates for each p_i.
loss_function: Callable[torch.Tensor, torch.Tensor] -> torch.Tensor, the loss function used for the backward to get the gradients.
perplexity_loss: bool, whether to enable the perplexity_loss.
perplexity_lambda: float, if perplexity loss was enabled, its weight in the whole loss function.
filter: bool, whether to filter the p* to make sure all prompts are human readable.
generate_config: dict: configs used for transformer model's generation.

TextGradOptimizer Parameters

evaluation_engine: str, currently we only support Llama3 family models.
test_engine: str, currently we only support Llama3 family models.
device: str, which device you wanna use to load the model.

✨ Features

For GreaterOptimizer, we support custom loss function in order to optimize different tasks, you could either use customed loss function or other pytorch loss functions!

import torch
from torch.nn import functional as F

def dummy_loss_fn(y_pred: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    loss = F.cross_entropy(y_pred, y)
    dummy_loss = 2 * loss + 1

    return dummy_loss

# use custom loss function
optimize_config = {
    "loss_function": dummy_loss_fn
}

# use other pytorch loss function
optimize_config = {
    "loss_function": F.nll_loss
}

🎨 GreaterPrompt UI

With GreaterPrompt-UI, you can easily and quickly configure and experience the supported optimize methods through our visual interface, it makes everything much more efficient!

Before using the UI interface, you have to clone the repo

git clone https://github.com/psunlpgroup/GreaterPrompt.git

cd Web
python web.py

🔖 License

GreaterPrompt is licensed under the MIT License.

🌟 Citation

Please kindly cite our work if helps your research:

@misc{zheng2025greaterpromptunifiedcustomizablehighperforming,
      title={GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization}, 
      author={Wenliang Zheng and Sarkar Snigdha Sarathi Das and Yusen Zhang and Rui Zhang},
      year={2025},
      eprint={2504.03975},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2504.03975}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 97 Commits
Web		Web
data		data
dist		dist
images		images
src/greaterprompt		src/greaterprompt
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_pypi.md		README_pypi.md
example.py		example.py
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GreaterPrompt: A Python Toolkit for Prompt Optimization

GreaterPrompt: A Python Toolkit for Prompt Optimization

Installation | Quick-Start | Colab-Examples | Input-Format | Optimize-Configs | Features | GreaterPrompt-UI

🔧 Installation

🚀 Quick Start

Step 1: Build Input Dataloader

Method 1: load from a jsonl file

Method 2: custom input

Step 2: Init the Model and Tokenizer (Only for GreaterPrompt)

Step 3: Set Optimize Configs

Step 4: Init the Optimizers you wanna use

Step 5: Pass the Dataloader to Optimizer and Optimize

📖 Input Format

🤖 Optimize Configs

✨ Features

🎨 GreaterPrompt UI

🔖 License

🌟 Citation

About

Uh oh!

Releases

Packages

Languages

License

PAWSLabUniversityOfPittsburgh/GreaterPrompt-a2un

Folders and files

Latest commit

History

Repository files navigation

GreaterPrompt: A Python Toolkit for Prompt Optimization

GreaterPrompt: A Python Toolkit for Prompt Optimization

Installation | Quick-Start | Colab-Examples | Input-Format | Optimize-Configs | Features | GreaterPrompt-UI

🔧 Installation

🚀 Quick Start

Step 1: Build Input Dataloader

Method 1: load from a jsonl file

Method 2: custom input

Step 2: Init the Model and Tokenizer (Only for GreaterPrompt)

Step 3: Set Optimize Configs

Step 4: Init the Optimizers you wanna use

Step 5: Pass the Dataloader to Optimizer and Optimize

📖 Input Format

🤖 Optimize Configs

✨ Features

🎨 GreaterPrompt UI

🔖 License

🌟 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages