[KDD'25] UQABench

Title: UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering [KDD 2025 Accepted (Oral) Paper]

The paper link: UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering.

The source data is available in HuggingFace and Kaggle.

Overview

Description

The UQABench is a benchmark dataset for evaluating user embeddings in prompting LLMs for personalized question answering. The standardized evaluation process includes pre-training, fine-tuning, and evaluating stages. We provide the requirements and quick-start scripts for each stage.

The source data are user interactions collected and processed from Taobao. Following previous work, we randomly split the data into 9:1 as the training and test sets. The dataset statistics are summarized as follows:

Data Split	Total	#Training	#Test
Interaction	31,317,087	28,094,799	3,222,288

Specifically, the training set serves in the pre-training and fine-tuning (aligning) stages. Then, we design task-specific question prompts based on the test set. We refine the questions, filter out low-quality questions, and eventually get 7,192 personalized Q&A for the evaluating stage.

Download Data & LLM

Download data from HuggingFace or Kaggle.
Download Qwen/Qwen2.5-3B-Instruct from Huggingface.

Requirements

pytorch 2.4
fbgemm_gpu
transformers
causal_conv1d==1.4.0
mamba_ssm==2.2.3

Pretrain

bash scripts/pretrain_trm_plus.sh

Alignment

bash scripts/align_trm_plus.sh

Generation

bash scripts/generate_trm_plus.sh

Evaluation

python calc_metrics_acc.py generated/trm_plus_align_frozen.jsonl

Citation

Please cite our paper if you use our dataset.

@inproceedings{liu2025uqabench,
  title={UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering},
  author={Liu, Langming and Liu, Shilei and Yuan, Yujin and Zhang, Yizhen and Yan, Bencheng and Zeng, Zhiyuan and Wang, Zihao and Liu, Jiaqi and Wang, Di and Su, Wenbo and others},
  booktitle={Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2},
  pages={5652--5661},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
pic		pic
scripts		scripts
uqa_bench		uqa_bench
README.md		README.md
calc_metrics_acc.py		calc_metrics_acc.py
run_align_fsdp.py		run_align_fsdp.py
run_generation.py		run_generation.py
run_pretrain_ddp.py		run_pretrain_ddp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[KDD'25] UQABench

Overview

Description

Download Data & LLM

Requirements

Pretrain

Alignment

Generation

Evaluation

Citation

About

Uh oh!

Releases

Packages

Languages

OpenStellarTeam/UQABench

Folders and files

Latest commit

History

Repository files navigation

[KDD'25] UQABench

Overview

Description

Download Data & LLM

Requirements

Pretrain

Alignment

Generation

Evaluation

Citation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages