RedTrans

RedTrans is a large-scale translation model tailored for social media platforms, designed to handle informal and culturally embedded expressions like slang, emojis, and memes in both Chinese and English.

🔍 Overview

Traditional translation systems often struggle with the highly contextual, culturally nuanced content found on social media, such as memes, pop culture references, internet slang, and emojis. RedTrans addresses this gap by combining large language models with fine-tuning on social-domain data and preference-aligned training to produce more accurate, culturally aware translations.

✨ Key Features

🧠 Social-context aware translation: Designed to handle slang, internet memes, emoji semantics, and culturally rich expressions
📊 RedTrans-Bench benchmark: 2,800+ curated translation cases from real-world social media content, including posts, comments, and captions
🔁 Dual-LLM back-translation sampling: Enhances data diversity and improves generalization
🎯 RePO (Rewritten Preference Optimization): Human-in-the-loop correction of preference pairs for a cleaner RLHF signal
✅ Production-ready: Already deployed in a real-world social app used by millions
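
This README does not spell out how the back-translation sampling step works, so the following is only a generic round-trip illustration and not the authors' pipeline. It assumes a "model" is any callable from text to text; the names `round_trip`, `sample_candidates`, and the toy dictionary models are hypothetical, and the character-level similarity score is a crude stand-in for a real quality metric.

```python
from difflib import SequenceMatcher
from typing import Callable, List, Tuple

# Hypothetical stand-in for an LLM call: any function mapping text to text.
Translator = Callable[[str], str]


def round_trip(source: str, forward: Translator, backward: Translator) -> Tuple[str, float]:
    """Translate forward, translate back, and score how well the source survives."""
    candidate = forward(source)
    restored = backward(candidate)
    # Character-level similarity in [0, 1]; only a rough proxy for meaning preservation.
    score = SequenceMatcher(None, source, restored).ratio()
    return candidate, score


def sample_candidates(
    source: str,
    forward_models: List[Translator],
    backward_models: List[Translator],
) -> List[Tuple[str, float]]:
    """Collect one candidate per (forward, backward) pair, best round trip first."""
    scored = [round_trip(source, f, b) for f in forward_models for b in backward_models]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy dictionary "models" (assumptions) so the sketch runs without any API access.
ZH_EN_A = {"yyds": "the GOAT"}
ZH_EN_B = {"yyds": "forever the best"}
EN_ZH = {"the GOAT": "yyds"}


def model_a(text: str) -> str:
    return ZH_EN_A.get(text, text)


def model_b(text: str) -> str:
    return ZH_EN_B.get(text, text)


def back_model(text: str) -> str:
    return EN_ZH.get(text, text)


ranked = sample_candidates("yyds", [model_a, model_b], [back_model])
```

Using two forward models yields several distinct candidates for the same source, which is the sense in which sampling of this kind can add diversity; in practice the callables would wrap real LLM requests and the score would come from a stronger metric or human review.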

📦 Project Structure

.
├── Data License              # License for dataset usage
├── README.md                 # Project documentation (this file)
├── RedTrans-Bench.json       # Benchmark dataset for SNS translation
└── utils.py                  # Utility functions (data processing, evaluation, etc.)
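
The schema of RedTrans-Bench.json is not documented here, so a reasonable first step is to load the file and inspect its shape before writing any evaluation code. The sketch below assumes only that the file is valid UTF-8 JSON; `load_benchmark` and `summarize` are hypothetical helper names and are not taken from utils.py.

```python
import json
from collections import Counter
from pathlib import Path
from typing import Any, Dict


def load_benchmark(path: str) -> Any:
    """Read a JSON file as UTF-8 (needed for Chinese text and emojis)."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def summarize(data: Any) -> Dict[str, Any]:
    """Report the top-level type, the entry count, and the field names seen."""
    if isinstance(data, list):
        keys: Counter = Counter()
        for item in data:
            if isinstance(item, dict):
                keys.update(item.keys())  # count how many records carry each field
        return {"type": "list", "count": len(data), "keys": dict(keys)}
    if isinstance(data, dict):
        return {"type": "dict", "count": len(data), "keys": {k: 1 for k in data}}
    return {"type": type(data).__name__, "count": 1, "keys": {}}


if __name__ == "__main__":
    # Path assumes the script runs from the repository root.
    print(summarize(load_benchmark("RedTrans-Bench.json")))
```

The summary shows which fields each record carries, which is enough to decide how to map entries to source and reference translations.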

📚 Citation

@misc{guo2025redefiningmachinetranslationsocial,
      title={Redefining Machine Translation on Social Network Services with Large Language Models}, 
      author={Hongcheng Guo and Fei Zhao and Shaosheng Cao and Xinze Lyu and Ziyan Liu and Yue Wang and Boyang Wang and Zhoujun Li and Chonggang Lu and Zhe Xu and Yao Hu},
      year={2025},
      eprint={2504.07901},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2504.07901}, 
}

🤝 Contributing & Contact

Feel free to open an issue or submit a PR. If you’re interested in LLM/VLM for social media, we’d love to collaborate or hear from you!
