Skip to content

HC-Guo/RedTrans

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

RedTrans

RedTrans is a large-scale translation model tailored for social media platforms, designed to handle informal and culturally embedded expressions like slang, emojis, and memes in both Chinese and English.


πŸ” Overview

Traditional translation systems often struggle with the highly contextual and culturally nuanced content found on social mediaβ€”think memes, pop culture references, internet slang, and emojis. RedTrans bridges this gap by combining large language models with fine-tuning on social-domain data and preference-aligned training to deliver more accurate and culturally aware translations.


✨ Key Features

  • 🧠 Social-context aware translation: Designed to handle slang, internet memes, emoji semantics, and culturally rich expressions
  • πŸ“Š RedTrans-Bench benchmark: 2,800+ curated translation cases from real-world social media content, including posts, comments, and captions
  • πŸ” Dual-LLM back-translation sampling: Enhances data diversity and improves generalization
  • 🎯 RePO (Rewritten Preference Optimization): Human-in-the-loop preference correction for cleaner RLHF signal
  • βœ… Production-ready: Already deployed in a real-world social app used by millions

πŸ“¦ Project Structure

.
β”œβ”€β”€ Data License              # License for dataset usage
β”œβ”€β”€ README.md                 # Project documentation (this file)
β”œβ”€β”€ RedTrans-Bench.json       # Benchmark dataset for SNS translation
└── utils.py                  # Utility functions (data processing, evaluation, etc.)

πŸ“š Citation

@misc{guo2025redefiningmachinetranslationsocial,
      title={Redefining Machine Translation on Social Network Services with Large Language Models}, 
      author={Hongcheng Guo and Fei Zhao and Shaosheng Cao and Xinze Lyu and Ziyan Liu and Yue Wang and Boyang Wang and Zhoujun Li and Chonggang Lu and Zhe Xu and Yao Hu},
      year={2025},
      eprint={2504.07901},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2504.07901}, 
}

🀝 Contributing & Contact

Feel free to open an issue or submit a PR. If you’re interested in LLM/VLM for social media, we’d love to collaborate or hear from you!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages