Skip to content

[ACL2025] Official repository for "Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media"

Notifications You must be signed in to change notification settings

TrustAIRLab/AIGT_on_Social_Media

Repository files navigation

Are We in the AI-Generated Text World Already? 🤖📱

arXiv Dataset License

[ACL2025] Official repository for "Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media"

Method Overview

🚀 Quick Start

Installation

git clone https://github.com/Vincent-HKUSTGZ/AIGT_on_Social_Media.git
cd AIGT_on_Social_Media

# Create environment
conda create -n aigt python=3.10 -y
conda activate aigt

# Install dependencies
pip install -r requirements.txt

Training OSM-Det

# Basic training
python train_osm_det.py --model_path allenai/longformer-base-4096

# Custom parameters
python train_osm_det.py \
  --model_path allenai/longformer-base-4096 \
  --num_epochs 10 \
  --batch_size 5 \
  --output_dir ./my_osm_det_model

Evaluation OSM-Det on AIGTBench

# Evaluate by platform (Medium, Quora, Reddit)
python evaluate_aigtbench.py --model_path ./osm-det-model/final_model

# Custom evaluation
python evaluate_aigtbench.py \
  --model_path ./my_model \
  --batch_size 16 \
  --save_results ./results

📊 Datasets

AIGTBench

  • 🤗 Hugging Face: AIGTBench
  • Content: ~28.77M AIGT + 13.55M human-written samples
  • Platforms: Medium, Quora, Reddit

SM-D Dataset

🏆 OSM-Det Checkpoint

📋 Requirements

  • Python 3.10+
  • CUDA 11.8+ (for GPU)
  • 41GB+ GPU memory (for training)
  • 21GB+ GPU memory (for inference)

📜 Citation

@inproceedings{SZSZLBZH25,
    title = {{Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media}},
    author = {Zhen Sun and Zongmin Zhang and Xinyue Shen and Ziyi Zhang and Yule Liu and Michael Backes and Yang Zhang and Xinlei He},
booktitle = {{Annual Meeting of the Association for Computational Linguistics (ACL)}},
    pages = {},
    publisher ={ACL},
    year = {2025}
}

⚖️ License

Apache 2.0 License

About

[ACL2025] Official repository for "Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages