2025 SW-Centric Universities Digital Contest: Text Discrimination Challenge

Official Website: https://dacon.io/competitions/official/236473/overview/description

Overview

With the recent advancement of Generative AI, particularly Large Language Models (LLMs), it has become increasingly difficult to distinguish between AI-generated and human-written text. To address societal issues like the spread of misinformation and public opinion manipulation, this project aims to develop an AI model that predicts the probability of a given text being generated by AI.

The goal is to develop reliable AI-generated content detection technology, contributing to the responsible use of AI and restoring trust in digital information.

Problem Definition

Objective: Develop an AI model to predict the probability (from 0 to 1) that a given paragraph of text was written by a generative AI.
Unique Labeling Scheme:
- Training Data: Labeled at the full-text level. If even a single paragraph in a document is AI-generated, the entire document is labeled as 'AI-written (1)'. Paragraph-level labels are not provided.
- Evaluation Data: Provided at the paragraph level. The model must submit a probability for each individual paragraph.
Core Challenge: The key challenge is to perform paragraph-level prediction using document-level weak labels.
Additional Rule: Using context from other paragraphs within the same document (grouped by title) is permitted and encouraged for inference.

Team

1	2	3	4
Tae-Min, Kim	Jun-Hyuk, Seo (BuAs)	Jae-Hyun, Jo	Geon-Woo, Yoo

🏆 Execution Results

🏆 Award: Grand Prize (IITP President's Award)

Final Performance

The model achieved the following scores on the competition's official leaderboard, securing the second-place position.

Metric	Public Score	Private Score (Final)
ROC AUC	0.9381	0.9323

Our Presentation

You can check our presentation at this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
clustering		clustering
config		config
image		image
src		src
.DS_Store		.DS_Store
README.md		README.md
[AI 부문](상상부기) 발표자료.pdf		[AI 부문](상상부기) 발표자료.pdf
ensemble.py		ensemble.py
generation.py		generation.py
inference.py		inference.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

2025 SW-Centric Universities Digital Contest: Text Discrimination Challenge

Overview

Problem Definition

Team

🏆 Execution Results

Final Performance

Our Presentation

About

Uh oh!

Releases

Packages

Languages

SeoBuAs/2025_SW_Univ_Text_Challenge

Folders and files

Latest commit

History

Repository files navigation

2025 SW-Centric Universities Digital Contest: Text Discrimination Challenge

Overview

Problem Definition

Team

🏆 Execution Results

Final Performance

Our Presentation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages