Open
Description
Thanks for setting up this repository! I would like to add our papers, "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games" and "Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games" to this list.
- FlashAdventure
🎓 Venue: EMNLP 2025 Main Conference
🔗 Website: https://ahnjaewoo.github.io/flashadventure/
📄 Paper: https://arxiv.org/abs/2509.01052
🌎 Code: https://github.com/ahnjaewoo/FlashAdventure
- Orak
🔗 Website: https://krafton-ai.github.io/orak-leaderboard/
📄 Paper: https://arxiv.org/abs/2506.03610
🌎 Code: https://github.com/krafton-ai/ORAK
Thanks!