5万字长文全面解读GUI Agent的前世今生 - 文章 - 开发者社区 - 火山引擎
[2407.09018v1] AUITestAgent: Automatic Requirements Oriented GUI Function Testing
[2311.08649] Autonomous Large Language Model Agents Enabling Intent-Driven Mobile GUI Testing
[****2410.12157] Leveraging Large Vision Language Model For Better Automatic Web GUI Testing****
[2401.10935] SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
[2401.13919] WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
[2307.12856] A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
[2411.06559] Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
[2502.17419] From System 1 to System 2: A Survey of Reasoning Large Language Models