PDF Viewer

BUILDER'S SANDBOX

Build This Paper

Use an AI coding agent to implement this research.

OpenAI Codex
OpenAI CodexAI Agent

Lightweight coding agent in your terminal.

Claude Code
Claude CodeAI Agent

Agentic coding tool for terminal workflows.

AntiGravity IDE
AntiGravity IDEScaffolding

AI agent mindset installer and workflow scaffolder.

Cursor
CursorIDE

AI-first code editor built on VS Code.

VS Code
VS CodeIDE

Free, open-source editor by Microsoft.

Estimated $9K - $13K over 6-10 weeks.

See exactly what it costs to build this -- with 3 comparable funded startups.

7-day free trial. Cancel anytime.

Discover the researchers behind this paper and find similar experts.

7-day free trial. Cancel anytime.

References (47)

[1]
Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning
2026Jinyang Wu, Guocheng Zhai et al.
[2]
A Survey on Agentic Multimodal Large Language Models
2025Huanjin Yao, Ruifei Zhang et al.
[3]
UI-Venus Technical Report: Building High-performance UI Agents with RFT
2025Zhangxuan Gu, Zhengwen Zeng et al.
[4]
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
2025Gheorghe Comanici, Eric Bieber et al.
[5]
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
2025Yuquan Xie, Zaijing Li et al.
[6]
FingerTip 20K: A Benchmark for Proactive and Personalized Mobile LLM Agents
2025Qinglong Yang, Haoming Li et al.
[7]
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
2025Zhong Zhang, Ya-Ting Lu et al.
[8]
GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents
2025Yuqi Zhou, Sunhao Dai et al.
[9]
ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions
2025Bufang Yang, Lilin Xu et al.
[10]
GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
2025Run Luo, Lu Wang et al.
[11]
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use
2025Kaixin Li, Ziyang Meng et al.
[12]
UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning
2025Zhengxi Lu, Yuxiang Chai et al.
[13]
A Survey on (M)LLM-Based GUI Agents
2025Fei Tang, Haolei Xu et al.
[14]
MP-GUI: Modality Perception with MLLMs for GUI Understanding
2025Ziwei Wang, Weizhi Chen et al.
[15]
Qwen2.5-VL Technical Report
2025Shuai Bai, Keqin Chen et al.
[16]
WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting Point
2025Henry Hengyuan Zhao, Kai Yang et al.
[17]
AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs
2025Hongxin Li, Jingfan Chen et al.
[18]
Proactive Conversational AI: A Comprehensive Survey of Advancements and Opportunities
2025Yang Deng, Lizi Liao et al.
[19]
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
2025Zhenhailong Wang, Haiyang Xu et al.
[20]
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
2025Xueyu Hu, Tao Xiong et al.

Showing 20 of 47 references

Founder's Pitch

"ProactiveMobile benchmark enhances proactive intelligence in mobile agents by providing a real-world complexity framework for evaluating proactive MLLMs."

Mobile AIScore: 4View PDF ↗

Commercial Viability Breakdown

0-10 scale

High Potential

2/4 signals

5

Quick Build

0/4 signals

0

Series A Potential

0/4 signals

0

Sources used for this analysis

arXiv Paper

Full-text PDF analysis of the research paper

GitHub Repository

Code availability, stars, and contributor activity

Citation Network

Semantic Scholar citations and co-citation patterns

Community Predictions

Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 2/25/2026

Explore the full citation network and related research.

7-day free trial. Cancel anytime.

Understand the commercial significance and market impact.

7-day free trial. Cancel anytime.

Get detailed profiles of the research team.

7-day free trial. Cancel anytime.