Estimated build cost: $9K-$13K over 6-10 weeks.

Founder's Pitch

"Develop MATEO as a benchmark tool to enhance temporal reasoning in large vision language models using multimodal data."

Benchmark Development · Score: 5

Commercial Viability Breakdown (0-10 scale)

High Potential: 2.5 (1/4 signals)
Quick Build: 7.5 (3/4 signals)
Series A Potential: 0 (0/4 signals)

Sources used for this analysis

arXiv Paper: Full-text PDF analysis of the research paper
GitHub Repository: Code availability, stars, and contributor activity
Citation Network: Semantic Scholar citations and co-citation patterns
Community Predictions: Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 2/16/2026
