Visual Model Checking: Graph-Based Inference of Visual Routines for Image Retrieval

Estimated build cost: $9K - $13K over 6-10 weeks.

Founder's Pitch

"Integrate formal verification with image retrieval for transparent and accountable query processing."

Image Retrieval · Score: 3

Commercial Viability Breakdown (0-10 scale)

High Potential: 0 (0/4 signals)

Quick Build: 0 (0/4 signals)

Series A Potential: 0 (0/4 signals)

Sources used for this analysis

arXiv Paper: Full-text PDF analysis of the research paper

GitHub Repository: Code availability, stars, and contributor activity

Citation Network: Semantic Scholar citations and co-citation patterns

Community Predictions: Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 2/19/2026
