Build Loop | ScienceToStartup

Papers

250

With code

182

Suggested Build

148

Suggested Watch

Preview from your Build/Watch decisions. Set up Scout for daily delivery.

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Morning brief

High-conviction build candidate

SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction

Morning brief

High-conviction build candidate

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

48h review

Needs sharper wedge before committing

Saved thesis

Find deployable AI papers with public code, a proof pass, and a wedge that can ship within 6 weeks.

Novelty / saturation by cluster

Uses the current paper cohort to show whether a lane looks crowded or sparse, with named comparable papers from the same slice.

Robotics · 13 · Crowded
Not All Features Are Created Equal: A Mechanistic Study of Vision-Language-Action Models · FASTER: Rethinking Real-Time Flow VLAs

Medical AI · 13 · Crowded
ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis · LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling

LLM Training · 9 · Crowded
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation · VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models

Agents · 6 · Balanced
OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards · ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

3D Reconstruction · 5 · Balanced
SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction · MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction

LLM Agents · 5 · Balanced
MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models · D-Mem: A Dual-Process Memory System for LLM Agents

Vision-Language Models · 4 · Rarer lane
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders · Tinted Frames: Question Framing Blinds Vision-Language Models

Generative Video · 4 · Rarer lane
Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos · MeInTime: Bridging Age Gap in Identity-Preserving Face Restoration

Reinforcement Learning · 4 · Rarer lane
Context Bootstrapped Reinforcement Learning · Maximum-Entropy Exploration with Future State-Action Visitation Measures

Computer Vision · 4 · Rarer lane
VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation · PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment

Embodied AI · 3 · Rarer lane
NavTrust: Benchmarking Trustworthiness for Embodied Navigation · GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning

Generative Image · 3 · Rarer lane
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation · ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation
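
The cluster labels above look like a simple bucketing of per-cluster paper counts. The sketch below shows one way such a summary could be computed. The cutoffs are assumptions inferred only from the counts displayed here (9 and above is shown as Crowded, 5 to 6 as Balanced, 3 to 4 as Rarer lane; any Crowded cutoff from 7 to 9 would fit equally well), and the function names and the `(title, cluster)` input shape are hypothetical, not the site's actual rule or API.

```python
# Hypothetical cutoffs, chosen only because they reproduce the labels shown above.
CROWDED_MIN = 9   # assumption: any value from 7 to 9 is consistent with the data
BALANCED_MIN = 5  # assumption: 5 is labeled Balanced, 4 is labeled Rarer lane


def saturation_label(count: int) -> str:
    """Map a cluster's paper count to a saturation label."""
    if count >= CROWDED_MIN:
        return "Crowded"
    if count >= BALANCED_MIN:
        return "Balanced"
    return "Rarer lane"


def summarize(papers, n_comparables=2):
    """Group (title, cluster) pairs and return one row per cluster.

    Each row is (cluster, first n_comparables titles, count, label),
    sorted by count in descending order.
    """
    titles_by_cluster = {}
    for title, cluster in papers:
        titles_by_cluster.setdefault(cluster, []).append(title)
    rows = [
        (cluster, titles[:n_comparables], len(titles), saturation_label(len(titles)))
        for cluster, titles in titles_by_cluster.items()
    ]
    return sorted(rows, key=lambda row: row[2], reverse=True)


# Placeholder cohort: five papers in one cluster, one paper in another.
cohort = [(f"paper-{i}", "Agents") for i in range(5)]
cohort.append(("paper-5", "Embodied AI"))
rows = summarize(cohort)
```

Keeping the cutoffs as named constants makes it easy to test how sensitive the labels are to the thresholds as the cohort size changes.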

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Video Editing · 2026-03-19 · Build Now · No Code

Commercial: 100

Deployability: n/a

Reproducibility: 0

Novelty: 92

No dossier data.