PDF Viewer

BUILDER'S SANDBOX

Build This Paper

Use an AI coding agent to implement this research.

OpenAI Codex
OpenAI CodexAI Agent

Lightweight coding agent in your terminal.

Claude Code
Claude CodeAI Agent

Agentic coding tool for terminal workflows.

AntiGravity IDE
AntiGravity IDEScaffolding

AI agent mindset installer and workflow scaffolder.

Cursor
CursorIDE

AI-first code editor built on VS Code.

VS Code
VS CodeIDE

Free, open-source editor by Microsoft.

Estimated $10K - $14K over 6-10 weeks.

See exactly what it costs to build this -- with 3 comparable funded startups.

7-day free trial. Cancel anytime.

Discover the researchers behind this paper and find similar experts.

7-day free trial. Cancel anytime.

References (24)

[1]
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs
2025Wang Wei, Tiankai Yang et al.
[2]
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
2025Haozhen Zhang, Tao Feng et al.
[3]
Performance Aware LLM Load Balancer for Mixed Workloads
2025Kunal Jain, Anjaly Parayil et al.
[4]
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
2025Zhongzhan Huang, Guoming Ling et al.
[5]
OmniRouter: Budget and Performance Controllable Multi-LLM Routing
2025Kai Mei, Wujiang Xu et al.
[6]
SAGESERVE: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling
2025Shashwat Jaiswal, Kunal Jain et al.
[7]
CARROT: A Cost Aware Rate Optimal Router
2025Seamus Somerstep, Felipe Maia Polo et al.
[8]
RouteLLM: Learning to Route LLMs with Preference Data
2024Isaac Ong, Amjad Almahairi et al.
[9]
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
2024Dujian Ding, Ankur Mallick et al.
[10]
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
2024Sohaib Ahmad, Hui Guan et al.
[11]
RouterBench: A Benchmark for Multi-LLM Routing System
2024Qitian Jason Hu, Jacob Bieker et al.
[12]
AutoMix: Automatically Mixing Language Models
2023Aman Madaan, Pranjal Aggarwal et al.
[13]
Efficient Memory Management for Large Language Model Serving with PagedAttention
2023Woosuk Kwon, Zhuohan Li et al.
[14]
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
2023Dongfu Jiang, Xiang Ren et al.
[15]
Holistic Evaluation of Language Models
2023Percy Liang, Rishi Bommasani et al.
[16]
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
2023Lingjiao Chen, M. Zaharia et al.
[17]
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
2021Pengcheng He, Jianfeng Gao et al.
[18]
INFaaS: Automated Model-less Inference Serving
2021Francisco Romero, Qian Li et al.
[19]
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
2020Adam Stooke, Joshua Achiam et al.
[20]
Reward Constrained Policy Optimization
2018Chen Tessler, D. Mankowitz et al.

Showing 20 of 24 references

Founder's Pitch

"PROTEUS optimizes LLM routing for SLA targets, achieving high accuracy and cost savings."

LLM OptimizationScore: 7View PDF ↗

Commercial Viability Breakdown

0-10 scale

High Potential

2/4 signals

5

Quick Build

4/4 signals

10

Series A Potential

1/4 signals

2.5

Sources used for this analysis

arXiv Paper

Full-text PDF analysis of the research paper

GitHub Repository

Code availability, stars, and contributor activity

Citation Network

Semantic Scholar citations and co-citation patterns

Community Predictions

Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 1/27/2026

Explore the full citation network and related research.

7-day free trial. Cancel anytime.

Understand the commercial significance and market impact.

7-day free trial. Cancel anytime.

Get detailed profiles of the research team.

7-day free trial. Cancel anytime.