PDF Viewer

BUILDER'S SANDBOX

Build This Paper

Use an AI coding agent to implement this research.

OpenAI Codex
OpenAI CodexAI Agent

Lightweight coding agent in your terminal.

Claude Code
Claude CodeAI Agent

Agentic coding tool for terminal workflows.

AntiGravity IDE
AntiGravity IDEScaffolding

AI agent mindset installer and workflow scaffolder.

Cursor
CursorIDE

AI-first code editor built on VS Code.

VS Code
VS CodeIDE

Free, open-source editor by Microsoft.

Estimated $9K - $13K over 6-10 weeks.

See exactly what it costs to build this -- with 3 comparable funded startups.

7-day free trial. Cancel anytime.

Discover the researchers behind this paper and find similar experts.

7-day free trial. Cancel anytime.

References (43)

[1]
Training Feature Attribution for Vision Models
2025Aziz Bacha, Thomas George
[2]
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
2025Runjin Chen, Andy Arditi et al.
[3]
A Survey on Interpretability in Visual Recognition
2025Qiyang Wan, Chengzhi Gao et al.
[4]
Model Organisms for Emergent Misalignment
2025Edward Turner, Anna Soligo et al.
[5]
Convergent Linear Representations of Emergent Misalignment
2025Anna Soligo, Edward Turner et al.
[6]
Qwen3 Technical Report
2025An Yang, Anfeng Li et al.
[7]
Gemma 3 Technical Report
2025Gemma Team Aishwarya Kamath, Johan Ferret et al.
[8]
Position: Curvature Matrices Should Be Democratized via Linear Operators
2025Felix Dangel, Runa Eschenhagen et al.
[9]
Do Influence Functions Work on Large Language Models?
2024Zhe Li, Wei Zhao et al.
[10]
The Llama 3 Herd of Models
2024Abhimanyu Dubey, Abhinav Jauhri et al.
[11]
Evaluating Saliency Explanations in NLP by Crowdsourcing
2024Xiaotian Lu, Jiyi Li et al.
[12]
Visual Concept Connectome (VCC): Open World Concept Discovery and Their Interlayer Connections in Deep Models
2024M. Kowal, Richard P. Wildes et al.
[13]
LESS: Selecting Influential Data for Targeted Instruction Tuning
2024Mengzhou Xia, Sadhika Malladi et al.
[14]
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models
2024Wai-Chung Kwan, Xingshan Zeng et al.
[15]
Understanding Video Transformers via Universal Concept Discovery
2024M. Kowal, Achal Dave et al.
[16]
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
2024Evan Hubinger, Carson E. Denison et al.
[17]
Studying Large Language Model Generalization with Influence Functions
2023R. Grosse, Juhan Bae et al.
[18]
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
2023Ronen Eldan, Yuan-Fang Li
[19]
OpenAssistant Conversations - Democratizing Large Language Model Alignment
2023Andreas Kopf, Yannic Kilcher et al.
[20]
Training data influence analysis and estimation: a survey
2022Zayd Hammoudeh, Daniel Lowd

Showing 20 of 43 references

Founder's Pitch

"Leverage Concept Influence to efficiently attribute language model behaviors to training data for improved model control."

Model InterpretabilityScore: 5View PDF ↗

Commercial Viability Breakdown

0-10 scale

High Potential

1/4 signals

2.5

Quick Build

3/4 signals

7.5

Series A Potential

1/4 signals

2.5

Sources used for this analysis

arXiv Paper

Full-text PDF analysis of the research paper

GitHub Repository

Code availability, stars, and contributor activity

Citation Network

Semantic Scholar citations and co-citation patterns

Community Predictions

Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 2/16/2026

Explore the full citation network and related research.

7-day free trial. Cancel anytime.

Understand the commercial significance and market impact.

7-day free trial. Cancel anytime.

Get detailed profiles of the research team.

7-day free trial. Cancel anytime.