LLM Control

Trending

4 papers

5.0 viability

+100% 30d

Papers

Research Paper · Mar 19, 2026

WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior

Precise behavioral control of large language models (LLMs) is critical for complex applications. However, existing methods often incur high training costs, lack natural language controllability, or co...

7.0 viability

Research Paper · Jan 29, 2026

The Effectiveness of Style Vectors for Steering Large Language Models: A Human Evaluation

Controlling the behavior of large language models (LLMs) at inference time is essential for aligning outputs with human abilities and safety requirements. Activation steering provides a lightwe...

6.0 viability

Research Paper · Mar 24, 2026

Steering Code LLMs with Activation Directions for Language and Library Control

Code LLMs often default to particular programming languages and libraries under neutral prompts. We investigate whether these preferences are encoded as approximately linear directions in activation s...

4.0 viability

Research Paper · Jan 8, 2026

Compositional Steering of Large Language Models with Steering Tokens

Deploying LLMs in real-world applications requires controllable output that satisfies multiple desiderata at the same time. While existing work extensively addresses LLM steering for a single behavior...

3.0 viability