LLM Fine-Tuning

6papers

4.3viability

State of the Field

Recent advancements in fine-tuning large language models (LLMs) are addressing critical challenges related to safety, efficiency, and performance. Researchers are increasingly focused on balancing safety alignment with task utility, as traditional fine-tuning methods often compromise one for the other. New approaches, such as safety-preserving fine-tuning, aim to maintain safety without sacrificing performance, effectively mitigating risks like jailbreak attacks. Concurrently, memory efficiency has emerged as a pressing concern, with techniques like instance-aware token ditching demonstrating significant reductions in memory usage while preserving or enhancing task performance. Additionally, the exploration of parameter-efficient fine-tuning strategies is gaining traction, particularly in optimizing layer selection to minimize costs and improve deployment efficiency. These innovations not only enhance the adaptability of LLMs for various applications but also pave the way for safer and more resource-conscious implementations in commercial settings, such as customer service automation and content generation.

Last updated Feb 28, 2026

Papers

1–4 of 4

Research Paper·Mar 2, 2026

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Fine-tuning large language models for vertical domains remains a labor-intensive and expensive process, requiring domain experts to curate data, configure training, and iteratively diagnose model beha...

8.0 viability

Research Paper·Jan 15, 2026

Understanding and Preserving Safety in Fine-Tuned LLMs

Fine-tuning is an essential and pervasive functionality for applying large language models (LLMs) to downstream tasks. However, it has the potential to substantially degrade safety alignment, e.g., by...

3.0 viability

Research Paper·Jan 27, 2026

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Fine tuning has been regarded as a de facto approach for adapting large language models (LLMs) to downstream tasks, but the high training memory consumption inherited from LLMs makes this process inef...

3.0 viability

Research Paper·Feb 3, 2026·B2B

Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models

As large language models (LLMs) continue to grow, the cost of full-parameter fine-tuning has made parameter-efficient fine-tuning (PEFT) the default strategy for downstream adaptation. Constraints fro...

3.0 viability

LLM Fine-Tuning

State of the Field

Papers

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Understanding and Preserving Safety in Fine-Tuned LLMs

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models

Filters