Papers
CSRv2: Unlocking Ultra-Sparse Embeddings
In the era of large foundation models, the quality of embeddings has become a central determinant of downstream task performance and overall system capability. Yet widely used dense embeddings are often...
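The preview cuts off before CSRv2's method, but for a concrete picture of what "ultra-sparse" means, here is the generic post-hoc route from a dense vector to a sparse one: keep only the k largest-magnitude coordinates. This is a minimal sketch under that assumption; the function and the k=32 setting are illustrative and not taken from the paper.

```python
import numpy as np

def topk_sparsify(emb: np.ndarray, k: int = 32) -> np.ndarray:
    """Zero all but the k largest-magnitude coordinates of a dense embedding.

    Illustrative baseline only; CSRv2's actual procedure is not shown in
    the abstract preview above.
    """
    out = np.zeros_like(emb)
    idx = np.argpartition(np.abs(emb), -k)[-k:]  # indices of the top-k magnitudes
    out[idx] = emb[idx]
    return out

dense = np.random.randn(4096)
sparse = topk_sparsify(dense, k=32)  # 32 of 4096 coordinates survive
```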
UAT-LITE: Inference-Time Uncertainty-Aware Attention for Pretrained Transformers
Neural NLP models are often miscalibrated, assigning high confidence to incorrect predictions, which undermines selective prediction and high-stakes deployment. Post-hoc calibration methods adjust output...
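For context on the baseline family the abstract contrasts against, the best-known post-hoc calibration method is temperature scaling: fit a single scalar T on held-out logits by minimizing negative log-likelihood, then divide test-time logits by T. A minimal sketch follows (a grid search stands in for the usual optimizer); note that UAT-LITE itself intervenes in attention at inference time, which this baseline does not show.

```python
import numpy as np

def softmax(z: np.ndarray) -> np.ndarray:
    z = z - z.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def temperature_scale(test_logits, val_logits, val_labels):
    """Fit one temperature T on validation data, then rescale test logits."""
    def nll(T):
        p = softmax(val_logits / T)
        return -np.log(p[np.arange(len(val_labels)), val_labels] + 1e-12).mean()
    grid = np.linspace(0.5, 5.0, 91)  # coarse grid over plausible T values
    T = grid[np.argmin([nll(t) for t in grid])]
    return softmax(test_logits / T)
```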
Tabula RASA: Exposing and Breaking the Relational Bottleneck in Transformers
Transformers achieve remarkable performance across many domains, yet struggle with tasks requiring multi-hop relational reasoning over structured data. We analyze this limitation through circuit complexity...
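To make "multi-hop relational reasoning over structured data" concrete, the canonical probe has the shape sketched below: the model sees atomic facts and must compose two of them to answer a query. This toy generator is hypothetical (the predicate names and setup are not Tabula RASA's benchmark); it only illustrates the task shape.

```python
import random

def two_hop_sample(n_entities: int = 20):
    """Generate one two-hop query: from parent(a, b) and parent(b, c),
    the correct answer to grandparent(a, ?) is c."""
    a, b, c = random.sample(range(n_entities), 3)
    facts = [("parent", a, b), ("parent", b, c)]
    random.shuffle(facts)  # fact order should not matter to the model
    return facts, ("grandparent", a), c

facts, query, answer = two_hop_sample()
# A transformer must chain the two parent edges in one forward pass;
# the abstract's claim is that transformers struggle with exactly
# this kind of composition.
```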
Rotary Positional Embeddings as Phase Modulation: Theoretical Bounds on the RoPE Base for Long-Context Transformers
Rotary positional embeddings (RoPE) are widely used in large language models to encode token positions through multiplicative rotations, yet their behavior at long context lengths remains poorly characterized...
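The mechanism itself is standard and worth seeing in full: RoPE rotates each pair of embedding dimensions (2i, 2i+1) by the angle pos * base^(-2i/d), so the base parameter sets the spectrum of rotation frequencies, which is the quantity the title's bounds concern. A minimal numpy sketch of the rotation (the default base of 10000 follows common practice, not this paper):

```python
import numpy as np

def rope(x: np.ndarray, base: float = 10000.0) -> np.ndarray:
    """Apply rotary positional embeddings to x of shape (seq_len, d), d even.

    Each dimension pair (2i, 2i+1) is rotated by pos * base**(-2i/d):
    equivalently, multiplied as a complex number by a position-dependent
    phase, which is the "phase modulation" view in the title.
    """
    seq_len, d = x.shape
    assert d % 2 == 0, "embedding dimension must be even"
    pos = np.arange(seq_len)[:, None]          # (seq_len, 1) positions
    freqs = base ** (-np.arange(0, d, 2) / d)  # (d/2,) per-pair frequencies
    angles = pos * freqs                       # (seq_len, d/2) rotation angles
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]            # even / odd halves of each pair
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin         # standard 2x2 rotation per pair
    out[:, 1::2] = x1 * sin + x2 * cos
    return out
```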