State of the Field
Current AI research increasingly focuses on extending the capabilities of large language models (LLMs) and their underlying architectures, particularly in out-of-distribution generalization and task-oriented communication. Recent work has revealed significant limitations in LLMs' ability to generalize periodic patterns, prompting investigations into their underlying reasoning processes and the development of new evaluation frameworks. Researchers are also exploring novel normalization techniques to improve transformer stability and performance, while advances in latent-space regularization show promise for preference learning from human feedback. Investigations into task-oriented communication protocols highlight both the efficiency and the potential opacity of LLM interactions, raising important questions about transparency in AI systems. Together, these studies converge on a finer-grained understanding of model behavior, with the aim of addressing practical challenges in deploying AI systems that require robust reasoning and effective communication in dynamic environments.
Papers
Do Transformers Have the Ability for Periodicity Generalization?
Large language models (LLMs) based on the Transformer have demonstrated strong performance across diverse tasks. However, current models still exhibit substantial limitations in out-of-distribution (O...
Enhanced QKNorm normalization for neural transformers with the Lp norm
The normalization of query and key vectors is an essential part of the Transformer architecture. It ensures that learning is stable regardless of the scale of these vectors. Some normalization approac...
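To make the normalization idea concrete: the standard QKNorm baseline L2-normalizes query and key vectors before the dot product, which bounds the attention logits regardless of vector scale. A minimal NumPy sketch of that baseline, generalized to an Lp norm as the title suggests (the function names and the fixed `scale` factor are illustrative, not taken from the paper):

```python
import numpy as np

def lp_normalize(x, p=2.0, eps=1e-6):
    """Normalize each row of x by its Lp norm (p=2 recovers standard QKNorm)."""
    norm = np.power(np.sum(np.abs(x) ** p, axis=-1, keepdims=True), 1.0 / p)
    return x / (norm + eps)

def qknorm_attention(q, k, v, p=2.0, scale=10.0):
    """Dot-product attention over Lp-normalized queries and keys.

    Because q and k have (approximately) unit norm, logits are bounded
    and a learnable scale replaces the usual 1/sqrt(d) factor.
    """
    q, k = lp_normalize(q, p), lp_normalize(k, p)
    logits = scale * (q @ k.T)
    # numerically stable softmax over the key axis
    weights = np.exp(logits - logits.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v
```

Varying `p` away from 2 changes how outlier coordinates contribute to the norm, which is the design axis the paper appears to explore.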
GPT-4o Lacks Core Features of Theory of Mind
Do Large Language Models (LLMs) possess a Theory of Mind (ToM)? Research into this question has focused on evaluating LLMs against benchmarks and found success across a range of social tasks. However,...
Investigating the Development of Task-Oriented Communication in Vision-Language Models
We investigate whether LLM-based agents can develop task-oriented communication protocols that differ from standard natural language in collaborative reasoning tasks. Our focus is on two core p...
Latent Adversarial Regularization for Offline Preference Optimization
Learning from human feedback typically relies on preference optimization that constrains policy updates through token-level regularization. However, preference optimization for language models is part...
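For context on the token-level regularization this abstract refers to, a widely used baseline is the DPO loss: its beta term implicitly KL-constrains the policy toward a reference model, and because each sequence log-probability is a sum of per-token log-probabilities, that constraint acts token by token. A minimal sketch of the baseline, not of the paper's latent-space method:

```python
import numpy as np

def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """DPO loss on one preference pair.

    Each argument is a sequence log-probability (a sum of per-token
    log-probs), so the implicit KL penalty toward the reference model
    operates at the token level.
    """
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    return -np.log(1.0 / (1.0 + np.exp(-margin)))  # -log sigmoid(margin)
```

When the policy's preference margin over the reference is zero, the loss is log 2; widening the margin on the chosen response drives it toward zero.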
Controllable Information Production
Intrinsic Motivation (IM) is a paradigm for generating intelligent behavior without external utilities. The existing information-theoretic methods for IM are predominantly based on information transmi...
How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?
Latent reasoning has been recently proposed as a reasoning paradigm and performs multi-step reasoning through generating steps in the latent space instead of the textual space. This paradigm enables r...
Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures
Transformers excel at in-context retrieval but suffer from quadratic complexity with sequence length, while State Space Models (SSMs) offer efficient linear-time processing but have limited retrieval ...
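The complexity contrast in the abstract above can be made concrete: self-attention forms a T×T score matrix (O(T²) in sequence length T), while an SSM processes the sequence with a single O(T) recurrence. A toy sketch of the linear-time scan for a diagonal state-space model (shapes and names are illustrative, not from the paper):

```python
import numpy as np

def ssm_scan(a, b, c, u):
    """Linear-time scan of a diagonal SSM over a 1-D input sequence u.

    Recurrence: h_t = a * h_{t-1} + b * u_t, output y_t = c . h_t.
    One pass over the sequence gives O(T) cost, versus the O(T^2)
    pairwise score matrix that self-attention materializes.
    """
    h = np.zeros_like(b, dtype=float)
    ys = []
    for u_t in u:
        h = a * h + b * u_t   # elementwise state update (diagonal A)
        ys.append(float(c @ h))
    return np.array(ys)
```

With `a = 0`, the state carries no history and each output depends only on the current input, which illustrates why retrieval over long contexts is harder for SSMs: everything must pass through a fixed-size state `h`.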