Proof partial. Core topic fields are ready, but questions or supporting reports are still catching up.
Current research on large language models (LLMs) is increasingly focused on understanding their internal mechanisms and improving their outputs for practical applications. Recent work has highlighted the importance of discourse coherence in persona discovery, revealing that deeper semantic structures govern LLM behavior rather than superficial lexical patterns. This has implications for creating more relatable AI interactions. Additionally, the emergence of verbal tics in model outputs raises concerns about the authenticity of LLM-generated text, suggesting that alignment techniques may impose a cost on naturalness. Researchers are also investigating the independence of LLMs, uncovering behavioral entanglements that could compromise multi-model systems. Furthermore, new frameworks for analyzing representation drift and reasoning trajectories aim to enhance model reliability and performance. Together, these efforts address commercial challenges in deploying LLMs across various sectors, from healthcare to social media, by striving for more nuanced, coherent, and independent AI systems that can better engage with human users.
Topic-specific paper and score movement from the daily diff ledger.
Mixture-of-Experts (MoE) models scale capacity by combining specialized experts, but most existing approaches assume centralized access to training data. In practice, data are distributed across clien...
Recent work has demonstrated the curse of depth in large language models (LLMs), where later layers contribute less to learning and representation than earlier layers. Such under-utilization is linked...
We present Marco-MoE, a suite of fully open multilingual sparse Mixture-of-Experts (MoE) models. Marco-MoE features a highly sparse design in which only around 5\% of the total parameters are activate...
Tokenization is a central component of natural language processing in current large language models (LLMs), enabling models to convert raw text into processable units. Although learned tokenizers are ...
Large Language Models (LLMs) have become the central paradigm in artificial intelligence, yet the core computational primitive of attention has remained structurally unchanged. Local Linear Attention ...
Continual fine-tuning of large language models (LLMs) is becoming increasingly crucial as these models are deployed in dynamic environments where tasks and data distributions evolve over time. While s...
Deploying Large Language Models (LLMs) for discriminative workloads is often limited by inference latency, compute, and API costs at scale. Active distillation reduces these costs by querying an LLM o...
On-policy self-distillation (self-OPD) densifies reinforcement learning with verifiable rewards (RLVR) by letting a policy teach itself under privileged context. We find that when this guidance spans ...
The cosine similarity between a large language model's hidden activations before and after Supervised Fine-Tuning (SFT) remains very high. This, at first glance, suggests that SFT leaves the model's a...
Large language models (LLMs) often hallucinate, producing fluent but false information, partly because supervised fine-tuning (SFT) implicitly rewards always responding. We introduce $\textit{HypoTerm...
Freshness
Canonical route: /topics
Agent Handoff
Canonical ID llm-analysis | Route /topic/llm-analysis
REST example
curl https://sciencetostartup.com/api/v1/agent-handoff/topic/llm-analysisMCP example
{
"tool": "search_papers",
"arguments": {
"query": "LLM Analysis",
"cluster": "LLM Analysis"
}
}source_context
{
"surface": "topic",
"mode": "topic",
"query": "LLM Analysis",
"normalized_query": "llm-analysis",
"route": "/topic/llm-analysis",
"paper_ref": null,
"topic_slug": "llm-analysis",
"benchmark_ref": null,
"dataset_ref": null
}Use This Via API or MCP
Topic pages bundle paper counts, viability trends, author concentration, and top questions into one canonical surface your agents can reference before they open Signal Canvas or create a workspace.