LLM Testing

Trending

3papers

4.0viability

+100%30d

Papers

1–3 of 3

Research Paper·Mar 24, 2026

LLMORPH: Automated Metamorphic Testing of Large Language Models

Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), yet the lack of automated oracles for verifying output correctness remains a key challenge....

7.0 viability

Research Paper·Mar 25, 2026

From Untestable to Testable: Metamorphic Testing in the Age of LLMs

This article discusses the challenges of testing software systems with increasingly integrated AI and LLM functionalities. LLMs are powerful but unreliable, and labeled ground truth for testing rarely...

3.0 viability

Research Paper·Mar 1, 2026

LLM Self-Explanations Fail Semantic Invariance

We present semantic invariance testing, a method to test whether LLM self-explanations are faithful. A faithful self-report should remain stable when only the semantic context changes while the functi...

2.0 viability

LLM Testing

Papers

LLMORPH: Automated Metamorphic Testing of Large Language Models

From Untestable to Testable: Metamorphic Testing in the Age of LLMs

LLM Self-Explanations Fail Semantic Invariance

Filters