Papers
1–3 of 3Research Paper·Jan 15, 2026
MathDoc: Benchmarking Structured Extraction and Active Refusal on Noisy Mathematics Exam Papers
The automated extraction of structured questions from paper-based mathematics exams is fundamental to intelligent education, yet remains challenging in real-world settings due to severe visual noise. ...
7.0 viability
Research Paper·Jan 29, 2026
Search-Based Risk Feature Discovery in Document Structure Spaces under a Constrained Budget
Enterprise-grade Intelligent Document Processing (IDP) systems support high-stakes workflows across finance, insurance, and healthcare. Early-phase system validation under limited budgets mandates unc...
5.0 viability
Research Paper·Feb 17, 2026
DocSplit: A Comprehensive Benchmark Dataset and Evaluation Approach for Document Packet Recognition and Splitting
Document understanding in real-world applications often requires processing heterogeneous, multi-page document packets containing multiple documents stitched together. Despite recent advances in visua...
5.0 viability