Memento-Skills: Let Agents Design Agents Build Now
Memento-Skills is a self-improving LLM system that autonomously designs task-specific agents.
Agents Mar 19 Pending High viability
SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction Build Now
Adapt existing visual geometric transformers for enhanced RGB-Thermal 3D reconstruction.
3D Reconstruction Mar 19 Pending High viability
HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models Build Now
Optimize video question answering efficiency with HORNet's advanced frame selection for vision-language models.
Video Question Answering Mar 19 Pending High viability
EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation Build Now
EdgeCrafter offers a compact ViT framework for efficient dense prediction on edge devices, outperforming traditional models.
Compact Vision Transformers Mar 19 Code High viability
MeInTime: Bridging Age Gap in Identity-Preserving Face Restoration Build Now
MeInTime is a diffusion-based face restoration method that enhances identity fidelity and age consistency using cross-age references.
Face Restoration Mar 19 Pending High viability
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens Build Now
CubiD is a novel discrete generation model that enhances visual generation using high-dimensional representation tokens.
Generative Visual Models Mar 19 Pending High viability
Not All Features Are Created Equal: A Mechanistic Study of Vision-Language-Action Models Build Now
A mechanistic study of Vision-Language-Action models revealing insights into multimodal input translation for robotics.
Vision-Language-Action Models Mar 19 Code High viability
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing Build Now
SAMA revolutionizes instruction-guided video editing by factorizing semantic anchoring and motion alignment for superior performance.
Instruction-Guided Video Editing Mar 19 Code High viability
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World Build Now
F2LLM-v2 offers efficient multilingual embeddings with state-of-the-art performance for over 200 languages.
Multilingual Embeddings Mar 19 Code High viability
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Build Now
Nemotron-Cascade 2 is a compact yet powerful open-weight LLM that excels in reasoning and agentic tasks, backed by a robust training methodology.
LLM Training Mar 19 Code High viability
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising Build Now
DreamPartGen enables semantically grounded, part-aware text-to-3D generation for coherent and interpretable 3D object synthesis.
3D Generation Mar 19 Code High viability
Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting Build Now
Splat2BEV enhances autonomous driving perception by integrating explicit 3D reconstruction for superior BEV representation.
Autonomous Driving Mar 19 Code High viability
Sparse Autoencoders Reveal Interpretable and Steerable Features in VLA Models Build Now
A mechanistic interpretability approach to enhance generalization in Vision-Language-Action models for robot manipulation.
Robotics Mar 19 Code High viability
ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis Build Now
ARIADNE is a novel framework for reliable coronary angiography analysis that enhances vessel segmentation through preference-aligned perception and reasoning.
Medical AI Mar 19 Code High viability
On Optimizing Multimodal Jailbreaks for Spoken Language Models Build Now
JAMA optimizes multimodal jailbreaks for spoken language models, enhancing security against adversarial prompts.
Security in AI Mar 19 Code High viability
TAU-R1: Visual Language Model for Traffic Anomaly Understanding Build Now
TAU-R1 is a vision-language model designed to enhance traffic anomaly understanding using a specialized dataset and innovative training strategies.
Traffic Anomaly Detection Mar 19 Pending High viability
Multi-Modal Building Change Detection for Large-Scale Small Changes: Benchmark and Baseline Build Now
A benchmark dataset and network for accurate building change detection using multi-modal RGB-NIR imagery.
Change Detection Mar 19 Pending High viability
CAMO: A Conditional Neural Solver for the Multi-objective Multiple Traveling Salesman Problem Build Now
CAMO is a conditional neural solver that optimizes multi-agent coordination for complex multi-objective traveling salesman problems.
Robotics Optimization Mar 19 Code High viability
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Build Now
TerraScope revolutionizes earth observation with pixel-grounded visual reasoning for enhanced geospatial analysis.
Geospatial AI Mar 19 Code High viability
ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning Build Now
ATG-MoE revolutionizes robotic assembly by integrating multi-modal inputs for efficient trajectory generation and skill learning.
Robotic Assembly Mar 19 Code High viability
Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token Build Now
A novel segmentation method leveraging a single segmentation token in Multi-modal Large Language Models for enhanced object-level segmentation.
Computer Vision Mar 19 Pending High viability
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time? Build Now
MultiTempBench is a multilingual benchmark for evaluating temporal reasoning in large language models across various languages and calendar systems.
Temporal Reasoning Mar 19 Pending High viability
Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval Watch
Hypothesis-Conditioned Query Rewriting enhances decision-making in retrieval-augmented generation by refining query strategies for better evidence retrieval.
Retrieval-Augmented Generation Mar 19 High viability
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think Build Now
CRAFT is a lightweight fine-tuning paradigm for diffusion models that drastically reduces data requirements while enhancing efficiency.
Diffusion Models Mar 19 Code High viability
PRIOR: Perceptive Learning for Humanoid Locomotion with Reference Gait Priors Build Now
PRIOR is an efficient framework for humanoid locomotion that achieves robust terrain traversal with human-like gaits using a simple design.
Humanoid Robotics Mar 19 Code High viability
PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment Build Now
PromptHub enhances visual in-context learning through locality-aware fusion and alignment techniques.
Visual Learning Mar 19 Pending High viability
DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning Build Now
DriftGuard is a federated continual learning framework that efficiently mitigates asynchronous data drift in resource-constrained environments.
Federated Learning Mar 19 Pending High viability
RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models Build Now
RewardFlow enhances agentic reasoning in large language models through topology-aware reward propagation on state graphs.
Agents Mar 19 Pending High viability
Statistical Characteristic-Guided Denoising for Rapid High-Resolution Transmission Electron Microscopy Imaging Build Now
A novel denoising network that enhances high-resolution transmission electron microscopy imaging by effectively reducing noise while preserving atomic details.
Imaging Denoising Mar 19 Pending High viability
Agent Control Protocol: Admission Control for Agent Actions Build Now
Agent Control Protocol provides a robust governance framework for autonomous agents in B2B environments through cryptographic admission control.
Agent Governance Mar 19 Pending High viability
V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors Build Now
V-Dreamer automates the creation of diverse robotic simulation environments and trajectories from natural language, enhancing robot training efficiency.
Robotic Simulation Mar 19 Code High viability
Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation Build Now
Perceptio enhances vision language models with spatial reasoning through innovative token generation.
Vision Language Models Mar 19 Code High viability
Mi:dm K 2.5 Pro is a flagship LLM optimized for enterprise-grade complexity and Korean-language understanding.
LLM Training Mar 19 Code High viability
ProCal: Probability Calibration for Neighborhood-Guided Source-Free Domain Adaptation Build Now
ProCal is a novel probability calibration method for enhancing source-free domain adaptation by balancing knowledge retention and local noise mitigation.
Domain Adaptation Mar 19 Pending High viability
NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics Build Now
NeuroGame Transformer redefines attention mechanisms using game theory and statistical physics for improved performance in NLP tasks.
Transformers Mar 19 Pending High viability
CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks Build Now
CausalRM offers a scalable solution for reward modeling in RLHF using observational user feedback to improve alignment of language models.
Reinforcement Learning Mar 19 Code High viability
MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution Build Now
MemMA is a multi-agent framework that enhances memory management in LLMs through coordinated reasoning and self-evolution.
Memory Augmentation Mar 19 Pending High viability
Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels Build Now
SCNP enhances image segmentation by improving topology accuracy through neighbor pixel penalization.
Image Segmentation Mar 19 Code High viability
Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning Build Now
A framework that enhances geometric reasoning in MLLMs through strategic visual aid construction and optimization.
Geometric Reasoning Mar 19 Code High viability
Multiscale Switch for Semi-Supervised and Contrastive Learning in Medical Ultrasound Image Segmentation Build Now
Switch is a novel semi-supervised learning framework for robust ultrasound image segmentation that outperforms existing methods with limited labeled data.
Medical AI Mar 19 Pending High viability
Benchmarking PDF Parsers on Table Extraction with LLM-based Semantic Evaluation Build Now
A benchmarking framework for evaluating PDF parsers using LLM-based semantic evaluation to enhance table extraction accuracy.
Document Processing Mar 19 Pending High viability
Click-to-Ask: An AI Live Streaming Assistant with Offline Copywriting and Online Interactive QA Build Now
Click-to-Ask is an AI assistant that enhances live streaming commerce by providing real-time responses and generating promotional copy.
Live Streaming Commerce Mar 19 Code High viability
PhysVideo: Physically Plausible Video Generation with Cross-View Geometry Guidance Build Now
PhysVideo generates physically plausible videos by leveraging cross-view geometry guidance for enhanced realism.
Generative Video Mar 19 Code High viability
MOSAIC: Multi-Objective Slice-Aware Iterative Curation for Alignment Build Now
MOSAIC is a multi-objective framework for optimizing dataset curation to enhance AI safety and instruction following.
Data Curation Mar 19 Pending High viability
Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation Build Now
AFS-Search revolutionizes Text-to-Image generation with a training-free, closed-loop framework that enhances performance and speed.
Text-to-Image Generation Mar 19 Code High viability
Matryoshka Gaussian Splatting Build Now
Matryoshka Gaussian Splatting enables continuous level of detail rendering for 3D scenes without sacrificing quality.
3D Rendering Mar 19 Code High viability
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction Build Now
MonoArt offers a unified framework for efficient and accurate 3D reconstruction of articulated objects from single images.
3D Reconstruction Mar 19 Code High viability
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer Build Now
A novel framework that enhances motion generation by integrating semantic and kinematic conditions through a diffusion-based discrete motion tokenizer.
Motion Generation Mar 19 Pending High viability
NavTrust: Benchmarking Trustworthiness for Embodied Navigation Build Now
NavTrust is a benchmark for evaluating and enhancing the trustworthiness of embodied navigation systems under realistic conditions.
Embodied Navigation Mar 19 Code High viability
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing Build Now
EffectErase offers a novel solution for high-quality video object removal and insertion using a comprehensive dataset and advanced learning techniques.
Video Editing Mar 19 Code High viability
RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing Build Now
RPiAE enhances image generation and editing by improving reconstruction fidelity through a novel representation-based tokenizer.
Image Generation and Editing Mar 19 Code High viability
Tinted Frames: Question Framing Blinds Vision-Language Models Build Now
A lightweight prompt-tuning method to enhance attention allocation in Vision-Language Models for improved visual reasoning.
Vision-Language Models Mar 19 Code High viability
FASTER: Rethinking Real-Time Flow VLAs Build Now
FASTER enhances real-time responsiveness in Vision-Language-Action models by optimizing action sampling for immediate reactions.
Vision-Language-Action Mar 19 Code High viability
MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data Build Now
A challenge evaluating the privacy resilience of synthetic tabular data generated by diffusion models against membership inference attacks.
Synthetic Data Privacy Mar 19 Pending High viability
Few-shot Acoustic Synthesis with Multimodal Flow Matching Build Now
FLAC is a few-shot acoustic synthesis method that generates room impulse responses using a probabilistic flow-matching approach.
Audio Synthesis Mar 19 Code High viability
ADMM-Based Distributed MPC with Control Barrier Functions for Safe Multi-Robot Quadrupedal Locomotion Build Now
A decentralized MPC framework for safe trajectory planning in multi-robot quadrupedal systems.
Robotics Control Mar 19 Code High viability
Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation Build Now
MAPG enables robots to convert natural language commands into actionable decisions in 3D environments through probabilistic grounding.
Agents Mar 19 Code High viability
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation Build Now
A unified framework for stable and target-faithful image generation in low-density regions using adaptive prompt blending.
Generative Image Mar 19 Code High viability
ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation Build Now
ADAPT is a training-free framework that enhances text-to-image synthesis by providing deterministic control over rare concept generation.
Text-to-Image Synthesis Mar 19 Code High viability
D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding Build Now
D5P4 enhances discrete diffusion decoding by improving candidate diversity through a novel beam-search framework.
Discrete Diffusion Models Mar 19 Code High viability
Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection Build Now
A novel approach to enhance continual representation learning using a data-guided Random Projection Layer.
Continual Learning Mar 19 Code High viability
Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning Control Build Now
An adaptive stock price prediction framework that utilizes autoencoders and dual node transformers to enhance accuracy during volatile market conditions.
Stock Market Prediction Mar 19 Code High viability
Introducing M: A Modular, Modifiable Social Robot Build Now
M is an open-source social robot platform that simplifies the reproduction and modification of robots for research and real-world applications.
Social Robotics Mar 19 Code High viability
Revisiting Autoregressive Models for Generative Image Classification Build Now
A novel autoregressive model for generative image classification that outperforms diffusion models in efficiency and accuracy.
Generative Image Classification Mar 19 Code High viability
CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization Build Now
CustomTex enables high-fidelity, customizable 3D indoor scene texturing using reference images for precise instance-level control.
3D Scene Texturing Mar 19 Code High viability
FedTrident: Resilient Road Condition Classification Against Poisoning Attacks in Federated Learning Build Now
FedTrident enhances road condition classification in federated learning by effectively countering targeted label-flipping attacks.
Federated Learning Security Mar 19 Code High viability
DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering Build Now
DaPT is a novel multilingual framework that enhances retrieval-augmented generation for multi-hop question answering across languages.
Multilingual QA Mar 19 Code High viability
Articulated-Body Dynamics Network: Dynamics-Grounded Prior for Robot Learning Build Now
A graph neural network that enhances robot learning by incorporating dynamics properties for improved policy efficiency.
Robotics Mar 19 Code High viability
DROID-SLAM in the Wild Build Now
DROID-SLAM optimizes real-time 3D mapping for autonomous drones navigating complex environments.
Robotics Vision Mar 19 Pending High viability
Fire as a Service: Augmenting Robot Simulators with Thermally and Visually Accurate Fire Dynamics Build Now
Fire as a Service enhances robot simulators with accurate fire dynamics for safer firefighting training.
Robotics Simulation Mar 19 Code High viability
SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation Build Now
SignAgent leverages LLMs for efficient and linguistically-grounded sign language annotation and dataset curation.
Sign Language Processing Mar 19 Code High viability
Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding Build Now
Em-Garde is a framework that enhances proactive video understanding by efficiently matching user queries to visual proposals.
Streaming Video Understanding Mar 19 Code High viability
SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation Build Now
SwiftTailor revolutionizes 3D garment generation with a fast and efficient geometry image representation.
3D Garment Generation Mar 19 Code High viability
FUMO: Prior-Modulated Diffusion for Single Image Reflection Removal Build Now
FUMO leverages prior modulation in diffusion models for effective single image reflection removal.
Image Processing Mar 19 Pending High viability
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Build Now
A novel framework for debiasing vision-language models using sparse embedding modulation.
Vision-Language Debiasing Mar 19 Code High viability
Behavioral Fingerprints for LLM Endpoint Stability and Identity Build Now
Stability Monitor provides a black-box solution for monitoring the behavioral consistency of AI model endpoints.
Model Monitoring Mar 19 Code High viability
Generalized Hand-Object Pose Estimation with Occlusion Awareness Build Now
GenHOI offers a novel framework for accurate hand-object pose estimation under occlusion using hierarchical semantic knowledge.
Pose Estimation Mar 19 Code High viability
AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science Build Now
AgentDS benchmarks AI agents against human performance in domain-specific data science tasks.
Human-AI Collaboration Mar 19 Code High viability
Unmasking Algorithmic Bias in Predictive Policing: A GAN-Based Simulation Framework with Multi-City Temporal Analysis Build Now
A GAN-based framework to analyze and mitigate racial bias in predictive policing systems using multi-city crime data.
Algorithmic Bias Analysis Mar 19 Code High viability
Balancing Performance and Fairness in Explainable AI for Anomaly Detection in Distributed Power Plants Monitoring Build Now
A supervised ML framework for reliable anomaly detection in power plant monitoring that balances performance, interpretability, and fairness.
Explainable AI Mar 19 Code High viability
Context Bootstrapped Reinforcement Learning Build Now
Context Bootstrapped Reinforcement Learning enhances exploration efficiency in RL by integrating few-shot demonstrations into training.
Reinforcement Learning Mar 19 Code High viability
Unsupervised Contrastive Learning for Efficient and Robust Spectral Shape Matching Build Now
A novel unsupervised contrastive learning approach for efficient and robust 3D shape matching.
3D Shape Matching Mar 19 Code High viability
Lightweight Model Predictive Control for Spacecraft Rendezvous Attitude Synchronization Build Now
Lightweight model predictive control solutions for real-time spacecraft attitude synchronization.
Spacecraft Control Mar 19 Code High viability
GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting Build Now
Create realistic hand-object interaction simulations using RGB videos with GHOST's Gaussian Splatting technique.
Computer Vision - Hand Tracking Mar 19 Pending High viability
Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs Build Now
A Semantic-Augmented DRL framework that enhances UAV deployment strategies in VANETs by integrating LLMs for improved connectivity.
UAV and Network Optimization Mar 19 Code High viability
Confidential Databases Without Cryptographic Mappings Build Now
FEDB revolutionizes confidential databases by eliminating cryptographic overhead for secure queries in cloud environments.
Database Security Mar 19 Code High viability
Detecting Basic Values in A Noisy Russian Social Media Text Data: A Multi-Stage Classification Framework Build Now
A multi-stage classification framework for detecting human values in noisy Russian social media text using LLMs.
NLP Mar 19 Code High viability
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models Build Now
dTRPO enhances policy optimization in diffusion large language models for improved efficiency and performance.
LLM Training Mar 19 Code High viability
Functional Subspace Watermarking for Large Language Models Build Now
A robust watermarking framework for large language models that ensures ownership protection without compromising performance.
Model Watermarking Mar 19 Code High viability
Weaver: Fuzzing JavaScript Engines at the JavaScript-WebAssembly Boundary Build Now
Weaver is a greybox fuzzing framework that uncovers vulnerabilities at the JavaScript-WebAssembly boundary.
Security Testing Mar 19 Code High viability
Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors Build Now
Points-to-3D leverages point cloud priors for enhanced 3D asset and scene generation.
3D Generation Mar 19 Code High viability
Automatic Configuration of LLM Post-Training Pipelines Build Now
AutoPipe optimizes LLM post-training configurations efficiently using a budget-aware framework.
LLM Optimization Mar 19 Code High viability
A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models Build Now
A novel framework for concept unlearning in text-to-image diffusion models that enhances robustness and precision.
Text-to-Image Diffusion Mar 19 Code High viability
Implicit Grading Bias in Large Language Models: How Writing Style Affects Automated Assessment Across Math, Programming, and Essay Tasks Build Now
This research identifies implicit grading biases in LLMs based on writing style, highlighting the need for bias auditing in educational AI systems.
Educational AI Mar 19 Code High viability
DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection Build Now
DA-Mamba enhances domain adaptive object detection by integrating CNNs with State Space Models for improved global-local feature alignment.
Domain Adaptive Object Detection Mar 19 Code High viability
Are complicated loss functions necessary for teaching LLMs to reason? Build Now
A simplified training method for LLMs that enhances reasoning without complex loss functions.
LLM Training Mar 19 Code High viability
Automatic detection of Gen-AI texts: A comparative framework of neural models Build Now
A comparative framework for detecting AI-generated texts using advanced neural models.
AI Text Detection Mar 19 Code High viability
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Build Now
A mixed-precision quantization framework that optimizes video diffusion models for efficient inference.
Video Diffusion Mar 19 Code High viability
Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review Build Now
A study revealing how confirmation bias in LLMs affects security code reviews and proposing debiasing methods.
Security AI Mar 19 Code High viability
Ontology-Guided Diffusion for Zero-Shot Visual Sim2Real Transfer Build Now
Ontology-Guided Diffusion (OGD) enables interpretable and efficient zero-shot visual sim2real transfer using structured knowledge.
Sim2Real Transfer Mar 19 Code High viability
Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism Build Now
Li-Net is a novel architecture for efficient multi-channel time series forecasting using sparse attention mechanisms.
Time Series Forecasting Mar 19 Code High viability
OCP: Orthogonal Constrained Projection for Sparse Scaling in Industrial Commodity Recommendation Build Now
OCP optimizes embedding representation for industrial commodity recommendation systems, enhancing scalability and performance.
Recommendation Systems Mar 19 Code High viability
Balanced Thinking: Improving Chain of Thought Training in Vision Language Models Build Now
SCALe enhances vision-language model training by optimizing reasoning segment supervision for improved accuracy and efficiency.
Vision Language Models Mar 19 Code High viability
Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering Build Now
SVOO is a training-free sparse attention framework that enhances video generation speed without compromising quality.
Video Generation Mar 19 Code High viability
D-Mem: A Dual-Process Memory System for LLM Agents Build Now
D-Mem is a dual-process memory system designed to enhance long-horizon reasoning in autonomous agents by combining lightweight vector retrieval with high-fidelity deliberation.
Agents Mar 19 Code High viability
GEAR: Geography-knowledge Enhanced Analog Recognition Framework in Extreme Environments Build Now
GEAR is a framework for efficiently retrieving geological analogs in extreme environments using advanced topographic analysis.
Geospatial Analysis Mar 19 Code High viability
REST: Receding Horizon Explorative Steiner Tree for Zero-Shot Object-Goal Navigation Build Now
REST is a training-free framework for efficient zero-shot object-goal navigation using a tree of paths for decision-making.
Navigation AI Mar 19 Code High viability
OpenT2M: No-frill Motion Generation with Open-source,Large-scale, High-quality Data Build Now
OpenT2M provides a large-scale, high-quality motion dataset and a novel motion model for realistic human movement generation from text.
Motion Generation Mar 19 Code High viability
Cyber-Resilient Digital Twins: Discriminating Attacks for Safe Critical Infrastructure Control Build Now
i-SDT is an intelligent self-defending digital twin that enhances cyber-physical system resilience against attacks while maintaining operational efficiency.
Cybersecurity Mar 19 Code High viability
DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding Watch
DriveTok is a 3D driving scene tokenizer that enhances multi-view reconstruction and understanding for autonomous driving systems.
3D Scene Understanding Mar 19 Pending
OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards Watch
OS-Themis is a scalable multi-agent critic framework designed to enhance GUI agent performance through improved reward functions.
Reinforcement Learning Mar 19 Code
GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning Watch
GSMem enhances embodied exploration by using 3D Gaussian Splatting for persistent spatial memory and high-fidelity reasoning.
Embodied AI Mar 19 Code
Tendon-Actuated Robots with a Tapered, Flexible Polymer Backbone: Design, Fabrication, and Modeling Watch
A customizable, low-cost tendon-actuated continuum robot design for versatile inspection and manipulation tasks.
Robotics Mar 19 Code
A Dataset and Resources for Identifying Patient Health Literacy Information from Clinical Notes Watch
HEALIX provides a novel dataset for identifying patient health literacy from clinical notes, enhancing automated detection methods.
Medical AI Mar 19 Code
MERGE: Guided Vision-Language Models for Multi-Actor Event Reasoning and Grounding in Human-Robot Interaction Watch
Developing AI models for improving human-robot interaction through advanced vision-language reasoning and grounding capabilities.
AI for Human-Robot Interaction Mar 19 Code
VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation Watch
VGGT-360 is a training-free framework for zero-shot panoramic depth estimation that leverages 3D consistency for improved accuracy.
Depth Estimation Mar 19 Code
Translating MRI to PET through Conditional Diffusion Models with Enhanced Pathology Awareness Watch
Develop a tool to translate MRI scans to PET-like images using conditional diffusion models for improved pathology detection.
Healthcare AI Mar 19 Pending
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation Watch
A framework for enhancing mathematical reasoning in LLMs through improved training data and methodologies.
Mathematical Reasoning Mar 19 Code
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Watch
ProRL Agent offers a scalable API service for efficient RL training of multi-turn LLM agents.
Agents Mar 19
Can LLM generate interesting mathematical research problems? Watch
An agent that generates unique and valuable mathematical research problems using LLMs.
Mathematical AI Mar 19 Code
VesselTok: Tokenizing Vessel-like 3D Biomedical Graph Representations for Reconstruction and Generation Watch
VesselTok offers a novel framework for encoding and generating complex 3D biomedical graph representations of anatomical structures.
Biomedical Graphs Mar 19 Code
CSSDF-Net: Safe Motion Planning Based on Neural Implicit Representations of Configuration Space Distance Field Watch
CSSDF-Net provides a differentiable distance query mechanism for safe motion planning in robotics.
Robotics Mar 19 Code
Enhancing Multi-Corpus Training in SSL-Based Anti-Spoofing Models: Domain-Invariant Feature Extraction Watch
A framework for enhancing speech spoofing detection through invariant domain feature extraction.
Speech Recognition Mar 19 Code
GenVideoLens: Where LVLMs Fall Short in AI-Generated Video Detection? Watch
GenVideoLens is a fine-grained benchmark for evaluating LVLMs in detecting AI-generated videos, revealing critical performance gaps.
AI-Generated Video Detection Mar 19 Code
Cross-Modal Rationale Transfer for Explainable Humanitarian Classification on Social Media Watch
A multimodal classification framework that enhances explainability in humanitarian crisis classification using cross-modal rationale transfer.
Humanitarian AI Mar 19 Code
FinTradeBench: A Financial Reasoning Benchmark for LLMs Watch
FinTradeBench is a benchmark designed to evaluate financial reasoning in LLMs by integrating company fundamentals and trading signals.
Financial AI Mar 19 Code
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders Watch
Evaluating state space models as competitive vision encoders for vision-language models.
Vision-Language Models Mar 19 Pending
Robustness, Cost, and Attack-Surface Concentration in Phishing Detection Watch
A cost-aware framework for enhancing the robustness of phishing detection systems against feature manipulation attacks.
Phishing Detection Mar 19 Code
OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation Watch
OmniVTA aids robots in efficiently manipulating objects using integrated visual and tactile feedback.
Robotics & Automation Mar 19 Code
Box Maze: A Process-Control Architecture for Reliable LLM Reasoning Watch
Box Maze is a novel architecture designed to enhance the reliability of LLM reasoning through explicit cognitive control layers.
LLM Reasoning Mar 19 Code
Optimal Splitting of Language Models from Mixtures to Specialized Domains Watch
A novel method for optimizing language model training by effectively allocating compute resources between pretraining and specialization.
LLM Training Mar 19 Code
UGID: Unified Graph Isomorphism for Debiasing Large Language Models Watch
UGID is a framework for debiasing large language models by modeling their internal representations as structured computational graphs.
Debiasing LLMs Mar 19 Code
SHAPCA: Consistent and Interpretable Explanations for Machine Learning Models on Spectroscopy Data Watch
SHAPCA provides interpretable explanations for machine learning models applied to spectroscopy data, enhancing trust in clinical settings.
Explainable AI Mar 19 Code
SAVeS: Steering Safety Judgments in Vision-Language Models via Semantic Cues Watch
SAVeS introduces a framework to steer safety judgments in vision-language models using semantic cues.
Vision-Language Safety Mar 19 Code
An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction Watch
A dynamic ensemble framework that enhances loan default prediction accuracy through optimized model weighting.
Financial AI Mar 19 Code
Safety-Guaranteed Imitation Learning from Nonlinear Model Predictive Control for Spacecraft Close Proximity Operations Watch
A safety-guaranteed imitation learning framework for spacecraft close proximity control using Control Barrier Functions.
Spacecraft Control Mar 19 Code
Secure Linear Alignment of Large Language Models Watch
A privacy-preserving framework for cross-model alignment of language models enabling secure inference without direct data sharing.
Privacy-Preserving AI Mar 19 Code
Motion-o: Trajectory-Grounded Video Reasoning Watch
Enable advanced video reasoning through trajectory-grounded analysis for educational and security applications.
Video Analysis Mar 19 Pending
ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation Watch
ClawTrap is a MITM-based framework designed for evaluating the security of OpenClaw agents against real-world network threats.
Security Evaluation Mar 19 Code
WeNLEX: Weakly Supervised Natural Language Explanations for Multilabel Chest X-ray Classification Watch
WeNLEX generates weakly supervised natural language explanations for multilabel chest X-ray classification, enhancing interpretability and classification performance.
Medical AI Mar 19 Code
Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures Watch
A study exploring bias in LLM outputs based on dialect and proposing mitigation strategies through multi-agent architectures.
Bias Mitigation in LLMs Mar 19 Code
Off-Policy Learning with Limited Supply Watch
A novel off-policy learning method for optimizing item allocation in constrained environments like e-commerce.
Reinforcement Learning Mar 19 Code
HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning Watch
HISR enhances multi-turn decision-making in reinforcement learning by improving reward assignment through hindsight information.
Reinforcement Learning Mar 19 Code
Words at Play: Benchmarking Audio Pun Understanding in Large Audio-Language Models Watch
APUN-Bench is a benchmark for evaluating audio pun understanding in large audio language models.
Audio Understanding Mar 19 Code
Evaluating Model-Free Policy Optimization in Masked-Action Environments via an Exact Blackjack Oracle Watch
A novel approach to policy optimization in blackjack using a dynamic programming oracle for improved sample efficiency.
Reinforcement Learning Mar 19 Code
A Comparative Empirical Study of Catastrophic Forgetting Mitigation in Sequential Task Adaptation for Continual Natural Language Processing Systems Watch
A study on mitigating catastrophic forgetting in continual intent classification for NLP systems.
Continual Learning Mar 19 Code
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Ignore
Build a tool to extract implicit 3D priors from 2D scenes for enhanced scene understanding in AR applications.
3D Scene Understanding Mar 19 Pending
Under One Sun: Multi-Object Generative Perception of Materials and Illumination Ignore
MultiGP is a generative inverse rendering method that disentangles reflectance, texture, and illumination from a single image.
Generative Perception Mar 19 Code
Online Learning and Equilibrium Computation with Ranking Feedback Ignore
Developing algorithms for online learning using ranking feedback to improve decision-making in adversarial environments.
Online Learning Mar 19 Code
Rethinking Vector Field Learning for Generative Segmentation Ignore
A novel approach to generative segmentation using vector field learning to enhance performance over traditional methods.
Generative Segmentation Mar 19 Code
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs Ignore
LVOmniBench is a benchmark for evaluating long-form audio-video comprehension in omnimodal large language models.
Multimodal Evaluation Mar 19 Code
The Exponentially Weighted Signature Ignore
A novel framework for enhancing multidimensional path representation with exponentially weighted signatures for improved memory dynamics.
Statistical Learning Mar 19 Code
SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits Ignore
SOL-ExecBench provides a novel benchmarking framework for optimizing GPU kernels against hardware limits.
GPU Optimization Mar 19 Code
VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models Ignore
VEPO optimizes low-resource language models through a novel reinforcement learning framework that enhances tokenization and translation quality.
NLP Mar 19 Code
LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling Ignore
LuMamba enhances EEG modeling by providing electrode topology-invariant and efficient data processing.
Healthcare AI Mar 19 Pending
Communication-Efficient and Robust Multi-Modal Federated Learning via Latent-Space Consensus Ignore
CoMFed is a framework for efficient multi-modal federated learning that enhances model training while preserving privacy.
Federated Learning Mar 19 Code
Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos Ignore
Introducing a novel metric for evaluating 3D spatial geometric consistency in dynamically generated videos.
Generative Video Mar 19 Pending
Fast and Interpretable Autoregressive Estimation with Neural Network Backpropagation Ignore
A neural network approach for fast and interpretable autoregressive parameter estimation in time series analysis.
Time Series Analysis Mar 19 Code
Unleashing the Power of Simplicity: A Minimalist Strategy for State-of-the-Art Fingerprint Enhancement Ignore
A minimalist approach to fingerprint enhancement that outperforms complex methods for clearer and more accurate images.
Fingerprint Recognition Mar 19
Maximum-Entropy Exploration with Future State-Action Visitation Measures Ignore
A novel intrinsic reward mechanism for reinforcement learning that enhances exploration efficiency.
Reinforcement Learning Mar 19 Code
BVSIMC: Bayesian Variable Selection-Guided Inductive Matrix Completion for Improved and Interpretable Drug Discovery Ignore
BVSIMC enhances drug discovery by improving predictive accuracy and interpretability through Bayesian variable selection.
Drug Discovery Mar 19 Code
Controller Datapath Aware Verification of Masked Hardware Generated via High Level Synthesis Ignore
A verification tool for ensuring the security of masked hardware generated through High Level Synthesis.
Hardware Security Mar 19 Code
Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution Ignore
PASTE enhances LLM agent performance by reducing latency through speculative tool execution.
Agents Mar 19
From Accuracy to Readiness: Metrics and Benchmarks for Human-AI Decision-Making Ignore
A framework for evaluating human-AI decision-making readiness to enhance collaboration safety and effectiveness.
Human-AI Collaboration Mar 19 Code
MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model Ignore
MultihopSpatial is a benchmark for enhancing multi-hop spatial reasoning in Vision-Language Models.
Vision-Language Models Mar 19 Code
A Passive Elastic-Folding Mechanism for Stackable Airdrop Sensors Ignore
A passive mechanism for stackable airdrop sensors that enhances environmental monitoring efficiency.
Environmental Monitoring Mar 19 Code
Towards Interpretable Foundation Models for Retinal Fundus Images Ignore
Dual-IFM is an interpretable foundation model for retinal fundus images, enhancing decision-making in medical imaging.
Medical AI Mar 19 Code
A Model Ensemble-Based Post-Processing Framework for Fairness-Aware Prediction Ignore
A post-processing framework that enhances fairness in predictions across various machine learning tasks.
Fairness in AI Mar 19 Code
Signals of Success and Struggle: Early Prediction and Physiological Signatures of Human Performance across Task Complexity Ignore
A system that predicts user performance in interactive tasks using physiological signals.
Human Performance Prediction Mar 19 Code
Rethinking Uncertainty Quantification and Entanglement in Image Segmentation Ignore
A comprehensive study on uncertainty quantification in medical image segmentation to improve interpretability and performance.
Medical AI Mar 19 Code
ViTac-Tracing: Visual-Tactile Imitation Learning of Deformable Object Tracing Ignore
A visual-tactile imitation learning method for tracing deformable objects to enhance manipulation tasks.
Robotics Mar 19 Code
SoK: Practical Aspects of Releasing Differentially Private Graphs Ignore
A comprehensive framework for releasing differentially private graphs to enhance privacy without sacrificing utility.
Privacy in Graphs Mar 19 Code
Enhancing the Parameterization of Reservoir Properties for Data Assimilation Using Deep VAE-GAN Ignore
A deep learning model combining VAE and GAN for improved parameterization in petroleum reservoir simulation.
Reservoir Simulation Mar 19 Code
From ex(p) to poly: Gaussian Splatting with Polynomial Kernels Ignore
A new polynomial kernel for Gaussian Splatting enhances performance while maintaining dataset compatibility.
3D Graphics Mar 19 Code
STEP: Scientific Time-Series Encoder Pretraining via Cross-Domain Distillation Ignore
STEP is a framework for unified representation learning of scientific time series through cross-domain distillation.
Time Series Analysis Mar 19 Code
Revisiting Label Inference Attacks in Vertical Federated Learning: Why They Are Vulnerable and How to Defend Ignore
A novel defense mechanism against label inference attacks in vertical federated learning using task reassignment and layer adjustment.
Federated Learning Security Mar 19 Code
MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation Ignore
MANAR is a memory-augmented attention mechanism that enhances contextualization by implementing principles of Global Workspace Theory.
Attention Mechanisms Mar 19 Code
Benchmarking CNN-based Models against Transformer-based Models for Abdominal Multi-Organ Segmentation on the RATIC Dataset Ignore
A benchmarking study comparing CNN and transformer models for abdominal multi-organ segmentation in CT scans.
Medical AI Mar 19 Code
ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs Ignore
ZebraArena is a diagnostic simulation environment designed to study the coupling of reasoning and action in tool-augmented LLMs.
Diagnostic Environments Mar 19 Code
DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units Ignore
DiscoPhon is a multilingual benchmark for unsupervised phoneme discovery from discrete speech units.
Speech Processing Mar 19 Code
AutORAN: LLM-driven Natural Language Programming for Agile xApp Development Ignore
AutORAN is an LLM-driven framework that automates xApp development for Open Radio Access Networks, enabling rapid deployment from user intents.
Natural Language Programming Mar 19
DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge Ignore
DyMoE optimizes MoE inference for edge devices through dynamic mixed-precision quantization.
Edge AI Mar 19
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization Ignore
cuGenOpt provides a general-purpose GPU-accelerated framework for combinatorial optimization.
Optimization Framework Mar 19 Pending
Parallelograms Strike Back: LLMs Generate Better Analogies than People Ignore
This research explores how LLMs generate analogies better than humans, challenging traditional models of analogy.
NLP Mar 19
Adaptive Nonlinear Data Assimilation through P-Spline Triangular Measure Transport Ignore
A novel adaptive algorithm for nonlinear data assimilation using P-spline triangular measure transport.
Data Assimilation Mar 19 Code
MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models Ignore
MoRI aims to integrate motivation-grounded reasoning in scientific ideation within large language models.
AI Research Mar 19 Pending
Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference Ignore
A lightweight cryptographic proof framework for verifying AI inference correctness in cloud-based services.
Verifiable AI Mar 19
RADIUS: Ranking, Distribution, and Significance - A Comprehensive Alignment Suite for Survey Simulation Ignore
RADIUS is an alignment suite for improving the evaluation of survey simulations using LLMs.
Survey Simulation Mar 19
Revisiting OmniAnomaly for Anomaly Detection: performance metrics and comparison with PCA-based models Ignore
A comparative study of OmniAnomaly and PCA for multivariate time series anomaly detection.
Anomaly Detection Mar 19 Code
Book your room in the Turing Hotel! A symmetric and distributed Turing Test with multiple AIs and humans Ignore
A novel platform for conducting a distributed Turing Test with LLMs and humans.
Agents Mar 19
Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction Ignore
A framework for structured intent representation to enhance human-AI interaction.
NLP Mar 19
A conceptual framework for ideology beyond the left and right Ignore
A framework for analyzing complex ideologies beyond traditional left/right paradigms using NLP.
NLP and Ideology Mar 19 Code
Progressive Training for Explainable Citation-Grounded Dialogue: Reducing Hallucination to Zero in English-Hindi LLMs Ignore
A novel training pipeline for bilingual dialogue systems that aims to eliminate hallucination through citation grounding.
Dialogue Systems Mar 19
BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding Ignore
BeamAgent optimizes MIMO beamforming by decoupling intent parsing from numerical optimization using LLMs.
Wireless Communication Optimization Mar 19
Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision--Language--Motion Diffusion Architecture Ignore
A framework for generating instruction-aware gestures in humanoid educational robots using a reasoning-guided approach.
Humanoid Robotics Mar 19
Dual-Model Prediction of Affective Engagement and Vocal Attractiveness from Speaker Expressiveness in Video Learning Ignore
A speaker-centric Emotion AI approach predicting audience engagement and vocal attractiveness from speaker expressiveness in video learning.
Emotion AI Mar 19
ROFT-VINS: Robust Feature Tracking-based Visual-Inertial State Estimation for Harsh Environment Ignore
A deep learning method for robust visual feature tracking in monocular camera images for SLAM and Odometry.
Visual-Inertial Odometry Mar 19
Cross-Ecosystem Vulnerability Analysis for Python Applications Ignore
A novel approach for analyzing vulnerabilities in Python applications through cross-ecosystem dependency analysis.
Security Analysis Mar 19
Beyond TVLA: Anderson-Darling Leakage Assessment for Neural Network Side-Channel Leakage Detection Ignore
A new framework for improved side-channel leakage detection in neural networks using the Anderson-Darling test.
Side-Channel Leakage Detection Mar 19
An Onto-Relational-Sophic Framework for Governing Synthetic Minds Ignore
A philosophical framework for governing the development and integration of synthetic minds in society.
Governance of AI Mar 19 Code
Learning to Self-Evolve Ignore
A reinforcement learning framework that enables large language models to iteratively refine their contexts for improved performance.
Reinforcement Learning Mar 19
Spectrally-Guided Diffusion Noise Schedules Ignore
This paper proposes a new method for designing noise schedules in diffusion models to enhance image and video generation quality.
Diffusion Models Mar 19
Improving RCT-Based Treatment Effect Estimation Under Covariate Mismatch via Calibrated Alignment Ignore
CALM improves treatment effect estimation by aligning covariate representations from RCTs and observational studies.
Medical AI Mar 19
Evaluating Counterfactual Strategic Reasoning in Large Language Models Ignore
This paper evaluates LLMs' strategic reasoning in game-theoretic contexts, revealing limitations in their performance.
NLP Research Mar 19
Rigorous Error Certification for Neural PDE Solvers: From Empirical Residuals to Solution Guarantees Ignore
This paper addresses the generalization error in physics-informed neural networks for solving partial differential equations.
Neural PDE Solvers Mar 19
PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of Prediction-Powered Inference Ignore
PPI integrates machine learning predictions with statistical inference, drawing from survey sampling methods.
Statistical Inference Mar 19
Performance Testing of ChaCha20-Poly1305 for Internet of Things and Industrial Control System devices Ignore
This paper evaluates the performance of ChaCha20-Poly1305 encryption for IoT and ICS devices, highlighting its low impact on latency.
IoT Security Mar 19
Hierarchical Latent Structure Learning through Online Inference Ignore
HOLMES is a computational framework for hierarchical latent structure learning through online inference.
Hierarchical Learning Mar 19
Implicit Patterns in LLM-Based Binary Analysis Ignore
This paper presents a study on implicit patterns in LLM-based binary vulnerability analysis.
Binary Analysis Mar 19
From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models Ignore
This paper critiques existing efficiency metrics for Vision-Language-Action models and proposes a focus on embodied efficiency.
Vision-Language-Action Mar 19
How Uncertainty Estimation Scales with Sampling in Reasoning Models Ignore
This paper explores uncertainty estimation in reasoning models but lacks a clear product application.
Uncertainty Estimation Mar 19
Position: Spectral GNNs Are Neither Spectral Nor Superior for Node Classification Ignore
This paper critiques the theoretical foundations of Spectral Graph Neural Networks and their effectiveness in node classification.
Graph Neural Networks Mar 19
Serendipity by Design: Evaluating the Impact of Cross-domain Mappings on Human and LLM Creativity Ignore
This research explores the effects of cross-domain mappings on creativity in humans and LLMs.
Creativity in LLMs Mar 19
On The Effectiveness of the UK NIS Regulations as a Mandatory Cybersecurity Reporting Regime Ignore
This paper analyzes the effectiveness of the UK NIS Regulations in reporting cybersecurity incidents affecting critical infrastructure.
Cybersecurity Mar 19
Hardness of High-Dimensional Linear Classification Ignore
This paper presents theoretical lower bounds for linear classification problems.
Theoretical Foundations Mar 19
Man and machine: artificial intelligence and judicial decision making Ignore
Exploring the integration of AI in judicial decision-making to enhance transparency and accountability.
Legal AI Mar 19
When Differential Privacy Meets Wireless Federated Learning: An Improved Analysis for Privacy and Convergence Ignore
This paper analyzes privacy and convergence in differential privacy for wireless federated learning.
Federated Learning Mar 19
Security awareness in LLM agents: the NDAI zone case Ignore
This paper explores the limitations of LLM agents in recognizing secure environments for privacy-preserving negotiations.
Agents Mar 19
Regret Bounds for Competitive Resource Allocation with Endogenous Costs Ignore
This paper presents a theoretical analysis of online resource allocation with endogenous costs and interaction effects.
Resource Allocation Mar 19
Evaluating Game Difficulty in Tetris Block Puzzle Ignore
A study on evaluating game difficulty in Tetris using a novel planning agent.
Game AI Mar 19
Foundations of Schrödinger Bridges for Generative Modeling Ignore
This paper develops the mathematical foundations of Schrödinger bridges for generative modeling.
Generative Modeling Mar 19
Best-of-Both-Worlds Multi-Dueling Bandits: Unified Algorithms for Stochastic and Adversarial Preferences under Condorcet and Borda Objectives Ignore
A novel algorithm for multi-dueling bandits that optimally adapts to both stochastic and adversarial environments.
Multi-Dueling Bandits Mar 19
Teleological Inference in Structural Causal Models via Intentional Interventions Ignore
This paper explores the use of structural causal models to understand the intentions of goal-directed agents through intentional interventions.
Causal Inference Mar 19
Unified Taxonomy for Multivariate Time Series Anomaly Detection using Deep Learning Ignore
A comprehensive taxonomy for multivariate time series anomaly detection using deep learning.
Anomaly Detection Mar 19
Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought Ignore
This study explores how the shape of uncertainty dynamics in LLM reasoning can predict accuracy.
LLM Reasoning Mar 19
Kernel Single-Index Bandits: Estimation, Inference, and Learning Ignore
A theoretical exploration of kernelized algorithms for contextual bandits with single-index models.
Contextual Bandits Mar 19
Agentic Business Process Management: A Research Manifesto Ignore
A manifesto proposing a new paradigm for Business Process Management focused on autonomous agents.
Agents Mar 19
Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections Ignore
This paper reviews the evolving regulatory landscape for AI, focusing on security and privacy in the context of agentic AI.
Regulatory AI Mar 19
Neural Galerkin Normalizing Flow for Transition Probability Density Functions of Diffusion Models Ignore
A framework for approximating transition probability density functions using Neural Galerkin Normalizing Flows.
Diffusion Models Mar 19
Uniform a priori bounds and error analysis for the Adam stochastic gradient descent optimization method Ignore
This paper presents a theoretical error analysis of the Adam optimizer for stochastic optimization problems.
Optimization Algorithms Mar 19
I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems Ignore
This research evaluates the integrity of AI agents in governance roles, highlighting the need for pre-deployment safeguards.
Agents Mar 19
Quantitative Introspection in Language Models: Tracking Internal States Across Conversation Ignore
This research explores using numeric self-reports in LLMs to track internal emotive states during conversations.
NLP Interpretability Mar 19
Authority-Level Priors: An Under-Specified Constraint in Hierarchical Predictive Processing Ignore
This paper proposes Authority-Level Priors as constraints in hierarchical predictive processing to explain stress reactivity and behavioral control.
Predictive Processing Mar 19
Geography According to ChatGPT -- How Generative AI Represents and Reasons about Geography Ignore
Exploring how generative AI represents and reasons about geography through theoretical vignettes.
Geographic AI Mar 19
A Human-in/on-the-Loop Framework for Accessible Text Generation Ignore
A framework integrating human feedback into LLM-based text simplification for cognitive accessibility.
NLP Accessibility Mar 19
Evaluating LLM-Generated Lessons from the Language Learning Students' Perspective: A Short Case Study on Duolingo Ignore
This study identifies gaps in language learning apps like Duolingo and suggests personalized lesson generation for professional fluency.
Language Learning Mar 19
Through the Looking-Glass: AI-Mediated Video Communication Reduces Interpersonal Trust and Confidence in Judgments Ignore
This research explores how AI-mediated video communication affects trust and judgment in interpersonal interactions.
AI in Communication Mar 19
Conflict-Based Search for Multi Agent Path Finding with Asynchronous Actions Ignore
A new method for optimal multi-agent path finding that handles asynchronous actions.
Multi-Agent Path Finding Mar 19
Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders Ignore
This paper analyzes the failure of cross-lingual alignment to improve transfer performance in NLP tasks.
Cross-Lingual Transfer Mar 19
Student views in AI Ethics and Social Impact Ignore
This paper explores student perspectives on the ethical and societal impacts of AI, highlighting gender differences in awareness and concerns.
AI Ethics Mar 19
Seasoning Generative Models for a Generalization Aftertaste Ignore
This paper presents a theoretical framework for improving the generalization of generative models using discriminator guidance.
Generative Models Mar 19
"You've got a friend in me": Co-Designing a Peer Social Robot for Young Newcomers' Language and Cultural Learning Ignore
Maple is a socially assistive robot designed to enhance language and cultural learning for young newcomers through interactive storytelling.
Socially Assistive Robots Mar 19
SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime Ignore
SRRM enhances the efficiency of surrogates for the Wasserstein distance but lacks practical implementation details.
Statistical Methods Mar 19
Secure Wi-Fi Ranging Today: Security and Adoption of IEEE 802.11az/bk Ignore
This paper analyzes the security vulnerabilities and deployment challenges of the new IEEE 802.11az and 802.11bk standards for secure Wi-Fi ranging.
Wi-Fi Security Mar 19
Cognitive Amplification vs Cognitive Delegation in Human-AI Systems: A Metric Framework Ignore
This paper presents a framework for evaluating the balance between cognitive amplification and delegation in human-AI systems.
Human-AI Interaction Mar 19
Multimodal Model for Computational Pathology:Representation Learning and Image Compression Ignore
A comprehensive review of challenges and advancements in multimodal computational pathology.
Medical AI Mar 19
A Theoretical Comparison of No-U-Turn Sampler Variants: Necessary and Su?cient Convergence Conditions and Mixing Time Analysis under Gaussian Targets Ignore
This paper theoretically analyzes the convergence properties of No-U-Turn Sampler variants without providing a clear path to practical application.
Bayesian Sampling Mar 19
SwiftGS: Episodic Priors for Immediate Satellite Surface Recovery Ignore
SwiftGS offers a novel approach to rapid 3D reconstruction from satellite imagery using meta-learning techniques.
3D Reconstruction Mar 19
A Complexity Hierarchy of Shuffles in Card-Based Protocols Ignore
This paper classifies the complexity of shuffles in card-based cryptography protocols.
Cryptography Mar 19
Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind Ignore
An anthology of research papers on Theory of Mind in AI.
Theory of Mind in AI Mar 19