Edge Computing

Trending

4papers

7.5viability

+100%30d

Papers

1–4 of 4

Research Paper·Jan 15, 2026

LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers

Compressing the KV cache is a required step to deploy large language models on edge devices. Current quantization methods compress storage but fail to reduce bandwidth as attention calculation require...

9.0 viability

Research Paper·Mar 17, 2026

Deep Reinforcement Learning-driven Edge Offloading for Latency-constrained XR pipelines

Immersive extended reality (XR) applications introduce latency-critical workloads that must satisfy stringent real-time responsiveness while operating on energy- and battery-constrained devices, makin...

7.0 viability

Research Paper·Mar 16, 2026

Lightweight User-Personalization Method for Closed Split Computing

Split Computing enables collaborative inference between edge devices and the cloud by partitioning a deep neural network into an edge-side head and a server-side tail, reducing latency and limiting ex...

7.0 viability

Research Paper·Mar 9, 2026

RAPID: Redundancy-Aware and Compatibility-Optimal Edge-Cloud Partitioned Inference for Diverse VLA models

Vision Language Action (VLA) models are mainstream in embodied intelligence but face high inference costs. Edge-Cloud Collaborative (ECC) inference offers an effective fix by easing edge-device comput...

7.0 viability

Edge Computing

Papers

LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers

Deep Reinforcement Learning-driven Edge Offloading for Latency-constrained XR pipelines

Lightweight User-Personalization Method for Closed Split Computing

RAPID: Redundancy-Aware and Compatibility-Optimal Edge-Cloud Partitioned Inference for Diverse VLA models

Filters