Papers
1–3 of 3Research Paper·Mar 7, 2026
StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models
Recent token merging techniques for Vision Transformers (ViTs) provide substantial speedups by reducing the number of tokens processed by self-attention, often without retraining. However, their direc...
7.0 viability
Research Paper·Mar 13, 2026
Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation
Referring image segmentation aims to produce a pixel-level mask for the image region described by a natural-language expression. Although pretrained vision-language models have improved semantic groun...
7.0 viability
Research Paper·Mar 9, 2026
Visualizing Coalition Formation: From Hedonic Games to Image Segmentation
We propose image segmentation as a visual diagnostic testbed for coalition formation in hedonic games. Modeling pixels as agents on a graph, we study how a granularization parameter shapes equilibrium...
6.0 viability