State of the Field
Recent advancements in object detection are increasingly focused on enhancing efficiency and adaptability in real-time applications. New methodologies, particularly those based on the Detection Transformer (DETR) framework, are addressing traditional challenges such as query utilization and computational overhead. For instance, recent work has introduced matching-free training schemes that eliminate the need for heuristic matching, significantly improving training speed and performance. Additionally, innovations like RiO-DETR are enabling real-time detection of oriented objects, overcoming issues related to angle periodicity and search space complexity. The introduction of dynamic query generation in frameworks like PaQ-DETR is further refining the balance between accuracy and interpretability. Meanwhile, specialized solutions for small object detection in UAV imagery, such as CollabOD, are optimizing feature alignment and structural detail preservation. These developments not only promise to enhance the robustness of object detection systems but also open avenues for commercial applications in surveillance, autonomous vehicles, and robotics, where efficiency and accuracy are paramount.
Papers
1–5 of 5Beyond Hungarian: Match-Free Supervision for End-to-End Object Detection
Recent DEtection TRansformer (DETR) based frameworks have achieved remarkable success in end-to-end object detection. However, the reliance on the Hungarian algorithm for bipartite matching between qu...
PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection
Detection Transformer (DETR) has redefined object detection by casting it as a set prediction task within an end-to-end framework. Despite its elegance, DETR and its variants still rely on fixed learn...
CollabOD: Collaborative Multi-Backbone with Cross-scale Vision for UAV Small Object Detection
Small object detection in unmanned aerial vehicle (UAV) imagery is challenging, mainly due to scale variation, structural detail degradation, and limited computational resources. In high-altitude scen...
RiO-DETR: DETR for Real-time Oriented Object Detection
We present RiO-DETR: DETR for Real-time Oriented Object Detection, the first real-time oriented detection transformer to the best of our knowledge. Adapting DETR to oriented bounding boxes (OBBs) pose...
A comprehensive overview of deep learning models for object detection from videos/images
Object detection in video and image surveillance is a well-established yet rapidly evolving task, strongly influenced by recent deep learning advancements. This review summarises modern techniques by ...