object-detection

#object-detection

@VincentLogic: NVIDIA open-sourced a visual grounding model: LocateAnything-3B. Dozens of minions densely piled together — it detects every single one without missing any, all boxed. The technological shift behind this is worth more than just saying 'more accurate'.

X AI KOLs Timeline · 4d ago

NVIDIA has open-sourced the visual grounding model LocateAnything-3B, which accurately detects and draws bounding boxes around every target object in dense scenes.

@pattssun: The @nyknicks championship inspired me to build an AI basketball coach to improve my 1v1 game. Built with: - Roboflow R…

X AI KOLs Following · 6d ago

A developer built an AI basketball coach using Roboflow RF-DETR for detection, MediaPipe for body angles, and OpenCV for analysis and annotation.

An Introduction to YOLO26

Hacker News Top · 2026-06-23

YOLO26 is a multi-task computer vision model family released in January 2026. It performs end-to-end detection without Non-Maximum Suppression (NMS), which lowers latency, and it is optimized for edge deployment with improved CPU inference and a compact design.
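For context on what an NMS-free detector leaves out, here is a minimal sketch of classic greedy Non-Maximum Suppression in plain Python. It is not taken from YOLO26 or Ultralytics code. The function names `iou` and `greedy_nms`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are all assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple

# Assumed box layout for this sketch: (x1, y1, x2, y2) with x2 >= x1, y2 >= y1.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(
    boxes: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,  # illustrative default, not a library constant
) -> List[int]:
    """Return indices of boxes kept, highest score first.

    A box is kept only if it does not overlap any already-kept box
    by more than iou_threshold.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    demo_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    demo_scores = [0.9, 0.8, 0.7]
    print(greedy_nms(demo_boxes, demo_scores))
```

In the demo, the second box overlaps the first heavily and is suppressed, while the third is kept. An end-to-end detector is trained to emit one box per object directly, so this pairwise filtering pass is not needed at inference time.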

@DataChaz: @NVIDIA just dropped LocateAnything, making object detection ~10x faster by fixing one core bottleneck: How the model w…

X AI KOLs Following · 2026-06-17

NVIDIA released LocateAnything, an open-source model that achieves ~10x faster object detection by predicting all coordinates simultaneously instead of sequentially, reaching 12.7 FPS on a single H100 and outperforming 32B parameter models.

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

Hugging Face Daily Papers · 2026-06-02

Ultralytics YOLO26 introduces a unified real-time vision model family with NMS-free inference, improved training strategies, and multi-task capabilities for detection, segmentation, and pose estimation, achieving state-of-the-art accuracy-latency trade-offs.

@ZhidingYu: Thank you NVIDIA! I will be presenting LocateAnything at #CVPR2026 at the NVIDIA Booth: June 5 4:20 - 4:40 pm MDT (Frid…

X AI KOLs Following · 2026-05-28

NVIDIA introduces LocateAnything, a unified generative grounding and detection framework that uses Parallel Box Decoding to improve decoding throughput and localization accuracy. This work will be presented at CVPR 2026.

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Hugging Face Daily Papers · 2026-05-26

LocateAnything proposes Parallel Box Decoding for unified visual grounding and object detection, decoding geometric elements as atomic units to improve throughput and localization accuracy, supported by a large-scale dataset of 138M samples.

@Phoenixyin13: In the world of object detection, there have always been two schools: the YOLO school, the traditional powerhouse, following the principle that speed is the ultimate weapon. Extremely fast, it dominates industries, drones, and surveillance cameras. The Transformer school, the academic aristocrat, highly intelligent with superior accuracy, but due to massive computational consumption, it was like a delicate Lin Daiyu in the past, unable to run in scenarios requiring real-time response...

X AI KOLs Timeline · 2026-05-24

The RF-DETR model, proposed at ICLR 2026, combines Transformer-level accuracy with real-time performance. It scores highly across 100 real-world scenarios and comes in sizes from Nano to 2XL, potentially replacing YOLO in real-time detection.

@tenderizzation: it’s literally off the scale! welcome back yolov3

X AI KOLs Following · 2026-05-09

A social media post expresses excitement about the return or renewed relevance of the YOLOv3 object detection model.

/yolo

Reddit r/LocalLLaMA · 2026-04-21

Article concerning YOLO, the widely used real-time object detection model family.

@ycombinator: LLMs are great for human in the loop applications, but fail at deterministic developer tasks. @interfaze_ai is a new AI…

X AI KOLs Following · 2026-04-20

Interfaze AI introduces a specialized model that surpasses general LLMs on deterministic developer tasks including OCR, object detection, web scraping, speech-to-text, and classification.

What should I do to have a good OD model? [P]

Reddit r/MachineLearning · 2026-04-20

A user is seeking advice on improving their object detection model trained with YOLO11n for deployment on a Raspberry Pi 5, struggling with the gap between theoretical mAP50 metrics and practical detection performance.

SAM 3.1: Faster and More Accessible Real-Time Video Detection and Tracking With Multiplexing and Global Reasoning

Meta AI Blog · 2026-03-26

Meta AI releases SAM 3.1, an update to the Segment Anything Model that enhances real-time video detection and tracking through multiplexing and global reasoning capabilities.

Generative AI improves a wireless vision system that sees through obstructions

MIT News — Artificial Intelligence · 2026-03-19

MIT researchers have developed a generative AI-enhanced wireless vision system that reconstructs hidden objects and entire room scenes using millimeter-wave signals, overcoming previous limitations in shape reconstruction and enabling applications in warehouse robotics and smart homes.

Frigate with Hailo for object detection on a Raspberry Pi

Jeff Geerling · 2026-02-18

This blog post details how to set up Frigate with a Hailo AI coprocessor on a Raspberry Pi for object detection, including steps to fix a PCIe descriptor page size error. The setup works with the cheaper Hailo-8L and achieves low inference times.

SAM 3: Segment Anything with Concepts

Papers with Code Trending · 2025-11-20

SAM 3 introduces a unified model for promptable concept segmentation and tracking, achieving state-of-the-art performance with a decoupled recognition and localization architecture and a scalable data engine.

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Papers with Code Trending · 2025-11-12

RF-DETR introduces a lightweight detection transformer that uses weight-sharing neural architecture search to achieve state-of-the-art real-time object detection, outperforming prior methods on COCO and Roboflow100-VL while running up to 20x faster.

blakeblackshear/frigate

GitHub Trending (daily) · 2026-05-24

Frigate is an open-source NVR designed for Home Assistant that performs real-time AI object detection on IP camera feeds locally using OpenCV and TensorFlow. It features tight Home Assistant integration, motion-based detection, and efficient resource usage.

adirik/grounding-dino

Replicate Explore · 2026-05-08

Grounding DINO is an open-vocabulary object detection model that can detect arbitrary objects based on text descriptions, now available on Replicate.
