(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
AI & ML interests
Machine Learning, Computer Vision, Embodied AI
Recent Activity
[NeurIPS 2024] Grasp as You Say: Language-guided Dexterous Grasp Generation
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
-
fushh7/LLMDet
Zero-Shot Object Detection • Updated • 19 -
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
Paper • 2501.18954 • Published -
iSEE-Laboratory/llmdet_tiny
Zero-Shot Object Detection • 0.2B • Updated • 2.33k • 4 -
iSEE-Laboratory/llmdet_base
Zero-Shot Object Detection • 0.2B • Updated • 265k • 3
(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
[NeurIPS 2024] Grasp as You Say: Language-guided Dexterous Grasp Generation
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
-
fushh7/LLMDet
Zero-Shot Object Detection • Updated • 19 -
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
Paper • 2501.18954 • Published -
iSEE-Laboratory/llmdet_tiny
Zero-Shot Object Detection • 0.2B • Updated • 2.33k • 4 -
iSEE-Laboratory/llmdet_base
Zero-Shot Object Detection • 0.2B • Updated • 265k • 3