Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

williamcstanford 's Collections
video segmentation
diffusion
RL
robotics
LLMs
video gen
Autonomous agents
Transformer improvements
Music gen
video understanding
brain
MUST FOLLOWS
relighting
singing portraits
Depth Estimation
Cellular Automata DL
Code Understanding
datasets

video understanding

updated Jun 26, 2024
Upvote
-

  • VideoPrism: A Foundational Visual Encoder for Video Understanding

    Paper • 2402.13217 • Published Feb 20, 2024 • 37

  • Sora Generates Videos with Stunning Geometrical Consistency

    Paper • 2402.17403 • Published Feb 27, 2024 • 18

  • Video as the New Language for Real-World Decision Making

    Paper • 2402.17139 • Published Feb 27, 2024 • 22

  • VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models

    Paper • 2406.16338 • Published Jun 24, 2024 • 27
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs