Stanford AI

university

https://www.ai.stanford.edu

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

kzliu authored a paper 26 days ago

UQ: Assessing Language Models on Unsolved Questions

keshik6 authored a paper 3 months ago

Spatial Mental Modeling from Limited Views

keshik6 authored a paper 3 months ago

Re-thinking Temporal Search for Long-Form Video Understanding

View all activity

Articles

TimeScope: How Long Can Your Video Large Multimodal Model Go?

SmolVLM2: Bringing Video Understanding to Every Device

SmolVLM2: 让视频理解能力触手可及

yuegao

authored a paper 4 months ago

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Paper • 2106.02637 • Published Jun 4, 2021

kushalt

authored a paper 4 months ago

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published May 12 • 14

Muennighoff

authored 2 papers 5 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 54

Kameshr

updated a dataset 6 months ago

Stanford/Compiled_COT

Viewer • Updated Mar 15 • 2.23M • 45 • 2

Kameshr

published a dataset 6 months ago

Stanford/Compiled_COT

Viewer • Updated Mar 15 • 2.23M • 45 • 2

nicholswang

authored a paper 6 months ago

Video Action Differencing

Paper • 2503.07860 • Published Mar 10 • 33

yuegao

authored a paper 7 months ago

FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video

Paper • 2503.04720 • Published Mar 6 • 1

ayushchakravarthy

authored a paper 7 months ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published Mar 3 • 38

Muennighoff

authored a paper 8 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 125

nicholswang

authored 2 papers 8 months ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23 • 23

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 55

nicholswang

authored 8 papers 9 months ago

Action Sensitivity Learning for Temporal Action Localization

Paper • 2305.15701 • Published May 25, 2023

Whitening-based Contrastive Learning of Sentence Embeddings

Paper • 2305.17746 • Published May 28, 2023

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Paper • 2305.18010 • Published May 29, 2023

Describing Differences in Image Sets with Natural Language

Paper • 2312.02974 • Published Dec 5, 2023 • 16

Clustering based Point Cloud Representation Learning for 3D Analysis

Paper • 2307.14605 • Published Jul 27, 2023

JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery

Paper • 2307.16377 • Published Jul 31, 2023

Bird's-Eye-View Scene Graph for Vision-Language Navigation

Paper • 2308.04758 • Published Aug 9, 2023

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Paper • 2403.10517 • Published Mar 15, 2024 • 37