Beijing Academy of Artificial Intelligence

Team

non-profit

https://www.baai.ac.cn/english.html

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zhizhou57 updated a dataset 1 day ago

BAAI/ROME

zhizhou57 published a dataset 1 day ago

BAAI/ROME

hanhainebula updated a model 1 day ago

BAAI/bge-reasoner-embed-qwen3-8b-0923

View all activity

Articles

Letting Large Models Debate: The First Multilingual LLM Debate Competition

zhizhou57

updated a dataset 1 day ago

BAAI/ROME

Viewer • Updated 1 day ago • 281 • 176 • 2

zhizhou57

published a dataset 1 day ago

BAAI/ROME

Viewer • Updated 1 day ago • 281 • 176 • 2

hanhainebula

updated a model 1 day ago

BAAI/bge-reasoner-embed-qwen3-8b-0923

Feature Extraction • 8B • Updated 1 day ago • 23 • 8

xuanricheng

authored a paper 2 days ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published 3 days ago • 11

Yonghua

authored a paper 2 days ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published 3 days ago • 11

philokey

authored 4 papers 2 days ago

CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning

Paper • 2401.14011 • Published Jan 25, 2024

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Paper • 2505.11842 • Published May 17

RoboBrain 2.0 Technical Report

Paper • 2507.02029 • Published Jul 2 • 31

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published 3 days ago • 11

lilaczheng

authored a paper 2 days ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published 3 days ago • 11

hanhainebula

updated a collection 2 days ago

BGE

31 items • Updated 2 days ago • 133

hanhainebula

published a model 2 days ago

BAAI/bge-reasoner-embed-qwen3-8b-0923

Feature Extraction • 8B • Updated 1 day ago • 23 • 8

HelloGitHub

updated a Space 7 days ago

EmbodiedVerse

Explore and compare model evaluations

s1u23mmer

updated a dataset 13 days ago

BAAI/RealTalk-CN

Updated 13 days ago • 133 • 2

s1u23mmer

published a dataset 14 days ago

BAAI/RealTalk-CN

Updated 13 days ago • 133 • 2

Hui519

published a dataset 16 days ago

BAAI/MusicEval

Viewer • Updated Aug 18 • 2.75k • 33

ldwang

updated 3 models 17 days ago

BAAI/OpenSeek-Small-v1-SFT

2B • Updated 17 days ago • 19.5k • 4

BAAI/OpenSeek-Small-v1

2B • Updated 17 days ago • 6 • 17

BAAI/OpenSeek-Small-v1-Baseline

2B • Updated 17 days ago • 99 • 5

ZacLiu

updated a model 17 days ago

BAAI/OpenSeek-Small-v1

2B • Updated 17 days ago • 6 • 17