Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
DataComp
non-profit
https://www.datacomp.ai/dclm/index.html#home
Activity Feed
Follow
92
AI & ML interests
None defined yet.
Recent Activity
wannaphong
authored
a paper
about 15 hours ago
Mangosteen: An Open Thai Corpus for Language Model Pretraining
yixinsong
authored
a paper
9 days ago
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment
lx865712528
authored
a paper
16 days ago
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
View all activity
Team members
88
+54
+41
+20
+10
dclm
's models
None public yet