-
NExT-GPT: Any-to-Any Multimodal LLM
Paper • 2309.05519 • Published • 78 -
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper • 2309.03883 • Published • 35 -
apple/DCLM-7B
7B • Updated • 32 • 830 -
Aria: An Open Multimodal Native Mixture-of-Experts Model
Paper • 2410.05993 • Published • 112
Tom Mulder PRO
tommulder
AI & ML interests
None yet
Recent Activity
liked
a Space
9 days ago
black-forest-labs/FLUX.1-Kontext-Dev
updated
a collection
about 1 month ago
Transformers
upvoted
a
paper
about 1 month ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data
Processing to Every Language