-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 44 -
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Paper • 2501.06842 • Published • 16 -
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 96
nDimensional
nDimensional
AI & ML interests
Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks
Recent Activity
liked
a model
4 days ago
lodestones/Chroma1-Base
liked
a model
4 days ago
lodestones/Chroma1-HD
upvoted
a
paper
4 days ago
Self-Rewarding Vision-Language Model via Reasoning Decomposition
Organizations
None yet