- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
  Paper • 2401.00448 • Published • 31
- Improving Text Embeddings with Large Language Models
  Paper • 2401.00368 • Published • 83
- E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
  Paper • 2401.06951 • Published • 27
- The Unreasonable Ineffectiveness of the Deeper Layers
  Paper • 2403.17887 • Published • 82
allthingsdisaggregated
lastweek
AI & ML interests
None yet
Recent Activity
upvoted a paper 25 days ago
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
upvoted a paper 3 months ago
Inference-Time Hyper-Scaling with KV Cache Compression
upvoted a paper 3 months ago
Cosmos World Foundation Model Platform for Physical AI
Organizations
None yet