Guided Decoding and Its Critical Role in Retrieval-Augmented Generation Paper • 2509.06631 • Published 8 days ago • 7
Article Guided Decoding and Its Critical Role in Retrieval-Augmented Generation: A Deep Dive into Structured LLM Outputs By nmmursit and 7 others • 8 days ago • 15
Article Theoretical Limitations of Embedding Models and Their Applications in Turkish: An In-Depth Look By nmmursit and 1 other • 12 days ago • 14
Article Welcome EmbeddingGemma, Google's new efficient embedding model By tomaarsen and 5 others • 13 days ago • 208
Article 🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders By adaamko and 1 other • 16 days ago • 9
Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications By nmmursit and 5 others • 18 days ago • 26
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 178
Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others • Jul 16 • 67
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 669
Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 146
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.3k
Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others • Dec 31, 2024 • 1.12k
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • Jan 31 • 51
Parallel Sentences Datasets Collection Every dataset in this collection has "english" and "non_english" columns. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 19
Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 210
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published Dec 23, 2024 • 32
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 108
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 469 items • Updated about 22 hours ago • 60