Rishabh Singh

lulzx

AI & ML interests

Information retrieval

Recent Activity

liked a model 2 days ago

zai-org/GLM-4.5

liked a model 3 days ago

MetaStoneTec/XBai-o4

liked a model 3 days ago

Qwen/Qwen3-30B-A3B-Instruct-2507

View all activity

Organizations

upvoted an article 16 days ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

and 3 others •

18 days ago

• 47

upvoted a paper 22 days ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 104

upvoted an article about 1 month ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 11 others •

Jun 27

• 27

upvoted an article about 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 495

upvoted a paper about 2 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 253

upvoted a paper 2 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 127

upvoted 2 articles 3 months ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

and 5 others •

May 13

• 74

Article

Object Detection Leaderboard

and 1 other •

Sep 18, 2023

• 19

upvoted an article 4 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

and 1 other •

Oct 14, 2024

• 96

upvoted 2 papers 5 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73

upvoted 2 articles 5 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

and 3 others •

Mar 12

• 448

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

upvoted a collection 5 months ago

Q-Filters

Collection

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7

upvoted a paper 5 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

upvoted an article 5 months ago

Article

FastRTC: The Real-Time Communication Library for Python

and 1 other •

Feb 25

• 172

upvoted 2 papers 6 months ago

Text2World: Benchmarking Large Language Models for Symbolic World Model Generation

Paper • 2502.13092 • Published Feb 18 • 13

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 35

upvoted a collection 6 months ago

DeepSeek-R1-abliterated

Collection

9 items • Updated May 30 • 111

upvoted a paper 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

Rishabh Singh

AI & ML interests

Recent Activity

Organizations

lulzx's activity

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Vision Language Models (Better, Faster, Stronger)

Blazingly fast whisper transcriptions with Inference Endpoints

Object Detection Leaderboard

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Open R1: Update #3

FastRTC: The Real-Time Communication Library for Python