
Lê Võ Quyết Thắng PRO

thangvip

https://vualidon.github.io/

AI & ML interests

Adapting LLMs to specific domains

Recent Activity

updated a model 6 days ago

thangvip/qwen3-1.7b-legal-pretrain-synthetic-8k

published a model 9 days ago

thangvip/qwen3-1.7b-legal-pretrain-synthetic-8k

updated a model 10 days ago

thangvip/qwen3-4b-legal-pretrain-synthetic-8k



upvoted 3 papers about 1 month ago

upvoted 2 papers 3 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27 • 71

upvoted an article 4 months ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • May 12 • 522

upvoted a paper 5 months ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 91

upvoted an article 5 months ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Article • Mar 26 • 161

upvoted an article 6 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

upvoted 2 articles 7 months ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • and 2 others • Jan 23 • 182

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 879

upvoted 2 papers 8 months ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 94

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 96

upvoted 3 papers 9 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 132

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted a collection 9 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 88

upvoted a paper 9 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 66

upvoted 2 papers 10 months ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 110

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 39
