44 54 113

Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

reacted to codelion's post with 👀 13 days ago

I recently added a recipe in ellora to improve reasoning capabilities to Gemma-3-1B using self-supervised learning. Model now shows step-by-step thinking in <think> tags before answering. Logic puzzle accuracy: 61% → 84%. 3 hours training on single GPU. 🧠 Used GRPO where model generates multiple responses and learns to prefer better reasoning. Works surprisingly well for making smaller models more transparent. 🔗 Colab: https://colab.research.google.com/github/codelion/ellora/blob/main/Ellora_Recipe_2_Reasoning_LoRA_with_Self-Rewarding_GRPO.ipynb 🤗 Model: https://huggingface.co/codelion/gemma-3-1b-it-reasoning-grpo-lora 💻 Code: https://github.com/codelion/ellora

reacted to Narsil's post with 😎 20 days ago

Me: This function is too slow. Find a faster algorithm. Cursor: Hold my beer. Me: *Slacking off with colleagues* Cursor: Ping. Me: 🤯

reacted to Akhil-Theerthala's post with ❤️ 27 days ago

I'm excited to announce that I've just released the newest versions of my Kuvera models and the expanded Personal Finance Reasoning dataset on Hugging Face! What's new: I've expanded the Personal Finance Reasoning Dataset, which now includes 18.9k samples of real-world financial questions paired with detailed, empathetic answers. The previous generation pipeline was also streamlined with better psychological context and response validations. I've also released new Kuvera models trained on this improved dataset: - Kuvera-4B & 8B: These are my upgraded non-reasoning models, fine-tuned to provide practical financial advice. I've specifically trained the 8B model to better understand the user's emotional context. - Kuvera-12B: A first experimental reasoning model focused on the query resolution. As the sole person working on this project, this release is a noticeable step forward from my previous work, offering more powerful and nuanced tools for financial AI. I am actively looking to collaborate with others who are passionate about analyzing and improving the quality of personal finance advice generated by large language models. If this sounds like you, please reach out! You can check these out on the following links: Models: - https://huggingface.co/Akhil-Theerthala/Kuvera-8B-qwen3-v0.2.1 - https://huggingface.co/Akhil-Theerthala/Kuvera-4B-unsloth-gemma3 - https://huggingface.co/Akhil-Theerthala/kuvera-12B-v0.2.0-unsloth-gemma3 Dataset: - https://huggingface.co/datasets/Akhil-Theerthala/Kuvera-PersonalFinance-V2.1 P.S. The paper on the framework used to generate these models along with the detailed evaluation of the main 8B model's responses is going to be released soon!

View all activity

Organizations

liked a dataset about 1 month ago

UCLNLP/adversarial_qa

Viewer • Updated Dec 21, 2023 • 72k • 4.07k • 41

liked 2 models 4 months ago

tiiuae/Falcon3-10B-Instruct-1.58bit

Text Generation • 3B • Updated Jan 13 • 1.18k • 20

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.4M • • 11.4k

liked a dataset 4 months ago

b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 3.27k • 478

liked a model 4 months ago

meta-llama/Llama-Prompt-Guard-2-86M

Text Classification • 0.3B • Updated Apr 29 • 19.1k • • 59

liked a model 5 months ago

zai-org/GLM-Z1-9B-0414

Text Generation • 9B • Updated Apr 28 • 1.58k • • 74

liked 2 models 6 months ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated Jul 28 • 291k • 1.31k

open-r1/OlympicCoder-32B

Text Generation • 33B • Updated Mar 17 • 1.2k • • 155

liked a dataset 6 months ago

smolagents/benchmark-v1

Viewer • Updated Mar 4 • 132 • 209 • 15

liked a model 7 months ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • 8B • Updated Apr 10 • 4.14k • • 350

liked 2 datasets 7 months ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 10.4k • 427

QuixiAI/dolphin-r1

Viewer • Updated Jan 30 • 814k • 783 • 285

liked a Space 7 months ago

Open LLM Leaderboard Results PR Opener

🧐

Update model card with leaderboard results

liked 2 models 7 months ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 3.9M • 302

mistralai/Mistral-Small-24B-Instruct-2501

24B • Updated Jul 28 • 487k • 939

liked a dataset 7 months ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 9 days ago • 228k • 31.3k • 754

liked 2 models 8 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 368k • • 12.7k

microsoft/phi-4

Text Generation • 15B • Updated Feb 24 • 1.16M • • 2.16k

liked 2 datasets 8 months ago

openai/MMMLU

Viewer • Updated Oct 16, 2024 • 393k • 11.8k • 497

openai/gsm8k

Viewer • Updated Jan 4, 2024 • 17.6k • 414k • 856

Junlin Zhou

AI & ML interests

Recent Activity

Organizations

jlzhou's activity

Open LLM Leaderboard Results PR Opener