Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
44
54
113
Junlin Zhou
jlzhou
Follow
lizhijin's profile picture
fblgit's profile picture
reubenrouse's profile picture
5 followers
·
43 following
edwardzjl
AI & ML interests
None yet
Recent Activity
reacted
to
codelion
's
post
with 👀
13 days ago
I recently added a recipe in ellora to improve reasoning capabilities to Gemma-3-1B using self-supervised learning. Model now shows step-by-step thinking in <think> tags before answering. Logic puzzle accuracy: 61% → 84%. 3 hours training on single GPU. 🧠 Used GRPO where model generates multiple responses and learns to prefer better reasoning. Works surprisingly well for making smaller models more transparent. 🔗 Colab: https://colab.research.google.com/github/codelion/ellora/blob/main/Ellora_Recipe_2_Reasoning_LoRA_with_Self-Rewarding_GRPO.ipynb 🤗 Model: https://huggingface.co/codelion/gemma-3-1b-it-reasoning-grpo-lora 💻 Code: https://github.com/codelion/ellora
reacted
to
Narsil
's
post
with 😎
20 days ago
Me: This function is too slow. Find a faster algorithm. Cursor: Hold my beer. Me: *Slacking off with colleagues* Cursor: Ping. Me: 🤯
reacted
to
Akhil-Theerthala
's
post
with ❤️
27 days ago
I'm excited to announce that I've just released the newest versions of my Kuvera models and the expanded Personal Finance Reasoning dataset on Hugging Face! What's new: I've expanded the Personal Finance Reasoning Dataset, which now includes 18.9k samples of real-world financial questions paired with detailed, empathetic answers. The previous generation pipeline was also streamlined with better psychological context and response validations. I've also released new Kuvera models trained on this improved dataset: - Kuvera-4B & 8B: These are my upgraded non-reasoning models, fine-tuned to provide practical financial advice. I've specifically trained the 8B model to better understand the user's emotional context. - Kuvera-12B: A first experimental reasoning model focused on the query resolution. As the sole person working on this project, this release is a noticeable step forward from my previous work, offering more powerful and nuanced tools for financial AI. I am actively looking to collaborate with others who are passionate about analyzing and improving the quality of personal finance advice generated by large language models. If this sounds like you, please reach out! You can check these out on the following links: Models: - https://huggingface.co/Akhil-Theerthala/Kuvera-8B-qwen3-v0.2.1 - https://huggingface.co/Akhil-Theerthala/Kuvera-4B-unsloth-gemma3 - https://huggingface.co/Akhil-Theerthala/kuvera-12B-v0.2.0-unsloth-gemma3 Dataset: - https://huggingface.co/datasets/Akhil-Theerthala/Kuvera-PersonalFinance-V2.1 P.S. The paper on the framework used to generate these models along with the detailed evaluation of the main 8B model's responses is going to be released soon!
View all activity
Organizations
jlzhou
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
about 1 month ago
UCLNLP/adversarial_qa
Viewer
•
Updated
Dec 21, 2023
•
72k
•
4.07k
•
41
liked
2 models
4 months ago
tiiuae/Falcon3-10B-Instruct-1.58bit
Text Generation
•
3B
•
Updated
Jan 13
•
1.18k
•
20
black-forest-labs/FLUX.1-dev
Text-to-Image
•
Updated
Jun 27
•
1.4M
•
•
11.4k
liked
a dataset
4 months ago
b-mc2/sql-create-context
Viewer
•
Updated
Jan 25, 2024
•
78.6k
•
3.27k
•
478
liked
a model
4 months ago
meta-llama/Llama-Prompt-Guard-2-86M
Text Classification
•
0.3B
•
Updated
Apr 29
•
19.1k
•
•
59
liked
a model
5 months ago
zai-org/GLM-Z1-9B-0414
Text Generation
•
9B
•
Updated
Apr 28
•
1.58k
•
•
74
liked
2 models
6 months ago
mistralai/Mistral-Small-3.1-24B-Instruct-2503
24B
•
Updated
Jul 28
•
291k
•
1.31k
open-r1/OlympicCoder-32B
Text Generation
•
33B
•
Updated
Mar 17
•
1.2k
•
•
155
liked
a dataset
6 months ago
smolagents/benchmark-v1
Viewer
•
Updated
Mar 4
•
132
•
209
•
15
liked
a model
7 months ago
NousResearch/DeepHermes-3-Llama-3-8B-Preview
Text Generation
•
8B
•
Updated
Apr 10
•
4.14k
•
•
350
liked
2 datasets
7 months ago
nvidia/HelpSteer2
Viewer
•
Updated
Dec 18, 2024
•
21.4k
•
10.4k
•
427
QuixiAI/dolphin-r1
Viewer
•
Updated
Jan 30
•
814k
•
783
•
285
liked
a Space
7 months ago
Running
53
53
Open LLM Leaderboard Results PR Opener
🧐
Update model card with leaderboard results
liked
2 models
7 months ago
Qwen/Qwen2.5-3B-Instruct
Text Generation
•
3B
•
Updated
Sep 25, 2024
•
3.9M
•
302
mistralai/Mistral-Small-24B-Instruct-2501
24B
•
Updated
Jul 28
•
487k
•
939
liked
a dataset
7 months ago
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
9 days ago
•
228k
•
31.3k
•
754
liked
2 models
8 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
685B
•
Updated
Mar 27
•
368k
•
•
12.7k
microsoft/phi-4
Text Generation
•
15B
•
Updated
Feb 24
•
1.16M
•
•
2.16k
liked
2 datasets
8 months ago
openai/MMMLU
Viewer
•
Updated
Oct 16, 2024
•
393k
•
11.8k
•
497
openai/gsm8k
Viewer
•
Updated
Jan 4, 2024
•
17.6k
•
414k
•
856
Load more