pascalmusabyimana's picture

pascalmusabyimana

pascal-maker

·

https://pascal-maker.github.io/developedbypascalmusabyimana/

AI & ML interests

computer vision, nlp , machine learning and deeplearning

Recent Activity

liked a model 6 days ago

deepseek-ai/DeepSeek-V3.1

liked a model 6 days ago

deepseek-ai/DeepSeek-V3.1-Base

reacted to prithivMLmods's post with ❤️ 10 days ago

Excited to introduce the Tiny VLMs Lab App for experiencing 15+ multimodal VLMs, ranging from a 250M parameter model to a 4B parameter model, for tasks like OCR, reasoning, small models for single-shot answering, and captioning (abliterated), across a broad range of visual categories including images with complex, sensitive, or nuanced content, while handling varying aspect ratios and resolutions.🧪 🤗 Space/App: https://huggingface.co/spaces/prithivMLmods/Tiny-VLMs-Lab ✦︎ Also introducing https://huggingface.co/prithivMLmods/Qwen2.5-VL-3B-Abliterated-Caption-it, tailored for Abliterated Captioning / Uncensored Image Captioning. This release comes as a lighter alternative to the existing Qwen2.5-VL-7B-Abliterated-Caption-it https://huggingface.co/prithivMLmods/Qwen2.5-VL-7B-Abliterated-Caption-it model, making it usable on mid-range GPUs and even experimental on T4 GPUs. ✦︎ Collection: https://huggingface.co/collections/prithivMLmods/vl-abliterated-caption-68a0443b63182e97a15c47a3 ✦︎ GitHub: https://github.com/PRITHIVSAKTHIUR/Tiny-VLMs-Lab . . . To know more about it, visit the app page or the respective model page!!

View all activity

Organizations

upvoted a collection 13 days ago

qqWen-Series

Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 11 items • Updated about 1 hour ago • 9

upvoted 4 collections about 1 month ago

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 17 days ago • 221

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 13 days ago • 42

🚀 Optimized Models: torchao & Pruna Quantization

Quantized Models using torchao & Pruna for efficient inference and deployment. • 8 items • Updated 21 days ago • 1

Releases July 4

25 items • Updated Jul 7 • 7

upvoted a paper about 2 months ago

MedGemma Technical Report

Paper • 2507.05201 • Published Jul 7 • 14

upvoted 2 articles about 2 months ago

Article

Creating custom kernels for the AMD MI300

By

and 1 other •

Jul 9

• 43

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

Jul 8

• 640

upvoted 2 collections about 2 months ago

🍉 June 2025 - Open works from the Chinese community

29 items • Updated about 1 month ago • 7

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 159

upvoted a paper about 2 months ago

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 69

upvoted 2 articles 2 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

Jun 26

• 115

Article

Transformers backend integration in SGLang

By

and 4 others •

Jun 23

• 53

upvoted a collection 2 months ago

Releases June 13

40 items • Updated Jun 18 • 5

upvoted an article 2 months ago

Article

MCP is at a Tipping Point: Here's Why You Should Care

By

•

Jun 10

• 17

upvoted a collection 2 months ago

Qwen3

84 items • Updated 22 days ago • 1.15k

upvoted an article 2 months ago

Article

Sensitivity Aware Mixed Precision Quantization V1

By

and 1 other •

Jun 13

• 19

upvoted an article 3 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

Jun 3

• 84

upvoted 2 collections 3 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 160

Any-to-Any Models, Datasets, Spaces

18 items • Updated Jun 20 • 24