1 74 169

pascalmusabyimana

pascal-maker

https://pascal-maker.github.io/developedbypascalmusabyimana/

AI & ML interests

computer vision, nlp , machine learning and deeplearning

Recent Activity

liked a model about 11 hours ago

openai/gpt-oss-20b

reacted to prithivMLmods's post with 🤗 about 11 hours ago

Qwen Image – The Latest Image Generation Model🔥 Below are some samples generated using the Qwen Image Diffusion Model. Qwen-Image, a 20B MMDiT model for next-generation text-to-image generation, preserves typographic details, layout coherence, and contextual harmony with stunning accuracy. It is especially strong at creating stunning graphic posters with native text. The model is now open-source. [ 𝚀𝚠𝚎𝚗-𝙸𝚖𝚊𝚐𝚎 : https://huggingface.co/Qwen/Qwen-Image ] ⤷ Try the Qwen Image demo here: https://huggingface.co/spaces/prithivMLmods/Qwen-Image-Diffusion, https://huggingface.co/spaces/Qwen/Qwen-Image & more ... ⤷ Qwen-Image Technical Report : https://huggingface.co/papers/2508.02324 ⤷ Qwen Image [GitHub] : https://github.com/QwenLM/Qwen-Image Even more impressively, it demonstrates a strong ability to understand images. The model supports a wide range of vision-related tasks such as object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and image super-resolution. While each task is technically distinct, they can all be viewed as advanced forms of intelligent image editing driven by deep visual understanding. Collectively, these capabilities position Qwen-Image as more than just a tool for generating appealing visuals, it serves as a versatile foundation model for intelligent visual creation and transformation, seamlessly blending language, layout, and imagery. Qwen-Image uses a dual-stream MMDiT architecture with a frozen Qwen2.5-VL, VAE encoder, RMSNorm for QK-Norm, LayerNorm elsewhere, and a custom MSRoPE scheme for joint image-text positional encoding. . . . To know more about it, visit the model card of the respective model. !!

liked a Space about 11 hours ago

abidlabs/openai-gpt-oss-120b-test

View all activity

Organizations

liked a model about 11 hours ago

openai/gpt-oss-20b

Text Generation • 12B • Updated about 11 hours ago • 91.2k • • 1.74k

reacted to prithivMLmods's post with 🤗 about 11 hours ago

Post

2276

Qwen Image – The Latest Image Generation Model🔥

Below are some samples generated using the Qwen Image Diffusion Model. Qwen-Image, a 20B MMDiT model for next-generation text-to-image generation, preserves typographic details, layout coherence, and contextual harmony with stunning accuracy. It is especially strong at creating stunning graphic posters with native text. The model is now open-source. [ 𝚀𝚠𝚎𝚗-𝙸𝚖𝚊𝚐𝚎 : Qwen/Qwen-Image ]

⤷ Try the Qwen Image demo here: prithivMLmods/Qwen-Image-Diffusion, Qwen/Qwen-Image & more ...

⤷ Qwen-Image Technical Report : Qwen-Image Technical Report (2508.02324)
⤷ Qwen Image [GitHub] : https://github.com/QwenLM/Qwen-Image

Even more impressively, it demonstrates a strong ability to understand images. The model supports a wide range of vision-related tasks such as object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and image super-resolution. While each task is technically distinct, they can all be viewed as advanced forms of intelligent image editing driven by deep visual understanding. Collectively, these capabilities position Qwen-Image as more than just a tool for generating appealing visuals, it serves as a versatile foundation model for intelligent visual creation and transformation, seamlessly blending language, layout, and imagery.

Qwen-Image uses a dual-stream MMDiT architecture with a frozen Qwen2.5-VL, VAE encoder, RMSNorm for QK-Norm, LayerNorm elsewhere, and a custom MSRoPE scheme for joint image-text positional encoding.

.
.
.
To know more about it, visit the model card of the respective model. !!

liked a Space about 11 hours ago

Openai Gpt Oss 120b Test

📚

Generate text using a powerful AI model

reacted to Tonic's post with 🤗 3 days ago

Post

3025

🫡 I am the first and only one to like the French Tax Code Dataset

that's it , that's the post

find the dataset here : louisbrulenaudet/code-impots
follow : @louisbrulenaudet

2 replies

reacted to prithivMLmods's post with ❤️ 3 days ago

Post

3096

Introducing Camel-Doc-OCR-080125(v2), a document content-structure retrieval VLM designed for content extraction and summarization. This is the second model in the Camel Doc OCR VLM series, following Camel-Doc-OCR-062825(v1). The new version fixes formal table reconstruction issues in both en and zh language, achieving optimal performance for long-context inferences.🤗🐪

⤷ Camel-Doc-OCR(v2) : prithivMLmods/Camel-Doc-OCR-080125
⤷ Camel-Doc-OCR(v1) : prithivMLmods/Camel-Doc-OCR-062825
⤷ Demo : prithivMLmods/core-OCR

Multimodal Model Collections and Spaces:

➝ Camel-Doc-OCR : prithivMLmods/camel-doc-ocr-080125-688c0c61c5dba648756f31f8
➝ Vision-Language (VLr) : prithivMLmods/vision-language-for-reasoning-vlr-6889b3f45917352b5e3a6f7a
➝ Multimodal Spaces : prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
➝ Multimodal VLMs : prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027

.
.
.
To know more about it, visit the model card of the respective model. !!

2 replies

reacted to AdinaY's post with 🚀 6 days ago

Post

1701

Qwen just released Qwen3-30B-A3B-Instruct-2507 🔥 an upgrade to the non-thinking mode model

Qwen/Qwen3-30B-A3B-Instruct-2507

✨ 30B MoE / 3.3B active - Apache 2.0
✨ Strong gains in reasoning, math, coding, & multilingual tasks
✨ Native support for 256K long-context inputs

liked a Space 6 days ago

407

Qwen3 Coder WebDev

🌍

Generate web application code from descriptions

liked a Space 9 days ago

GLM 4.5 Demo (API)

🏃

Chat with GLM-4.5 to get answers and reasoning

upvoted a collection 9 days ago

GLM-4.5

Collection

10 items • Updated 2 days ago • 189

liked a model 10 days ago

mlx-community/Qwen3-235B-A22B-Thinking-2507-3bit-DWQ

Text Generation • 235B • Updated 11 days ago • 559 • 5

liked a model 11 days ago

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated 2 days ago • 14.5k • 173

liked a Space 13 days ago

105

Addit

⚡

Add objects to images using text prompts

liked a model 13 days ago

Kwaipilot/KAT-V1-40B

Text Generation • 41B • Updated 16 days ago • 833 • 104

reacted to AdinaY's post with 👍 13 days ago

Post

2671

KAT-V1 🔥 a LLM that tackles overthinking by switching between reasoning and direct answers, by Kuaishou.

Kwaipilot/KAT-V1-40B

✨ 40B
✨ Step-SRPO: smarter reasoning control via RL
✨ MTP + Distillation: efficient training, lower cost