Yauhen Yavorski

slappatuski

AI & ML interests

image generation, image-to-image, text-to-image, inpainting, and video generation

Recent Activity

upvoted an article 4 days ago

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Article • and 4 others • 28 days ago • 72

upvoted an article 12 days ago

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Article • and 2 others • Apr 15, 2024 • 186

upvoted a paper 12 days ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 73

upvoted 3 papers 14 days ago

PubTables-1M: Towards comprehensive table extraction from unstructured documents

Paper • 2110.00061 • Published Sep 30, 2021 • 2

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 4

DiT: Self-supervised Pre-training for Document Image Transformer

Paper • 2203.02378 • Published Mar 4, 2022 • 2

upvoted a paper 16 days ago

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 4

upvoted an article 17 days ago

Accelerating Document AI

Article • and 3 others • Nov 21, 2022 • 72

upvoted a paper 17 days ago

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Paper • 2109.10282 • Published Sep 21, 2021 • 7

upvoted a paper 2 months ago

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Paper • 2210.08933 • Published Oct 17, 2022 • 6

upvoted an article 2 months ago

Diffusion Language Models: The New Paradigm

Article • Jun 10 • 13

upvoted an article 4 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 879

upvoted an article 7 months ago

The Annotated Diffusion Model

Article • and 1 other • Jun 7, 2022 • 259

upvoted a collection 7 months ago

Cosmos

Collection • The collection of Cosmos models • 31 items • Updated 6 days ago • 297

upvoted a paper 8 months ago

FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors

Paper • 2501.08225 • Published Jan 14 • 19
