Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alexey Gritsenko's picture
5 1

Alexey Gritsenko

AlexeyG
shuyuej's profile picture
·
  • AlexeyG

AI & ML interests

None yet

Organizations

Google's profile picture

authored a paper 6 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146
authored a paper 9 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134
authored 2 papers about 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 73

VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling

Paper • 2112.05692 • Published Dec 10, 2021
authored a paper almost 2 years ago

SCENIC: A JAX Library for Computer Vision Research and Beyond

Paper • 2110.11403 • Published Oct 18, 2021
authored 4 papers about 2 years ago

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Paper • 2307.06304 • Published Jul 12, 2023 • 31

Simple Open-Vocabulary Object Detection with Vision Transformers

Paper • 2205.06230 • Published May 12, 2022 • 2

Video Diffusion Models

Paper • 2204.03458 • Published Apr 7, 2022 • 5

Scaling Open-Vocabulary Object Detection

Paper • 2306.09683 • Published Jun 16, 2023 • 13
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs