MAIR Lab

university

mair-lab

AI & ML interests

None defined yet.

Recent Activity

BAJUKA authored a paper 14 days ago

CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics

oscmansan authored a paper 24 days ago

Controlling Multimodal LLMs via Reward-guided Decoding

BAJUKA updated a dataset 28 days ago

mair-lab/CulturalFrames

mair-lab's collections 5

WebMMU

WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation

McGill-NLP/WebMMU

Viewer • Updated Jun 10 • 4.24k • 132 • 1

CulturalVQA

Benchmarking Vision Language Models for Cultural Understanding

CulturalVQABench 🚀 (Running)

Fetch and display competition information and leaderboard
mair-lab/CulturalVQA

Viewer • Updated Feb 17 • 2.37k • 23 • 6

VisMin

VisMin (visual minimal-change) is a controlled benchmark, released together with models fine-tuned on the VisMin training set, e.g. VisMin-CLIP and VisMin-Idefics2.

mair-lab/vismin

Viewer • Updated Nov 28, 2024 • 114k • 277 • 3
mair-lab/vismin-clip-vit-large

Updated Aug 9, 2024
mair-lab/vismin-idefics2-8b

Updated Aug 9, 2024
mair-lab/vismin-bench

Viewer • Updated Jan 22 • 2.08k • 122

CTRL-O

CTRL-O: Language-Controllable Object-Centric Visual Representation Learning

adidolkar123/visual_genome_coco

Updated Oct 30, 2024
adidolkar123/visual_genome

Updated Nov 5, 2024
adidolkar123/pretrained_coco_vgcoco

Updated Jun 10 • 3

EARL

Official artifacts for the paper "The Promise of RL for Autoregressive Image Editing" (EARL).

mair-lab/sft-simple

8B • Updated Aug 8 • 7
mair-lab/sft-simple.rl-simple-n-complex

Updated Aug 8
mair-lab/earl-datasets

Updated Aug 10 • 254
mair-lab/thinking-sft-simple

8B • Updated Aug 8 • 5

Team members 8
