-
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
Paper • 2401.10529 • Published • 1 -
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18 -
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
Paper • 2311.06783 • Published • 28 -
SVIT: Scaling up Visual Instruction Tuning
Paper • 2307.04087 • Published • 7
Sulabh
sulabh-research
AI & ML interests
None yet
Organizations
None yet
LLM Datasets
-
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Paper • 2401.06532 • Published • 12 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper • 2309.00267 • Published • 52 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 16
Long Context
small_models
LLMs
-
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 57 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 24 -
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper • 2401.06066 • Published • 56
Datasets
Multilingual LLMs
-
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
Paper • 2309.11674 • Published • 32 -
Extrapolating Large Language Models to Non-English by Aligning Languages
Paper • 2308.04948 • Published • 1 -
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Paper • 2401.08417 • Published • 37
PEFT
RAG
embeddings
MM Datasets
-
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
Paper • 2401.10529 • Published • 1 -
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18 -
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
Paper • 2311.06783 • Published • 28 -
SVIT: Scaling up Visual Instruction Tuning
Paper • 2307.04087 • Published • 7
Datasets
LLM Datasets
-
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Paper • 2401.06532 • Published • 12 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper • 2309.00267 • Published • 52 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 16
Multilingual LLMs
-
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
Paper • 2309.11674 • Published • 32 -
Extrapolating Large Language Models to Non-English by Aligning Languages
Paper • 2308.04948 • Published • 1 -
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Paper • 2401.08417 • Published • 37
Long Context
PEFT
small_models
RAG
LLMs
-
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 57 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 24 -
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper • 2401.06066 • Published • 56
embeddings