-
Can Large Language Models Understand Context?
Paper β’ 2402.00858 β’ Published β’ 24 -
OLMo: Accelerating the Science of Language Models
Paper β’ 2402.00838 β’ Published β’ 84 -
Self-Rewarding Language Models
Paper β’ 2401.10020 β’ Published β’ 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper β’ 2401.17072 β’ Published β’ 24
Collections
Discover the best community collections!
Collections including paper arxiv:2412.08486
-
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 37 -
franciszzj/Leffa
Image-to-Image β’ Updated β’ 335 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 30 -
56
TryOffDiff
π₯Extract garment images from everyday images!
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper β’ 2401.09048 β’ Published β’ 10 -
Improving fine-grained understanding in image-text pre-training
Paper β’ 2401.09865 β’ Published β’ 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper β’ 2401.10891 β’ Published β’ 63 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper β’ 2401.13627 β’ Published β’ 77
-
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models
Paper β’ 2412.09622 β’ Published β’ 8 -
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Paper β’ 2412.04146 β’ Published β’ 23 -
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 37 -
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
Paper β’ 2412.05148 β’ Published β’ 12
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper β’ 2405.08748 β’ Published β’ 25 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper β’ 2405.10300 β’ Published β’ 31 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper β’ 2405.09818 β’ Published β’ 131 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper β’ 2405.11143 β’ Published β’ 39
-
Can Large Language Models Understand Context?
Paper β’ 2402.00858 β’ Published β’ 24 -
OLMo: Accelerating the Science of Language Models
Paper β’ 2402.00838 β’ Published β’ 84 -
Self-Rewarding Language Models
Paper β’ 2401.10020 β’ Published β’ 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper β’ 2401.17072 β’ Published β’ 24
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper β’ 2401.09048 β’ Published β’ 10 -
Improving fine-grained understanding in image-text pre-training
Paper β’ 2401.09865 β’ Published β’ 18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper β’ 2401.10891 β’ Published β’ 63 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper β’ 2401.13627 β’ Published β’ 77
-
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 37 -
franciszzj/Leffa
Image-to-Image β’ Updated β’ 335 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 30 -
56
TryOffDiff
π₯Extract garment images from everyday images!
-
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models
Paper β’ 2412.09622 β’ Published β’ 8 -
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Paper β’ 2412.04146 β’ Published β’ 23 -
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 37 -
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
Paper β’ 2412.05148 β’ Published β’ 12
-
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper β’ 2405.08748 β’ Published β’ 25 -
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
Paper β’ 2405.10300 β’ Published β’ 31 -
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Paper β’ 2405.09818 β’ Published β’ 131 -
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Paper β’ 2405.11143 β’ Published β’ 39