marlow deneal PRO
marlow
·
AI & ML interests
art
Recent Activity
Liked a Space 3 days ago: google/mood-palette
Updated a collection 5 days ago: Fe2e
Upvoted a paper 5 days ago: Visual Representation Alignment for Multimodal Large Language Models
Organizations
None yet
Fast 4 steps Wan 2.2 I2V (14B) with Lightning LoRA
GMPO
Fill outpaint image extender
- Flux Fill Outpainting (Runtime error • 44): Extend images using AI to change size and alignment
- FlexPainter: Flexible and Multi-View Consistent Texture Generation
  Paper • 2506.02620 • Published • 14
- LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
  Paper • 2507.02813 • Published • 60
- UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding
  Paper • 2506.23219 • Published • 7
Musicgen
Text to video
- FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
  Paper • 2311.13073 • Published • 58
- Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
  Paper • 2403.03206 • Published • 68
- Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
  Paper • 2406.06525 • Published • 71
Ai coders
Ultra3D: Efficient and High-Fidelity 3D Part Attention
Diffuman4d
Hunyuan3D-2:scaling diffusion models
GPT-4V in Wonderland
Mesh/anything
- MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
  Paper • 2406.10163 • Published • 33
- SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
  Paper • 2408.00653 • Published • 32
- ControlNeXt: Powerful and Efficient Control for Image and Video Generation
  Paper • 2408.06070 • Published • 54
- TacSL: A Library for Visuotactile Sensor Simulation and Learning
  Paper • 2408.06506 • Published • 7