Merlin Li
MerlinLi
AI & ML interests
NLP, Multimodal, diffusion models
Organizations
Agentic-llm
QWenX
Agent
Code-LLM
-
Nexusflow/NexusRaven-V2-13B
Text Generation • 13B • Updated • 4.41k • 469 -
ArunMoonpai/CodeLlama-SQL-13b
Question Answering • Updated • 6 • 1 -
codellama/CodeLlama-13b-Instruct-hf
Text Generation • 13B • Updated • 14.1k • 153 -
codellama/CodeLlama-7b-Instruct-hf
Text Generation • 7B • Updated • 251k • 241
Yi-LLM
models based on 01ai/Yi-6/34B
Chinese-Speech-Data
Speech-App
function-llm
dpo-datasets
text-embedding
role-play-llm
timeseries-llm
domain-specific-llm
3D-Gen
gpt4-data
Merged-LLM
text-to-image
synthetic-data
llm-structured-data
mm-lm
text-to-speech
-
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper • 2404.14700 • Published • 33 -
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper • 2306.15687 • Published -
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Paper • 2403.03100 • Published • 39 -
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper • 2404.09956 • Published • 12
llm-guard
any-to-embedding
timeseries-llm
Agentic-llm
domain-specific-llm
QWenX
3D-Gen
Agent
gpt4-data
Code-LLM
-
Nexusflow/NexusRaven-V2-13B
Text Generation • 13B • Updated • 4.41k • 469 -
ArunMoonpai/CodeLlama-SQL-13b
Question Answering • Updated • 6 • 1 -
codellama/CodeLlama-13b-Instruct-hf
Text Generation • 13B • Updated • 14.1k • 153 -
codellama/CodeLlama-7b-Instruct-hf
Text Generation • 7B • Updated • 251k • 241
Merged-LLM
Yi-LLM
models based on 01ai/Yi-6/34B
text-to-image
Chinese-Speech-Data
synthetic-data
Speech-App
llm-structured-data
function-llm
mm-lm
dpo-datasets
text-to-speech
-
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper • 2404.14700 • Published • 33 -
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper • 2306.15687 • Published -
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Paper • 2403.03100 • Published • 39 -
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper • 2404.09956 • Published • 12
text-embedding
llm-guard
role-play-llm