Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

deepseek-ai
/
DeepSeek-V2-Chat-0628

Text Generation
Transformers
Safetensors
deepseek_v2
conversational
custom_code
text-generation-inference
Model card Files Files and versions
xet
Community
5
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

MLA的实现代码

#5 opened 11 months ago by
yuyijiong

What is the FSDP value for `fsdp_transformer_layer_cls_to_wrap`?

#4 opened about 1 year ago by
migtissera

模型启动依赖问题

#3 opened about 1 year ago by
malowking

different between DeepSeek-V2-Chat-0628 and Deepseek-v2-API-0628

1
#2 opened about 1 year ago by
xxllp
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs