runtime error

Exit code: 1. Reason: odel-00004-of-00005.safetensors: 100%|██████████| 4.97G/4.97G [00:15<00:00, 313MB/s]
model-00005-of-00005.safetensors: 0%| | 0.00/2.43G [00:00<?, ?B/s]
model-00005-of-00005.safetensors: 3%|▎ | 67.9M/2.43G [00:01<00:46, 50.7MB/s]
model-00005-of-00005.safetensors: 6%|▌ | 135M/2.43G [00:02<00:41, 55.6MB/s]
model-00005-of-00005.safetensors: 9%|▉ | 226M/2.43G [00:03<00:35, 62.6MB/s]
model-00005-of-00005.safetensors: 50%|█████ | 1.22G/2.43G [00:04<00:03, 370MB/s]
model-00005-of-00005.safetensors: 86%|████████▌ | 2.09G/2.43G [00:06<00:00, 494MB/s]
model-00005-of-00005.safetensors: 100%|██████████| 2.43G/2.43G [00:06<00:00, 383MB/s]
Qwen2_5OmniToken2WavModel does not support eager attention implementation, fall back to sdpa
Loading checkpoint shards: 0%| | 0/5 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 5/5 [00:00<00:00, 376.24it/s]
generation_config.json: 0%| | 0.00/74.0 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 74.0/74.0 [00:00<00:00, 496kB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 13, in <module>
    model = Qwen2_5OmniModel.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/models/qwen2_5_omni/modeling_qwen2_5_omni.py", line 4764, in from_pretrained
    model = super().from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 272, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4519, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
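
The ValueError above is raised from accelerate's `dispatch_model`, and its message suggests that the device map in effect assigned every module to disk. The sketch below only illustrates that condition, assuming a device map is a plain dict from module names to targets (a GPU index, "cpu", or "disk"). The helper name and both sample maps are hypothetical; they are not part of accelerate or of this app.

```python
from typing import Dict, Union

# A device-map target: a GPU index such as 0, or a string such as "cpu" or "disk".
Device = Union[int, str]


def would_offload_everything_to_disk(device_map: Dict[str, Device]) -> bool:
    """Return True if every entry in the map targets "disk".

    Hypothetical helper for illustration only; it mirrors the situation the
    error message describes, not accelerate's actual implementation.
    """
    targets = set(device_map.values())
    return targets == {"disk"}


# Hypothetical device maps (the module names are made up for this example).
all_disk = {"": "disk"}
mixed = {"module_a": 0, "module_b": "cpu", "module_c": "disk"}

print(would_offload_everything_to_disk(all_disk))
print(would_offload_everything_to_disk(mixed))
```

If the map in this run did look like `all_disk`, one plausible cause is that no GPU and too little CPU memory were visible when the map was computed; the log itself does not confirm this.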
