Chunked inference?
#6
by
ponceoscarj
- opened
Is there a way to do chunked inference with canary-qwen-2.5b?
This link works for canary but could not implement it for canary-qwen-2.5b - https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/aed/speech_to_text_aed_chunked_infer.py
ponceoscarj
changed discussion title from
Chunked inference
to Chunked inference?
Yes, there is, please refer to this code in the spaces demo:
https://huggingface.co/spaces/nvidia/canary-qwen-2.5b/blob/main/app.py#L30-L66
No ETA yet but this feature will also eventually land in NeMo speechlm2 collection.