Chunked inference?

#6
by ponceoscarj - opened

Is there a way to do chunked inference with canary-qwen-2.5b?
This link works for canary but could not implement it for canary-qwen-2.5b - https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/aed/speech_to_text_aed_chunked_infer.py

ponceoscarj changed discussion title from Chunked inference to Chunked inference?

Yes, there is, please refer to this code in the spaces demo:
https://huggingface.co/spaces/nvidia/canary-qwen-2.5b/blob/main/app.py#L30-L66

No ETA yet but this feature will also eventually land in NeMo speechlm2 collection.

Sign up or log in to comment