nvidia/canary-qwen-2.5b · Chunked inference?


#6

by ponceoscarj - opened 15 days ago


Is there a way to do chunked inference with canary-qwen-2.5b?
The script linked below works for Canary, but I could not adapt it to canary-qwen-2.5b: https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/aed/speech_to_text_aed_chunked_infer.py

ponceoscarj changed discussion title from Chunked inference to Chunked inference? 15 days ago

NVIDIA org 13 days ago

Yes, there is. Please refer to this code in the Spaces demo:
https://huggingface.co/spaces/nvidia/canary-qwen-2.5b/blob/main/app.py#L30-L66

There is no ETA yet, but this feature will eventually land in the NeMo speechlm2 collection as well.
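
For readers who want the general shape of chunked inference before opening the demo, here is a minimal sketch. It is not the demo's code and does not call any NeMo API. `split_into_chunks`, `transcribe_long_audio`, and the `transcribe_chunk` callback are hypothetical names, and the 40-second default is an arbitrary placeholder. The idea is simply to cut the waveform into fixed-length pieces, transcribe each piece independently, and join the text.

```python
from typing import Callable, List, Sequence


def split_into_chunks(
    samples: Sequence[float], sample_rate: int, chunk_seconds: float = 40.0
) -> List[Sequence[float]]:
    """Cut a mono waveform into consecutive, non-overlapping chunks."""
    if sample_rate <= 0 or chunk_seconds <= 0:
        raise ValueError("sample_rate and chunk_seconds must be positive")
    # Number of samples per chunk; at least 1 so the range step is valid.
    chunk_len = max(1, int(chunk_seconds * sample_rate))
    return [samples[i:i + chunk_len] for i in range(0, len(samples), chunk_len)]


def transcribe_long_audio(
    samples: Sequence[float],
    sample_rate: int,
    transcribe_chunk: Callable[[Sequence[float], int], str],
    chunk_seconds: float = 40.0,
) -> str:
    """Transcribe each chunk with a user-supplied callback and join the text.

    `transcribe_chunk` is a placeholder (an assumption of this sketch): wrap
    whatever per-utterance model call you use so that it takes
    (chunk, sample_rate) and returns a string.
    """
    texts = [
        transcribe_chunk(chunk, sample_rate).strip()
        for chunk in split_into_chunks(samples, sample_rate, chunk_seconds)
    ]
    # Drop empty results (e.g. silent chunks) before joining.
    return " ".join(t for t in texts if t)
```

Hard cuts like this can split a word at a chunk boundary, so overlapping chunks or cutting at silences may give cleaner results. The linked `app.py` shows how the demo itself handles this.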
