KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 Paper • 2505.13036 • Published May 19
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition Paper • 2506.04635 • Published Jun 5
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging Paper • 1908.02404 • Published Aug 7, 2019
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models Paper • 2010.00198 • Published Oct 1, 2020