Spaces:

MohamedRashad
/

Voxtral

Running on Zero

App Files Files Community

multimodalart HF Staff commited on 29 days ago

Commit

18832a9

verified ·

1 Parent(s): 29f0e6f

feat: Enable MCP

Browse files

Hello! This is an automated PR adding MCP compatibility to your AI App 🤖.

![image.png](https://cdn-uploads.huggingface.co/production/uploads/624bebf604abc7ebb01789af/HQQK38I_MDXLDMYDYBq8H.png)This PR introduces two improvements:
1. Adds docstrings to the functions in the app file that are directly connected to the Gradio UI, for the downstream LLM to use.
2. Enables the Model-Compute-Platform by adding `mcp_server=True` to the `.launch()` call.

No other logic has been changed. Please review and merge if it looks good!Learn more about MCP compatibility in Spaces here: https://huggingface.co/changelog/add-compatible-spaces-to-your-mcp-tools

Files changed (1) hide show

app.py +16 -2

app.py CHANGED Viewed

@@ -29,7 +29,21 @@ LANGUAGES = {
 @spaces.GPU()
 def process_audio(audio_path, model_name, lang_name, max_tokens=500):
-    """Process audio with selected Voxtral model and return the generated response"""
     if not audio_path:
         return "Please upload an audio file."
@@ -111,4 +125,4 @@ with gr.Blocks(title="Voxtral Demo") as demo:
 # Launch the app
 if __name__ == "__main__":
-    demo.queue().launch(share=False, ssr_mode=False)

 @spaces.GPU()
 def process_audio(audio_path, model_name, lang_name, max_tokens=500):
+    """Process audio with selected Voxtral model and return the generated response.
+    This function takes an audio file and processes it using the selected Voxtral model
+    to generate a transcription in the specified language.
+    Args:
+        audio_path: Path to the audio file to be transcribed.
+        model_name: Name of the Voxtral model to use ("Voxtral Mini (3B)" or "Voxtral Small (24B)").
+        lang_name: Name of the language for transcription (e.g., "English", "French", etc.).
+        max_tokens: Maximum number of tokens to generate in the output (default: 500).
+    Returns:
+        String containing the transcribed text from the audio file, or an error message
+        if the audio file is missing or an invalid model is selected.
+    """
     if not audio_path:
         return "Please upload an audio file."
 # Launch the app
 if __name__ == "__main__":
+    demo.queue().launch(share=False, ssr_mode=False, mcp_server=True)