File size: 1,602 Bytes
67d6834 71e7460 67d6834 418519b 67d6834 2652b5a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
This is the SATE MVP, integrate the all the pipelines into one framework.
Contain the main entrance, build for docker.
main.py:
Input: Entire audio file
Output: Transcription with annotation
Preprocess:
Segmentation + speaker diarization -> crisper whisper transcriptions for each segmentation
P.S. Should keep transcript consist in each pipelines.
IMAGE CREATION:
docker build -t sate_0.11 .
(New) HOW TO USE after image created:
docker run --gpus all -it --rm \
-v /home/easgrad/shuweiho/workspace/volen/SATE_docker_test/input:/sate/input \
-v /home/easgrad/shuweiho/workspace/volen/SATE_docker_test/session_data:/sate/session_data \
-p 7860:7860 \
sate_0.11
curl -X POST http://localhost:7860/process \
-F "audio_file=@/home/easgrad/shuweiho/workspace/volen/SATE_docker_test/input/454.mp3" \
-F "device=cuda" \
-F "pause_threshold=0.25"
(Old - don't follow it) HOW TO USE after image created:
docker run --gpus all -it --rm \
-v /home/easgrad/shuweiho/workspace/volen/SATE_docker_test/input:/sate/input \
-v /home/easgrad/shuweiho/workspace/volen/SATE_docker_test/session_data:/sate/session_data \
-p 5000:5000 \
sate_0.10
curl -X POST http://localhost:5000/process \
-H "Content-Type: application/json" \
-d '{
"input_audio_file": "/sate/input/454.mp3",
"device": "cuda",
"pause_threshold": 0.5
}'
Test on HF space:
curl -X POST https://Sven33-SATE.hf.space/process -F "audio_file=@/home/easgrad/shuweiho/workspace/volen/SATE_docker_test/input/454.mp3" -F "device=cuda" -F "pause_threshold=0.25" |