🚀 HF Spaces Deployment Guide

Quick Deployment Steps

1. Create Hugging Face Space

Go to Hugging Face Spaces
Click "Create new Space"
Fill in the details:
- Space name: minicpm-video-analyzer (or your choice)
- License: Apache 2.0
- SDK: Gradio
- Hardware: GPU (T4 or better) - Required for MiniCPM-o
- Visibility: Public or Private (your choice)

2. Upload Files

Upload these files to your space:

app.py - Main application
requirements.txt - Dependencies
README.md - Documentation
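
The upload can also be scripted instead of done through the web UI. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and you are already authenticated. The helper names `missing_files` and `upload_space_files` are hypothetical, and the repo id is a placeholder you would replace with your own.

```python
from pathlib import Path

# Files this guide asks you to upload.
REQUIRED_FILES = ("app.py", "requirements.txt", "README.md")


def missing_files(folder):
    """Return the required files that are not present in `folder`."""
    root = Path(folder)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]


def upload_space_files(folder, repo_id):
    """Upload the required files to an existing Space (needs network and auth)."""
    # Imported lazily so the local check above works without huggingface_hub.
    from huggingface_hub import HfApi

    missing = missing_files(folder)
    if missing:
        raise FileNotFoundError(f"Missing before upload: {missing}")

    api = HfApi()
    for name in REQUIRED_FILES:
        api.upload_file(
            path_or_fileobj=str(Path(folder) / name),
            path_in_repo=name,
            repo_id=repo_id,  # placeholder, e.g. "your-username/minicpm-video-analyzer"
            repo_type="space",
        )
```

Running `missing_files(".")` first is a cheap way to catch a forgotten file before the Space starts building.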

3. Configure Hardware (Important!)

With your HF Pro account:

Go to your space settings
Select "Hardware"
Choose "T4 small" or better GPU
Set timeout to 30+ minutes for processing
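
The same settings can be requested programmatically. This is a sketch only: it assumes that the "timeout" above corresponds to the Space's sleep time, that `api` is a `huggingface_hub.HfApi` instance, and that `"t4-small"` is the hardware identifier for the "T4 small" option. `configure_gpu` is a hypothetical helper name.

```python
def configure_gpu(api, repo_id, hardware="t4-small", timeout_minutes=30):
    """Request GPU hardware and a sleep timeout for a Space.

    `api` is expected to be a huggingface_hub.HfApi instance.
    Assumption: the guide's "timeout" maps to the Space sleep time.
    """
    sleep_seconds = timeout_minutes * 60  # the Hub API expects seconds
    return api.request_space_hardware(
        repo_id=repo_id,
        hardware=hardware,
        sleep_time=sleep_seconds,
    )
```

Passing the API object in as a parameter keeps the helper easy to try out with a stub before pointing it at a real, billed Space.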

4. Deploy & Test

Your space will automatically build (takes 5-10 minutes)
First model load will download ~8GB (takes 5-10 minutes)
Test with a short video (15-30 seconds)

Hardware Recommendations

GPU	VRAM	Performance	Cost/hour
T4	16GB	Good	$0.60
A10G	24GB	Better	$3.15
A100	40GB	Best	$4.13

For testing: T4 is sufficient and cost-effective. Hourly prices change over time, so confirm them in the hardware settings before choosing.
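
To get a feel for what a session costs, the hourly prices in the table can be turned into a per-session estimate. A small sketch, using the table's figures as-is (they may be out of date); `estimate_cost` is a hypothetical helper.

```python
# Hourly prices copied from the table above (USD); they may be out of date.
PRICE_PER_HOUR = {"T4": 0.60, "A10G": 3.15, "A100": 4.13}


def estimate_cost(gpu, minutes):
    """Estimated cost in USD of keeping `gpu` running for `minutes`."""
    return round(PRICE_PER_HOUR[gpu] * minutes / 60, 2)


for gpu in PRICE_PER_HOUR:
    print(f"{gpu}: ${estimate_cost(gpu, 30):.2f} for 30 minutes")
```

Note that a Space is billed while it is running, not only while a video is being processed, so the relevant duration is the whole time the GPU stays on.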

Expected Performance

Model Loading: 5-10 minutes (first time only)
30-second video: 5-15 minutes processing
Memory Usage: ~8-12GB VRAM
Sampling rate: 1 frame analyzed per second of video
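
These figures imply a rough per-frame budget. The sketch below works it out, assuming processing time is dominated by per-frame analysis and that one frame is sampled per second of video; both function names are hypothetical.

```python
def frame_timestamps(duration_s, fps=1.0):
    """Timestamps (seconds) of the frames sampled at `fps` frames per second."""
    count = int(duration_s * fps)
    return [i / fps for i in range(count)]


def seconds_per_frame(total_minutes, duration_s, fps=1.0):
    """Average processing time per sampled frame."""
    return total_minutes * 60 / len(frame_timestamps(duration_s, fps))


frames = frame_timestamps(30)       # 30 frames for a 30-second video
low = seconds_per_frame(5, 30)      # 10.0 seconds per frame
high = seconds_per_frame(15, 30)    # 30.0 seconds per frame
```

So a 30-second clip at the stated speeds works out to roughly 10 to 30 seconds of processing per sampled frame.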

Troubleshooting

Common Issues:

Out of Memory:
- Upgrade to larger GPU (A10G recommended)
- Reduce video length/resolution
Model Loading Fails:
- Check internet connection
- Restart the space
- Ensure GPU is selected
Slow Processing:
- Normal for first run (model download)
- Subsequent runs should be faster
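
When diagnosing memory or loading problems, it can help to confirm what GPU the process actually sees. A minimal sketch, assuming PyTorch is among the Space's dependencies (this guide does not list the contents of `requirements.txt`); `gpu_report` is a hypothetical helper.

```python
def gpu_report():
    """Describe the GPU visible to this process, or explain why none is."""
    try:
        import torch  # assumption: installed as a dependency of the app
    except ImportError:
        return "torch is not installed"
    if not torch.cuda.is_available():
        return "No CUDA GPU visible: check the Space's hardware setting"
    free_bytes, total_bytes = torch.cuda.mem_get_info()
    name = torch.cuda.get_device_name(0)
    gib = 1024 ** 3
    return f"{name}: {free_bytes / gib:.1f} GiB free of {total_bytes / gib:.1f} GiB"


print(gpu_report())
```

Logging this line at startup makes it visible in the Space logs whether the expected GPU was allocated and how much memory is free before the model loads.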

Cost Optimization:

Development: Use CPU for testing UI (no model loading)
Production: Use T4 GPU for actual analysis
Pause: Turn off GPU when not in use
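
Pausing can be scripted so it is not forgotten after a test session. A sketch, assuming `api` is a `huggingface_hub.HfApi` instance authenticated as the Space owner; `pause_when_done` is a hypothetical helper.

```python
def pause_when_done(api, repo_id):
    """Pause a Space so its GPU stops being billed.

    `api` is expected to be a huggingface_hub.HfApi instance.
    Returns the stage reported by the Hub after the request.
    """
    runtime = api.pause_space(repo_id=repo_id)
    return runtime.stage
```

A paused Space can later be brought back with `api.restart_space(repo_id=...)`.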

Comparison Testing

Once deployed, you can:

Test the same videos on both systems (this Space and your existing Node.js GPT-4o system)
Compare analysis quality
Measure processing times
Evaluate cost differences
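
A small harness keeps the timing comparison consistent across systems. This is a sketch under stated assumptions: each system is wrapped in a Python callable that takes a video path and returns its analysis. How you call the Space or the other system is up to you, and `time_call` and `compare` are hypothetical names.

```python
import time


def time_call(fn, *args):
    """Run fn(*args) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


def compare(analyzers, video_paths):
    """Time each analyzer on each video.

    `analyzers` maps a label to a callable taking a video path.
    Returns a list of rows: (video, label, elapsed_seconds, result).
    """
    rows = []
    for video in video_paths:
        for label, analyze in analyzers.items():
            result, elapsed = time_call(analyze, video)
            rows.append((video, label, elapsed, result))
    return rows
```

Keeping the raw result in each row lets you review analysis quality side by side with the timings.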

Next Steps

After successful deployment:

Test with your existing video samples
Compare results with your Node.js GPT-4o system
Evaluate which approach works better for your use case
Consider a hybrid approach: use both systems for different scenarios

Support

If you encounter issues:

Check HF Spaces logs
Verify GPU allocation
Ensure all files are uploaded correctly
Test with smaller videos first
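
GPU allocation can also be checked from outside the Space. A minimal sketch, assuming `api` is a `huggingface_hub.HfApi` instance with access to the Space; `runtime_summary` is a hypothetical helper.

```python
def runtime_summary(api, repo_id):
    """Summarize a Space's current stage and hardware.

    `api` is expected to be a huggingface_hub.HfApi instance.
    """
    runtime = api.get_space_runtime(repo_id=repo_id)
    return {
        "stage": runtime.stage,
        "hardware": runtime.hardware,
        "requested_hardware": runtime.requested_hardware,
    }
```

If `hardware` and `requested_hardware` differ, the hardware change has likely not taken effect yet.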