Convert spoken words to text
A private and powerful AI that runs locally in your browser
Convert spoken words into text
Generate web pages using Jinja templates
Classify images in real-time using labels
Estimate depth from your webcam video
Separate speakers in audio recordings
ML-powered speech synthesis directly in your browser
Segment objects in images using points
Generate text using a sample React app
Classify objects in real-time using webcam
In-browser speech recognition w/ word-level timestamps
Generate images from text prompts
Transcribe audio to text
Experiment with and compare different tokenizers
Run Gemini Nano locally in your browser with Transformers.js
Classify objects in real-time with webcam video
Classify images in real-time using your webcam
Generate depth map from image
Generate text based on your input prompts
Upload an image to detect objects