Features Reference
Complete feature list for Music Video Creator.
Visual Styles (19 Total)
GPU-Accelerated Styles (10)
| Style |
Description |
Performance |
sphere_3d_gpu |
3D audio-reactive sphere with noise displacement |
~5 fps |
sphere_3d_calm_gpu |
Calmer sphere with reduced reactivity |
~5 fps |
terrain_3d_gpu |
Synthwave terrain flyover with grid lines |
~1 fps |
tunnel_3d_gpu |
Infinite tunnel with bass-reactive walls |
~8 fps |
reactive_disk_gpu |
Bass-reactive spinning disk with rings |
~30 fps |
spectrum_texture_gpu |
128-bin spectrum texture mapping |
~30 fps |
frequency_bars_gpu |
GPU spectrum analyzer bars |
~30 fps |
circular_gpu |
GPU radial spectrum |
~30 fps |
particles_gpu |
GPU particle system |
~30 fps |
waveform_gpu |
GPU waveform display |
~30 fps |
Matplotlib Styles (9)
| Style |
Description |
Performance |
waveform_spectrogram |
Combined waveform + spectrogram display |
~3s/frame |
spectrogram_only |
Full spectrogram with playhead |
~2s/frame |
waveform_only |
Waveform with playhead marker |
~1s/frame |
frequency_bars |
Classic spectrum analyzer bars |
~2s/frame |
particles |
Audio-reactive particle system |
~5s/frame |
geometric |
Pulsing polygons and circles |
~3s/frame |
circular_spectrum |
Radial spectrum analyzer |
~4s/frame |
beat_pulse |
Rhythm-synced pulsing effects |
~3s/frame |
beat_rings |
Expanding rings on beats |
~3s/frame |
Scene Detection
Automatically segments audio into scenes based on energy profiles.
| Scene Type |
Detection Criteria |
intro |
Low energy, beginning of track |
verse |
Moderate energy, stable |
build |
Rising energy, increasing intensity |
drop |
High energy peak |
breakdown |
Energy dip after drop |
outro |
Declining energy, end of track |
Genre Presets
Pre-configured scene-to-style mappings for different music genres.
| Genre |
Intro |
Verse |
Build |
Drop |
Breakdown |
Outro |
| Electronic |
tunnel_3d |
particles_gpu |
waveform_gpu |
reactive_disk |
sphere_3d |
terrain_3d |
| Rock |
frequency_bars |
waveform_gpu |
circular_gpu |
reactive_disk |
beat_pulse |
waveform_gpu |
| Pop |
circular_gpu |
particles_gpu |
frequency_bars |
reactive_disk |
geometric |
particles_gpu |
| Ambient |
terrain_3d |
sphere_3d_calm |
waveform_gpu |
sphere_3d |
terrain_3d |
sphere_3d_calm |
| Blues |
waveform_only |
frequency_bars |
circular_gpu |
beat_rings |
waveform_only |
frequency_bars |
Beat Effects
Triggered effects synchronized to detected beats.
| Effect |
Description |
Parameters |
| Zoom |
Momentary zoom on beat, decays back |
zoom_amount, zoom_decay |
| Flash |
Brightness pulse on beat |
flash_intensity, flash_decay |
| Shake |
Subtle camera shake on beat |
shake_amount, shake_decay |
Post-Processing
Optional cinematic effects applied after visualization.
| Effect |
Description |
Parameters |
| Bloom |
Glow on bright areas |
bloom_intensity, bloom_threshold, bloom_radius |
| Vignette |
Darkening at edges |
vignette_intensity, vignette_radius, vignette_softness |
| Grain |
Film grain overlay |
grain_intensity, grain_size |
HUD Overlays
Heads-up display elements.
| Overlay |
Description |
| Title |
Configurable text with fade in/out |
| BPM |
Real-time tempo display |
| Progress |
Time bar with elapsed/total display |
Video (MP4)
- Codec: libx264 (default) or VP9 (via 1440p upscale)
- Resolution: Up to 4K (3840x2160)
- Frame rate: 30-60 fps
- Audio: AAC 384kbps
GIF Presets
| Preset |
Width |
FPS |
Max Duration |
Target Size |
| discord |
480px |
15 |
15s |
8MB |
| twitter |
480px |
15 |
60s |
15MB |
| slack |
400px |
12 |
30s |
10MB |
| reddit |
640px |
20 |
60s |
20MB |
| preview |
320px |
10 |
5s |
2MB |
| high_quality |
720px |
24 |
30s |
50MB |
Lyrics & Subtitles
| Feature |
Description |
| Whisper transcription |
Auto-generate subtitles from audio |
| Subtitle burn-in |
FFmpeg ASS/SRT filter |
| Karaoke highlighting |
Word-by-word ASS generation |
| Custom subtitle file |
Support for existing ASS/SRT files |