Generate spectrograms and multi-panel audio visualizations from the command line using the songsee CLI.
npx clawhub@latest install songseeRequirements
songsee is a command-line tool that converts audio files into spectrograms and rich feature-panel visualizations. It supports a wide range of acoustic analysis panels — from mel spectrograms and chroma to MFCC, tempogram, and loudness — and outputs them as image files. Whether you need a quick single-panel spectrogram or a full multi-panel grid for in-depth audio analysis, songsee handles it in a single command.
npx clawhub@latest install songseeClick the Install button at the top of this page for one-click setup
--start and --duration flags.Render multiple acoustic analysis panels in a single grid image using --viz with a comma-separated list. Supported panels include spectrogram, mel, chroma, hpss, selfsim, loudness, tempogram, mfcc, and flux.
Focus on any segment of an audio file using --start and --duration flags. This is useful for inspecting specific moments in long recordings without pre-editing the file.
Accepts audio files directly or via stdin (cat track.mp3 | songsee -), making it easy to integrate into shell pipelines. WAV and MP3 are decoded natively; other formats are handled via ffmpeg if available.
Choose from multiple color palettes (classic, magma, inferno, viridis, gray) and control output dimensions with --width / --height. Output format can be jpg or png.
Fine-tune the analysis with --window and --hop for FFT settings, and --min-freq / --max-freq to constrain the frequency range displayed.
Producers can run songsee track.mp3 to instantly generate a spectrogram of a mix, visually inspecting frequency balance, clipping, or noise before export.
Researchers can generate a full 9-panel visualization covering mel, chroma, HPSS, tempogram, MFCC, and more in one command, providing a comprehensive acoustic fingerprint of a recording.
Using stdin support, songsee can be embedded in CI or batch processing pipelines to automatically generate spectrogram thumbnails or diagnostic images for audio datasets.
Audio engineers can use --start and --duration to isolate and visualize a specific section of a track — such as the first chorus or a suspected artifact — without editing the source file.
npx clawhub@latest install songseeRequirements
Log in to write a review
No reviews yet. Be the first to share your experience!