Real-valued Swin VAE · ×64 compression

High-fidelity music,
compressed.

SAGE is a Swin-based variational autoencoder that reconstructs music at a ×64 compression rate.

SCROLL TO DISCOVER

A/B comparison

Hear the difference

Same clip, every model. Switch instantly.

Benchmarks

Objective metrics

Reconstruction quality on the FMA-large test split.

Subjective evaluation

Take the listening test

A short MUSHRA test.

Start the test