ZeroSep is a training-free audio source separation framework that repurposes pre-trained text-guided diffusion models for zero-shot separation. No fine-tuning, no task-specific data—just latent ...