MelodyFlow
MelodyFlow generates and edits high-fidelity music based on text descriptions, offering diverse styles and efficient editing.
What is MelodyFlow?
MelodyFlow is a cutting-edge text-controlled music generation and editing model. It uses continuous latent representations to create high-fidelity stereo samples without losing information. Based on diffusion transformers, it is trained with flow matching objectives to produce diverse music that aligns with simple text descriptions. It also introduces a novel regularization method for zero-shot text-guided editing, excelling in various music editing tasks.