Stability AI releases a new audio model that can create six-minute songs

https://techcrunch.com/wp-content/uploads/2025/06/GettyImages-1480808838.jpg?resize=1200,800

Stability AI, the company behind Stable Diffusion, is releasing a new family of audio models, called Stability Audio 3.0. The top model can generate professional-grade music of more than six minutes long, the company claimed.

The company is releasing four new models under the Stable Audio 3.0 name: small SFX (459M parameters), small (459M parameters), medium (1.4B parameters), and large (2.7B parameters). The duo of small models is suitable for on-device sound and music generation of up to two minutes.

Both medium and large models can create full compositions of 6 minutes 20 seconds long that can maintain musical structure and melodic tone. This is more than double the length of what Stable Audio 2.0, released in 2024, was capable of generating.

Stability AI is making small SFX, small, and medium models available with open weights for anyone to use and modify. In 2024, the company released Stable Audio Open...

Copyright of this story solely belongs to techcrunch.com. To see the full text click HERE