AceStep Logo AceStep.io

Free to start: try up to 2 songs/day. No credit card required.

Generate AI music with ACE-Step 1.5

Run the full ACE-Step 1.5 stack online — describe your track or paste lyrics, then hit generate. No install, no GPU, no waitlist.

Free to start. Try up to 2 songs/day. No credit card required.

What is ACE-Step 1.5?

What is ACE-Step 1.5?

ACE-Step 1.5 is the latest open-source music foundation model from ACE Studio and StepFun. It pairs a Language Model planner with a Diffusion Transformer (DiT) decoder, so short prompts expand into structured song blueprints — lyrics, sections, metadata and all — that drive high-fidelity audio synthesis.

Hybrid LM + DiT Architecture

The LM acts as an omni planner (query rewriting, Chain-of-Thought metadata, lyric alignment) while the DiT handles raw waveform generation. Intrinsic reinforcement learning keeps both halves aligned without human preference data, producing stable output across tempos and genres.

Why run it on acestep.io

Running ACE-Step 1.5 locally needs Python 3.11+, a capable CUDA or ROCm GPU, and multi-gigabyte model downloads. acestep.io hosts the whole stack — 2B turbo/sft, XL 4B, and all 5Hz LM tiers — so you can try every model with one click from any browser.

Key Features of ACE-Step 1.5

Generation Quality

  • Commercial-grade vocals with expressive phrasing and breath control
  • Rich timbre control across 1,000+ instruments and style tags
  • Multi-language lyric input with structural markers (verse / chorus / bridge)
  • Automatic LRC timestamps for every generated track

Editing & Control

  • Reference audio — guide style, tempo and timbre from any MP3 or WAV
  • Cover & repaint — rebuild or locally edit an existing track
  • Vocal2BGM — auto-produce accompaniment for your own vocals
  • Metadata control — pin BPM, key, time signature and duration

Performance & Scale

  • Turbo mode renders a full song in under 10 seconds of server time
  • Generate up to 8 tracks in parallel for fast A/B iteration
  • Durations from 10 seconds to 10-minute long-form compositions
  • Priority queue and MP3/WAV export for creators on the go

Why Creators Choose acestep.io for ACE-Step 1.5

Free to Start

New accounts receive daily credits so you can generate real ACE-Step 1.5 songs without a subscription or card on file.

No GPU Required

We handle the 12–24 GB VRAM, CUDA drivers and model zoo — your laptop, tablet or phone just streams the finished audio.

Every Model Tier

Switch between 2B Turbo, 2B SFT and XL 4B Turbo/SFT on the fly to trade speed for maximum audio quality.

Lyric + Voice Control

Feed your own lyrics, upload a reference vocal, or let the planner write a topline for you — all from the same studio.

Editor, Not Just a Generator

Use repaint to rewrite a single section, lego to reorder parts, or extract to pull stems from a finished song.

Royalty-Free Output

Every render from acestep.io is yours to keep for personal and commercial use — no watermarks, no surprise licensing.

FAQ — ACE-Step 1.5

What is ACE-Step 1.5?

ACE-Step 1.5 is an open-source AI music foundation model that combines a Language Model planner with a Diffusion Transformer audio decoder. It can generate full songs with vocals from a text prompt, turn lyrics into melodies, and perform editing tasks such as cover, repaint and vocal-to-BGM.

How is ACE-Step 1.5 different from 1.0?

Version 1.5 introduces the hybrid LM + DiT design, the 4B XL decoder for higher fidelity, faster turbo scheduling, native reference-audio conditioning and unified tools for cover, repaint, stem separation and LRC generation — all of which are available on acestep.io.

Do I need a GPU to use ACE-Step 1.5 on this site?

No. acestep.io hosts the full ACE-Step 1.5 stack on cloud GPUs. You only need a modern browser. Local installation is only necessary if you want to fine-tune your own LoRA or host the model offline.

Can I bring my own lyrics?

Yes. Paste structured lyrics (verse / chorus / bridge) directly into the Create page. You can also let the ACE-Step 1.5 LM planner auto-write lyrics from a one-line idea if you prefer.

Which formats can I download?

Every generation is exportable as MP3 or WAV. Studio projects additionally expose LRC lyric timestamps and stem files when you enable the separation workflow.

How long can a song be?

ACE-Step 1.5 supports durations from 10 seconds up to 10 minutes in a single pass. Longer arrangements can be built by stitching repaint or lego segments in the Studio.

Is the output safe for commercial use?

Yes. All audio you generate on acestep.io is royalty-free and cleared for personal and commercial projects. You are responsible for the lyrics and references you supply.

Ready to make a song with ACE-Step 1.5?

Launch the Create studio and have your first ACE-Step 1.5 song — vocals, lyrics, mix and all — ready for download in under a minute.