# Applio

Last update: August 9, 2025


Applio Banner Logo

# Introduction ‎

  • Applio is a VITS-based Voice Conversion Tool developed by the IA Hispano team. It's a Fork of Original/Mainline RVC.

  • It's liked for its great UI, performance improvements and lots of extra features, such as TTS (with RVC models too), plugins, automatic model upload, customizable theme & more.

  • Because of its user-friendly experience & active development, it's considered to be one of the best forks.

  • Applio has it's own Applio Docs, which may have more info about the tool.


# Are RVC Models Safe?

RVC Models are PyTorch Models, a Python library used for AI. PyTorch uses serialization via Pythons' Pickle Module, converting the model to a file. Since pickle can execute arbitrary code when loading a model, it could be theoretically used for malware, but this fork has a built-in feature to prevent code execution along the model. Also, HuggingFace has a Security Scanner which scans for any unsafe pickle exploits and uses also ClamAV for scanning dangerous files.


# Pros & Cons

✔️ PROS
CONS
  • Very complete
  • Has an active development
  • Currently stable
  • Faster interface
  • Faster Training
  • Has (not Mangio) Crepe for Training
  • TTS features
  • Automatic model upload
  • User-friendly UI
  • TensorBoard included
  • Extra features: (plugins, model fusion, etc)
  • None 😄

# System & Hardware Requirements

Check if you meet the requirements to run it locally.

If you don't meet the requirements, there are 4 Cloud Versions:

#

# Download & Installation

#

# Nvidia on Windows (Precompiled)

RTX 5000 Series Users require version 3.3.0 or newer.

  1. The easiest way to download Applio is by going to Applio's Hugging Face repo, and clicking the [ download ] button on the right-hand side.

    image

  1. Unzip the folder. This may take a few minutes.

  1. Open Applio's folder & execute run-applio.bat.

    image

  • A console tab will appear, and after a moment your default browser, with Applio ready to use.

# Linux & macOS

  1. The easiest way to download Applio is by going to Applio's Hugging Face repo, and clicking the [ download ] button on the right-hand side.

    image

  1. Unzip the folder. This may take a few minutes.

  1. Make sure you have Python 3.10.12 or 3.11.x installed. You can check your version by running python --version.
  2. Open a terminal in the Applio directory you just extracted.
  3. Run the commands corresponding to your Linux distribution:
apt install python3.10-venv -y
python -m venv .venv
find ".venv" -type f -exec sed -i -e 's/\r$//' -e "s|/home/runner/work/Applio/Applio|$(pwd)|g" -e "s|/.venv/bin/python|/.venv/bin/$(basename $(which python))|g" {} +
sudo pacman -S python-virtualenv --noconfirm
python -m venv .venv
find ".venv" -type f -exec sed -i -e 's/\r$//' -e "s|/home/runner/work/Applio/Applio|$(pwd)|g" -e "s|/.venv/bin/python|/.venv/bin/$(basename $(which python))|g" {} +
sudo dnf install python3-virtualenv -y
python -m venv .venv
find ".venv" -type f -exec sed -i -e 's/\r$//' -e "s|/home/runner/work/Applio/Applio|$(pwd)|g" -e "s|/.venv/bin/python|/.venv/bin/$(basename $(which python))|g" {} +
  1. Run Applio
  • In the terminal, run the following commands to make the script executable and launch the application:
chmod +x run-applio.sh
./run-applio.sh

  • A console tab will appear, and after a moment your default browser, with Applio ready to use.

# AMD on Windows (Precompiled Fix)

For AMD GPU users, follow these steps to set up Applio:


  1. Download and install the VC++ Runtime.

  1. First, check the official System Requirements on the AMD ROCm™ documentation site. In the "Windows-supported GPUs" section, determine which steps to follow below.
  • Install either v6.1.2 or v5.7.1 HIP SDK from the AMD ROCm Hub.
  1. Install v5.7.1 HIP SDK from the AMD ROCm Hub.
  2. Download the correct archive for your GPU:
  3. Navigate to C:\Program Files\AMD\ROCm\5.7\bin\rocblas\ and rename the library folder to library.old.
  4. Create a new, empty folder named library in its place.
  5. Unzip the content of the archive you downloaded into this new library folder.
  1. Find your GPU's gfxNNNN value. You can do this by searching "techpowerup your_gpu_name" (e.g., "techpowerup RX 7900 XTX") and finding the "Shader ISA" on the specifications page.
  2. Follow the steps for your corresponding gfx value:

    1. Install v5.7.1 HIP SDK from the AMD ROCm Hub.
    2. Download this archive.
    3. Navigate to C:\Program Files\AMD\ROCm\5.7\bin\rocblas\ and rename the library folder to library.old.
    4. Unzip the content of the archive directly into the C:\Program Files\AMD\ROCm\5.7\bin\rocblas\ folder.
    • Visit this repository and follow the specific instructions provided there.

  1. Download a compiled version of Applio (v3.2.5 or higher) and unzip it to your desired folder.

    image
  2. Open a Command Prompt in the Applio folder (type CMD in the address bar and press Enter). Run the following commands to install the correct version of PyTorch for Zluda.

    env\python -m pip uninstall torch torchvision torchaudio
    env\python -m pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu118

  1. Navigate to the assets\zluda folder inside your Applio directory.
  2. Move all .bat files from this folder to the main (root) Applio folder.
  3. Run the patch file that corresponds to your HIP SDK version:
    • For HIP SDK 5.7: Run patch_zluda_hip57.bat.
    • For HIP SDK 6.1: Run patch_zluda_hip61.bat.
  4. Add the bin directory of your HIP SDK installation to your system's Path environment variables.
    • For HIP SDK 5.7: C:\Program Files\AMD\ROCm\5.7\bin
    • For HIP SDK 6.1: C:\Program Files\AMD\ROCm\6.1\bin

  1. Run run-applio-amd.bat to start Applio.

  • A console tab will appear, and after a moment your default browser, with Applio ready to use.

#
#

# Inference

#
#

# 1. Upload voice model.

  • Go to the Download tab.
    You have two ways of uploading it: through its link or manually inputting its files.

    1. Go to the Download tab & paste the link of the model in the Model Link bar. It must be from Hugging Face or Google Drive.

      image

    2. Press Download Model.

    1. Drag & drop the model's .PTH in the Drop files box below.

      image

    2. Then drag the .INDEX.

# 2. Select voice model.

  1. Return to the Inference tab & click the Refresh button on the right.

    image

  2. Select your model in the Voice Model dropdown.

    image

#

# 3. Input vocals.

  • With Applio you can convert audios individually or in batches:

    1. Drag & drop the audio or click the upload box to search it.

      image

    2. Then select it in the dropdown below.

      image

    1. Go to the Batch tab.
      image

    2. In the Input Folder bar, paste the path folder containing the audios.

      In Output Folder you can paste a path folder for the results.

      Ensure the paths don't contain spaces/special characters.

# 4. Modify settings. (optional)

  • Unfold Advanced Settings if you wish to modify the inference settings for better results, or to determine the output folder.

    image


#

# 5. Convert.

  1. Click Convert at the bottom. The audio will begin to process.
    The processing time will mainly depend on your specs, length of audio & the algorithm picked.

  2. Once it's done, you can hear the results in the Export Audio box below.

    By default the output files will be in the "audios" folder: \ApplioV3.0.7\assets\audios


#
#

# Training

#
#
#
# a. Model Name
#
  • Go to the Train tab. Input a name for your model in Model Name.
    Don't include spaces/special characters.

    image


#
# b. Dataset Path
#
  • Paste the path file of your dataset in the Dataset Path bar. Ensure the path doesn't contain spaces/special characters.

    image


#
# c. Sampling Rate
#
  • Select your dataset's sample rate. If you don't know the amount, click here.

    image


#
# d. Preprocess Dataset
#
  • Ensure RVC Version is set as V2 & click Preprocess Dataset.

    It'll finish when the output box says preprocessed successfully.

    image
#
# a. Pitch extraction algorithm
#
  • Select the algorithm you want. Use either RMVPE (most suggested) or Crepe. Applio removed pm. dio and harvest as they are outdated.

    image

#
# b. Embedder Model
#

#
# c. Custom Embedder Model
#
  • If you select "custom" for embedders, you can add your own, like Spin (which seems to have better pronunciation than contentvec and better for realtime as it handles context differently)

    image
  • Give it a Folder Name, like "spin".
  • Upload the .bin and .json files, which for example you can find them at https://huggingface.co/IAHispano/Applio/tree/main/Resources/embedders/spin for spin.
  • Click "Move files to custom embedder folder".
  • After you added your custom embedder, Refresh Embedders and select it from the Dropdown menu at the left of the refresh embedders button. image

#
# d. Extract Features
#
  • Press Extract Features.
    It'll finish when it says extracted successfully.

    image
#
# a. Batch Size
#
  • If you are a newbie, use 8. But in case your dataset is short (around 2 minutes or less), use 4.

    image

#
# b. Save Every Epoch
#
  • Frequency of the saving checkpoints, based on the epochs.

  • If you are a newbie, simply leave it at 15, but if you wish to be percise set it to 1.

    image

  • E.g: with a value of 10, they will be saved after the epoch 10, 20, 30, etc.


#
# c. Total Epoch
#
  • Input the total amount of epochs (training cycles) for the model.

  • But since we'll use TensorBoard, use an arbitrarily large value like 1000

    image


#
# d. Use Pretrained Models
#
  • RVC uses the Orignal Pretrain by default (Pretrained is always checked, unchecking it is highly not recommended as you won't use even the Original Pretrain and train from scratc) to significantly reduce training duration and enhance overall quality. You can use the original pretrain, or community made models downloaded via the Download tab or upload them yourself.
Pretrained and Custom Pretrained options in Applio
  • Download Custom Pretrained Models (Optional):
    • Go to the Download tab, go to the download custom pretrain, and select the community made ones like TITAN and for which sample rate you need.
    • To use a pretrained model that you downloaded from the Download tab, simply check the Pretrained box.
    • If you can't find the pretrain you want, you can check AI HUB's #pretrain-models or here
  • Use Custom Pretrained Models (Optional):

    • Check the Custom Pretrained box to use your own files. This will open the Pretrained Custom Settings section.
    Uploading a custom pretrained model in Applio
    • Upload: Click Upload Pretrained Model to open a file dialog. Here, you can drag and drop your files or click to upload the Generator (G) and Discriminator (D) .pth files. This is for when you want to upload your own pretrains which aren't in the community download pretrains tab.
    • Select: After uploading, click Refresh Custom Pretraineds. Then, select your custom generator and discriminator from the Custom Pretrained G and Custom Pretrained D dropdown menus. These dropdowns will also show any custom pretrained models you have downloaded from the Download tab.

#
# e. Sync Graphs
#
  • Sync graphs trains a single epoch and sets the log interval to the amount of steps that epoch trained for.

  • Doing this makes the Tensor Board's graphs accurate.

    image


#
# f. GPU Settings
#
  • If you have multiple GPUs, tick GPU Settings to use a specific one for the training.

    image


#
# g. Generate Index
#
  • Click Generate Index. This will create the model's .INDEX file.

#
# h. Start Training
#
  • Press Start Training to begin the training process.

  • To open TB, execute run-tensorboard in Applio's folder. Remember to monitor it, as well as the console just in case.

  • The latter will show you errors if they happen, and information about the epochs & checkpoints.

    image
#
# a. Stop training
#
  • When you're very sure of overtraining, you can stop training by going to the Settings tab & press Restart Applio.

    image


#
# b. Get the INDEX
#
  1. Create a new folder anywhere named as the model.

  2. Open Applio's folder, go to logs, and open the folder named as the model.

  3. Select the .INDEX named added_ & move it to your newly made folder.

    image‎ ‎


#
# c. Get the PTH
#
  1. In said folder you'll also find all the checkpoints.

  2. Select the one closest to before the overtraining point, and move it to the new folder.

    The checkpoints will be organized with this format: ModelName_Epoch_Step.pth
    Example: arianagrande_e60_s120.pth


  1. And that's all, have fun with your model. To test it, do a normal inference as usual.
  • In case the training finished but the model still needed training, you don't have to start from scratch.

  • Simply enter the same settings & criteria that you've previously inserted. You don't have to do the preprocess or train the .INDEX again.

  • You can change the save frequency, or increase the Total Epoch amount in case you didn't input enough before.

  • Begin training again & remember to monitor TB & console like before.

#
#
#

# TTS

+ with any RVC model

#
  • Applio is also known for having one TTS tool by default, with plenty of voices to choose for.

  • You can also use it with RVC models & apply the inference settings if you wish.

  • Aditionally, you can download the Eleven Labs TTS plugin.


#

# Instructions:

  1. Go to the TTS tab.

    image


#
  1. If you want to use an RVC model, download it, go to TTS, click Refresh & select it in Voice Model & Index File.

    image

  • To modify the inference settings or the output folder for the TTS/RVC audio, unfold Advanced Settings.

#
  1. In TTS Voices select the voice of your desired language, accent & gender.

    In Text to Synthesize input your text. Then click Convert.

    image

  • If you are using an RVC model, select a voice that matches the model the most, to guarantee great results.

#
  1. Once it's done, you'll be able to hear the result in the Export Audio box. By default, the output audio will be in the "audios" folder. < \ApplioV3.0.7\assets\audios >

    image


#
#

# Update

#

To Update Applio, you need to firstly Save your audios and models, then Delete the current Applio folder and reinstall the latest version.


#
#

# Extra

#
  • Applio has an Extra menu, containing an audio analyzer.

  • Making it convenient for determining the sample rate of datasets when training models.

  • It also contains the model fusion tool, ideal for advanced users.


#

# Audio Analyzer:

  1. Go to the Extra tab & press the upload box to input your audio. Or simply drag & drop.

    image


#
  1. Once it's done uploading, click Get information about the audio.

#
  1. In Sampling rate you'll see the audio's full sample rate. Use said value for training.

    image

#
#
  • # Example:

    image

Here it reached 20 kHz. Doubling it gives 40kHz. Therefore the ideal target sample rate would be 40k


#
#

# Plugins

  • Plugins are components that you can add to Applio, that add new features & enhance your experience.

  • These are made by the public, and are free & easy to install.

  • You can find them on their GitHub page. More will be added in the future.


#

# Installation:

  1. Access their GitHub page & click on the name of the plugin you want.

    image


#
  1. Click on the ZIP file.

    image

  • Click on the download button on the right. This will download the ZIP file of the plugin.

    image


#
  1. Open Applio & head over to the Plugins tab. Drag & drop the ZIP file to the upload box.

    image

  • You will be able to see its installation process in the console.

    image


#
  1. Go to the settings tab & click Restart Applio at the bottom. Then you'll be able to see the plugin in the Plugins tab.

    image


# Communities


#
#

# Troubleshooting

#
#
  • If it's lower than 32k: select 32k.

  • If it's 44.1k: select 40k.

  • If i'ts higher than 48k: select 48k.

#
  • This a phenomenon called artifacting. To fix it, read here.
#
  • Report your issue here.

#

# You have reached the end.

Report Issues