What's RVC

Last update: November 9, 2025


Introduction


  • RVC (Retrieval-Based Voice Conversion) is an advanced AI voice cloning software based on VITS, developed by the RVC-Project team. It's considered the best free & open-source one to date.

  • It was designed for desktop, requiring great specs to run it effectively, specially GPU for training models.

  • Though it can be executed through the cloud & be used in any device, in case you don't meet the previous requirement.

  • RVC doesn't have any major quality improvements since 2023, since its original devs are focused on other projects, RVC is hard to optimize, and it has limitations like non speech sounds such as realistic laughing, screaming, etc. Though, there are commmunity driven Forks that try to experiment with it, mostly about adding new features and performance improvements.


Forks



FAQ

Frequently asked questions.


  • As explained before, it depends on your needs. It's best to try them yourself.
  • For local users, Applio is a great starting point. For cloud users you can use either the Applio Google Colab or applio kaggle.

The minimum specs vary depending if it's for training models or inference.

SPEC MINIMUM REQUIREMENT
GPU NVIDIA GTX 900 Series / AMD RX580 (Mac isn't supported)
RAM 6GB
Storage 30 GB
SPEC SUGGESTED REQUIREMENT
GPU NVIDIA RTX 20 Series or later / AMD Radeon RX 5xxx or later (Mac isn't supported)
VRAM 8GB
RAM 8GB
Storage 30 GB
SPEC MINIMUM REQUIREMENT
RAM 6GB
Storage 6 GB
SPEC SUGGESTED REQUIREMENT
GPU NVIDIA RTX 20 Series or later / AMD Radeon RX 5xxx or later / Apple M3
RAM 6GB
Storage 6 GB
  • You can, but it's going to be slower, as they don't have CUDA cores.
  • So it's more convenient using RVC through the cloud.
  • If you're willing to use a slower version you can go ahead and follow this guide on how to get zluda working with Applio Zluda.
  • The total time depends on a lot of factors, like dataset length, batch size, pretrains, specs, etc.

  • A 10 min dataset with RMVPE may take around 1 to 2 hours.

  • Yes, on Macs of recent generations.
  • But you can only do inference & it's a little unstable.
  • If you're using RVC locally, no (the only exception would be Applio TTS as it uses Microsoft's Edge TTS as a base).
  • If you're using it through the cloud, then yes.

You have reached the end.

Report Issues

⚠️
SECURITY ALERT: COMMUNITY SAFETY NOTICE
  1. Discord Vanity Hijack: AI HUB official vanity discord.gg/aihub was hijacked on April 30, 2026. Do not join it, it's a malicious clone server. Please use our official permanent invite: discord.gg/mmRR2TUJF5.
  2. Weights.gg Vanity (At Risk): We are aware of actors targeting AI HUB's Former Partner Weights.gg's vanity discord.gg/weights. To ensure you remain on the legitimate server, please use our permanent invite: discord.gg/CHGgVRCHvm. Be aware that Weights.gg's Discord Server is semi closed since the shutdown on April 1st 2026.
  3. "W8ights" Phishing Scam: Malicious actors are impersonating AI HUB's former partner, Weights.gg. These "W8ights" sites are scams. We have no affiliation with them and are currently investigating into them.
  4. TL;DR: YOU ARE SAFE. This is the original, legitimate AI Hub Docs. We are just informing users about impersonations.