# Model Maker Role

Last update: October 20, 2024

# Introduction

To upload your voice models to the #voice-models forum channel in the AI HUB Discord server, you must first pass a Quality Control (QC) check, which confirms that the models you post are of good quality.

# Requirements

# `Before proceeding, ensure you meet these requirements.`



- Model's .PTH file.
- Model's .INDEX file.
- General information about the model.
- General information about its training process.
- A Hugging Face or weights.com account.
- At least 1 raw audio sample of the model WITH NO MUSIC.


# Things to Avoid

# `These will disqualify your post`

# It lacks the correct files.

The .ZIP file must contain both the correct .INDEX file and the correct .PTH file.
The correct .index is the one whose name starts with added_.
- The added index contains the voice's accent and speech manner.
The correct .pth is the one that has your model's name, for example: TylerSwift_e60_s120.pth
- The .pth contains the actual model and pitch data.
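
Before uploading, the archive can be checked locally. The following is a minimal sketch, not part of the official QC process: `check_model_zip` is a hypothetical helper name, and it only assumes what is stated above, namely that the archive needs an index file whose name starts with added_ and a .pth file.

```python
import zipfile
from pathlib import PurePosixPath


def check_model_zip(zip_path):
    """Return a list of problems found in the archive (empty if none)."""
    with zipfile.ZipFile(zip_path) as archive:
        # Compare bare, lower-cased file names so files inside a subfolder
        # and upper-case extensions (.INDEX, .PTH) still count.
        names = [
            PurePosixPath(entry).name.lower()
            for entry in archive.namelist()
            if not entry.endswith("/")
        ]

    problems = []
    if not any(n.startswith("added_") and n.endswith(".index") for n in names):
        problems.append("no added_*.index file found")
    if not any(n.endswith(".pth") for n in names):
        problems.append("no .pth file found")
    return problems
```

An empty list means both files are present. The check cannot tell whether the .pth is the right one for your model, so you still need to confirm its name yourself.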


# Model is low quality.

A bad model:
- Sounds scratchy/screechy.
- Has a muffled sound.
- Sounds inaccurate to the source.
- Is incapable of hitting certain notes.
- Has slurred speech.
- Is unable to pronounce words correctly in its intended language.
- Has artifacts.
- Has noise.


# An outdated extraction method was used.

Only Crepe and RMVPE are allowed.
Harvest, Dio, Crepe-Tiny, PM, etc. are obsolete.
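
As an illustration of the rule above, here is a small sketch of an allow-list check. The function name `is_allowed_extraction` is hypothetical, and the exact-match behavior is an assumption: variants such as Crepe-Tiny are deliberately not treated as Crepe.

```python
# Only the two methods named as allowed above.
ALLOWED_METHODS = {"crepe", "rmvpe"}


def is_allowed_extraction(method):
    """Return True only for an exact (case-insensitive) match."""
    return method.strip().lower() in ALLOWED_METHODS
```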


# The audio demo contains an instrumental.

Don't include ANY music in the audio demo, even if it's not copyrighted. This is due to:
- Concerns over copyright.
- In many cases, the music can "hide" the flaws of the voice model, making it harder to judge its quality.

# The audio demo is altered.

Don't add reverb, equalize, or alter the demo in any way, as it won't be a faithful representation of the model. It must be the raw, unmodified output from the inference.
Trimming silences at the beginning/end of the audio demo is allowed.


# The model is a robotic or non-human voice.

Robotic, sound-effect, and drum models will also be rejected, because with these types of voices it is difficult to judge whether you know how to clean a dataset properly.
However, once you have the Model Maker role, you will be able to post robotic, sound-effect, or drum models.


# Step 3: Prepare the submission.

Once your model is ready, head over to the AI HUB's #model-maker-role channel.
Click the Submit Model button.

# `Now fill in the information about your model:`

- model-name: The model's name.
- technology: The technology used to train it.
- extraction: The extraction method you used.
- epochs: The total number of epochs.
- link: The model's download link from Hugging Face or Weights.
- demo: An audio sample of the model talking or singing.
- note: Optional. Add more context about the model if you want.

You can attach more samples when you repost the model to #voice-models.
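
To catch obvious mistakes before you submit, the form fields can be sanity-checked locally. This is only a sketch and not the bot's actual validation: `check_submission` is a hypothetical helper, the field names are taken from the form above, and the host names `huggingface.co` and `weights.com` are assumed to be the accepted download hosts.

```python
from urllib.parse import urlparse

# Every field except the optional "note" is treated as required.
REQUIRED_FIELDS = ("model-name", "technology", "extraction", "epochs", "link", "demo")
# Assumed host names for Hugging Face and Weights.
ALLOWED_HOSTS = ("huggingface.co", "weights.com")


def check_submission(fields):
    """Return a list of problems with the submission (empty if none)."""
    problems = []
    for name in REQUIRED_FIELDS:
        if not str(fields.get(name, "")).strip():
            problems.append(f"missing field: {name}")

    epochs = str(fields.get("epochs", "")).strip()
    if epochs and not (epochs.isdecimal() and int(epochs) > 0):
        problems.append("epochs should be a positive whole number")

    link = str(fields.get("link", "")).strip()
    if link:
        host = (urlparse(link).hostname or "").lower()
        # Accept the host itself or any of its subdomains.
        if not any(host == h or host.endswith("." + h) for h in ALLOWED_HOSTS):
            problems.append("link should point to Hugging Face or Weights")
    return problems
```

Passing a dictionary keyed by the field names returns an empty list when nothing obvious is wrong. It says nothing about the model's quality, which is still judged by a quality checker.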


# Step 4: Send the submission.

Once you have filled in the information, your model is sent for quality control.

Now wait for a QC (quality checker) to verify your model. You'll be notified once it has been reviewed.
If you made a mistake in your submission or want to change something, you can cancel it by clicking the Cancel button attached to the message you receive when you submit.

If your model gets approved, the bot will notify you.
You can then repost the model (and any future models) to the #voice-models forum.


# `You have reached the end.`
