# Model Maker Role

Last update: October 20, 2024


# Requirements

Before proceeding, ensure you meet these requirements:
  • The model's .pth file.
  • The model's .index file.
  • General information about the model.
  • General information about its training process.
  • A Hugging Face or Weights.gg account.
  • At least 1 raw audio sample of the model WITH NO MUSIC.

# Things to Avoid

Any of the following will disqualify your post.

# It lacks the correct files.

  • The .ZIP file must contain both the correct .index and .pth files.

  • The correct .index file is the one whose name starts with added_.

    • The added index contains the voice's accent and manner of speech.
  • The correct .pth file is the one that has your model's name, for example: TylerSwift_e60_s120.pth

    • The .pth contains the actual model and pitch data.
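
The file rules above can be checked before uploading. The following is a minimal sketch, not an official tool: the function names `check_model_zip` and `check_file_names` are hypothetical, and it assumes the only requirements are one .index file whose name starts with added_ and at least one .pth file. It cannot tell whether the .pth is the one carrying your model's name, so that still needs a manual look.

```python
import zipfile
from pathlib import PurePosixPath


def check_file_names(names):
    """Return a list of problems for the given file names (empty if none)."""
    problems = []
    index_files = [n for n in names if n.lower().endswith(".index")]
    pth_files = [n for n in names if n.lower().endswith(".pth")]
    # The guide asks for the index whose name starts with "added_".
    if not any(n.startswith("added_") for n in index_files):
        problems.append("no added_*.index file found")
    if not pth_files:
        problems.append("no .pth file found")
    return problems


def check_model_zip(zip_path):
    """Open a .zip archive and check the file names it contains."""
    with zipfile.ZipFile(zip_path) as archive:
        # Drop directory entries and keep only the base name of each file.
        names = [
            PurePosixPath(entry).name
            for entry in archive.namelist()
            if not entry.endswith("/")
        ]
    return check_file_names(names)
```

For example, `check_model_zip("TylerSwift.zip")` (a hypothetical archive name) returns an empty list when both files are present, and a list of messages otherwise.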


# Model is low quality.

  • A bad model:

    • Sounds scratchy/screechy.
    • Has a muffled sound.
    • Sounds inaccurate to the source.
    • Is incapable of hitting certain notes.
    • Has slurred speech.
    • Is unable to pronounce words correctly in its intended language.
    • Has artifacts.
    • Has noise.


# An outdated extraction method was used.

  • Only Mangio-Crepe and RMVPE are allowed.

  • Harvest, Dio, Crepe-Tiny, PM, etc. are obsolete.



# The audio demo contains music.

  • Don't include ANY music in the audio demo, even if it's not copyrighted. This is due to:

    • Concerns over copyright.
    • In many cases, the music can "hide" the flaws of the voice model, making it harder to judge its quality.

# The audio demo is altered.

  • Don't add reverb, equalize, or alter the demo in any way, as it won't be a faithful representation of the model. It must be the raw, unmodified output from the inference.

  • Trimming silences at the beginning/end of the audio demo is allowed.


# The model is a robotic or non-human voice.

  • Robotic, sound effect, and drum models will also be rejected, because with these types of voices it is difficult to determine whether you know how to clean a dataset properly.

  • However, once you have the Model Maker role, you will be able to post robotic, sound effect, or drum models.


# Step 3: Prepare the submission.

  • Once your model is ready, head over to the #model-maker-role channel.

  • Click the Submit Model button.


Now fill in the information about your model:

  • model-name: Its name.
  • technology: The technology used for its training.
  • extraction: The extraction method you used.
  • epochs: The total number of epochs.
  • link: Its download link from Hugging Face or Weights.
  • demo: An audio sample of it talking/singing.
  • note: Optional. Add more context about the model if you want.
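
To catch common mistakes before sending, the form fields can be modeled and checked locally. The sketch below is illustrative only and is not how the submission bot works: the `Submission` class and `find_problems` function are hypothetical, the hyphenated field name model-name becomes `model_name`, and the host names `huggingface.co` and `weights.gg` are assumptions about where the download link points.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Extraction methods the guide allows (compared case-insensitively).
ALLOWED_EXTRACTION = {"mangio-crepe", "rmvpe"}
# Assumed host names for Hugging Face and Weights download links.
ALLOWED_HOSTS = {"huggingface.co", "weights.gg"}


@dataclass
class Submission:
    model_name: str
    technology: str
    extraction: str
    epochs: int
    link: str
    demo: str
    note: str = ""  # optional extra context


def find_problems(sub):
    """Return a list of problems with the submission (empty if none)."""
    problems = []
    if sub.extraction.strip().lower() not in ALLOWED_EXTRACTION:
        problems.append("extraction method not allowed")
    if sub.epochs <= 0:
        problems.append("epochs must be a positive number")
    host = (urlparse(sub.link).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    if host not in ALLOWED_HOSTS:
        problems.append("link must point to Hugging Face or Weights")
    if not sub.demo:
        problems.append("audio demo is missing")
    return problems
```

A check like this only covers the form itself. Whether the demo is free of music and unaltered, and whether the model sounds good, is still judged by a quality checker.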



# Step 4: Send submission.

  • Once you have filled in the information, your model will be sent for quality checking (QC).


  • Now, wait for a QC (quality checker) to verify your model. You'll be notified once it has been reviewed.

  • If you made a mistake in your submission or want to change something, you can cancel it by clicking the cancel button attached to the message you receive when you send a submission.


  • If your model gets approved, the bot will notify you with a message.

  • You can then repost the model (and any future models) to the #voice-models forum.

