Text-to-Speech
Transformers
Safetensors
parler_tts
text2text-generation
annotation

Information

#7
by Sarvjeet001 - opened

Hi, thanks for you model... i am using it for exploring TTS but i have some queries that need your guidance:

  • What is the reason that your model can generate max 30 sec audio ? Is this depends on the audios length of training data? And if yes, can we increase a TTS model audio output time 30 sec to 3 min by changing training dataset audios from seconds to minutes ?
  • Is there any issue in audio output if we generate it longer then 30 sec by using any possible way?
  • Why your model has max word limit 20 for better result? On what it depends? and how can we increase it ?

Sign up or log in to comment