ai4bharat
/

vits_rasa_13

indic_vits_model

feature-extraction

Model card Files Files and versions Community

AshwinSankar commited on 8 days ago

Commit

00b1590

•

1 Parent(s): b89874f

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -22,6 +22,7 @@ tags:
 # VITS TTS for Indian Languages
 This repository contains a VITS-based Text-to-Speech (TTS) model fine-tuned for Indian languages. The model supports multiple Indian languages and a wide range of speaking styles and emotions, making it suitable for diverse use cases such as conversational AI, audiobooks, and more.
 ---
 ## Model Overview
@@ -30,6 +31,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 - **Languages**: Multiple Indian languages.
 - **Styles**: Various speaking styles and emotions.
 - **Speaker IDs**: Predefined speaker profiles for male and female voices.
 ---
 ## Installation
@@ -37,6 +39,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 ```bash
 pip install transformers torch
 ```
 ---
 ## Usage
@@ -59,6 +62,7 @@ outputs = model(inputs['input_ids'], speaker_id=speaker_id, emotion_id=style_id)
 sf.write("audio.wav", outputs.waveform.squeeze(), model.config.sampling_rate)
 print(outputs.waveform.shape)
 ```
 ---
 ## Supported Languages
@@ -76,13 +80,14 @@ print(outputs.waveform.shape)
 - `Sanskrit`
 - `Tamil`
 - `Telugu`
 ---
 ## Speaker-Style Identifier Overview
 <div style="display: flex; align-items: flex-start; gap: 20px; margin: 0; padding: 0;">
-<table>
   <tr>
     <th>Speaker Name</th>
     <th>Speaker ID</th>
@@ -233,6 +238,7 @@ print(outputs.waveform.shape)
 </table>
 </div>
 ---
 ## Citation

 # VITS TTS for Indian Languages
 This repository contains a VITS-based Text-to-Speech (TTS) model fine-tuned for Indian languages. The model supports multiple Indian languages and a wide range of speaking styles and emotions, making it suitable for diverse use cases such as conversational AI, audiobooks, and more.
 ---
 ## Model Overview
 - **Languages**: Multiple Indian languages.
 - **Styles**: Various speaking styles and emotions.
 - **Speaker IDs**: Predefined speaker profiles for male and female voices.
 ---
 ## Installation
 ```bash
 pip install transformers torch
 ```
 ---
 ## Usage
 sf.write("audio.wav", outputs.waveform.squeeze(), model.config.sampling_rate)
 print(outputs.waveform.shape)
 ```
 ---
 ## Supported Languages
 - `Sanskrit`
 - `Tamil`
 - `Telugu`
 ---
 ## Speaker-Style Identifier Overview
 <div style="display: flex; align-items: flex-start; gap: 20px; margin: 0; padding: 0;">
+<table style="margin: 0; padding: 0; border-spacing: 0;">
   <tr>
     <th>Speaker Name</th>
     <th>Speaker ID</th>
 </table>
 </div>
 ---
 ## Citation