AshwinSankar committed on
Commit
00b1590
1 Parent(s): b89874f

Update README.md

Files changed (1)
  1. README.md +7 -1
README.md CHANGED
@@ -22,6 +22,7 @@ tags:
 # VITS TTS for Indian Languages
 
 This repository contains a VITS-based Text-to-Speech (TTS) model fine-tuned for Indian languages. The model supports multiple Indian languages and a wide range of speaking styles and emotions, making it suitable for diverse use cases such as conversational AI, audiobooks, and more.
+
 ---
 
 ## Model Overview
@@ -30,6 +31,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 - **Languages**: Multiple Indian languages.
 - **Styles**: Various speaking styles and emotions.
 - **Speaker IDs**: Predefined speaker profiles for male and female voices.
+
 ---
 
 ## Installation
@@ -37,6 +39,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 ```bash
 pip install transformers torch
 ```
+
 ---
 
 ## Usage
@@ -59,6 +62,7 @@ outputs = model(inputs['input_ids'], speaker_id=speaker_id, emotion_id=style_id)
 sf.write("audio.wav", outputs.waveform.squeeze(), model.config.sampling_rate)
 print(outputs.waveform.shape)
 ```
+
 ---
 
 ## Supported Languages
@@ -76,13 +80,14 @@ print(outputs.waveform.shape)
 - `Sanskrit`
 - `Tamil`
 - `Telugu`
+
 ---
 
 ## Speaker-Style Identifier Overview
 
 <div style="display: flex; align-items: flex-start; gap: 20px; margin: 0; padding: 0;">
 
-<table>
+<table style="margin: 0; padding: 0; border-spacing: 0;">
 <tr>
 <th>Speaker Name</th>
 <th>Speaker ID</th>
@@ -233,6 +238,7 @@ print(outputs.waveform.shape)
 </table>
 
 </div>
+
 ---
 
 ## Citation