AshwinSankar committed
Commit 00b1590
1 Parent(s): b89874f
Update README.md
README.md CHANGED
@@ -22,6 +22,7 @@ tags:
 # VITS TTS for Indian Languages
 
 This repository contains a VITS-based Text-to-Speech (TTS) model fine-tuned for Indian languages. The model supports multiple Indian languages and a wide range of speaking styles and emotions, making it suitable for diverse use cases such as conversational AI, audiobooks, and more.
+
 ---
 
 ## Model Overview
@@ -30,6 +31,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 - **Languages**: Multiple Indian languages.
 - **Styles**: Various speaking styles and emotions.
 - **Speaker IDs**: Predefined speaker profiles for male and female voices.
+
 ---
 
 ## Installation
@@ -37,6 +39,7 @@ The model `ai4bharat/vits_rasa_13` is based on the VITS architecture and support
 ```bash
 pip install transformers torch
 ```
+
 ---
 
 ## Usage
@@ -59,6 +62,7 @@ outputs = model(inputs['input_ids'], speaker_id=speaker_id, emotion_id=style_id)
 sf.write("audio.wav", outputs.waveform.squeeze(), model.config.sampling_rate)
 print(outputs.waveform.shape)
 ```
+
 ---
 
 ## Supported Languages
@@ -76,13 +80,14 @@ print(outputs.waveform.shape)
 - `Sanskrit`
 - `Tamil`
 - `Telugu`
+
 ---
 
 ## Speaker-Style Identifier Overview
 
 <div style="display: flex; align-items: flex-start; gap: 20px; margin: 0; padding: 0;">
 
-<table>
+<table style="margin: 0; padding: 0; border-spacing: 0;">
 <tr>
 <th>Speaker Name</th>
 <th>Speaker ID</th>
@@ -233,6 +238,7 @@ print(outputs.waveform.shape)
 </table>
 
 </div>
+
 ---
 
 ## Citation
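Review note: apart from the `<table>` style attribute, each added line in this diff places a blank line directly before a `---` rule. The commit message does not state the motivation. One plausible reason is that CommonMark parses a `---` line directly below paragraph text as a setext heading underline, so the paragraph renders as a heading instead of being followed by a rule. The same change could be applied mechanically with a script along these lines. This is a minimal sketch, not the author's tooling: the function name `pad_thematic_breaks` is hypothetical, and it assumes fences are written with triple backticks and that any YAML front matter starts on the first line.

```python
def pad_thematic_breaks(markdown: str) -> str:
    """Insert a blank line before each `---` line that directly follows non-blank text.

    Hypothetical helper for illustration only. It leaves a leading YAML
    front-matter block and the contents of fenced code blocks untouched.
    """
    lines = markdown.split("\n")
    out = []
    start = 0

    # Copy a leading front-matter block (delimited by `---` lines) verbatim.
    if lines and lines[0].strip() == "---":
        for i in range(1, len(lines)):
            if lines[i].strip() == "---":
                start = i + 1
                break
        out.extend(lines[:start])

    in_fence = False
    for line in lines[start:]:
        stripped = line.strip()
        if stripped.startswith("```"):
            # Track whether we are inside a fenced code block.
            in_fence = not in_fence
        elif not in_fence and stripped == "---" and out and out[-1].strip() != "":
            # The previous line is non-blank: pad with a blank line.
            out.append("")
        out.append(line)
    return "\n".join(out)


if __name__ == "__main__":
    sample = "Intro text.\n---\n\n## Model Overview"
    print(pad_thematic_breaks(sample))
```

Running the function a second time leaves the text unchanged, because every `---` rule is then already preceded by a blank line.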