Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenSound
/
CapSpeech-models
like
14
Text-to-Speech
Transformers
Safetensors
arxiv:
2506.02863
License:
cc-by-nc-4.0
Model card
Files
Files and versions
xet
Community
3
Deploy
Use this model
main
CapSpeech-models
33.9 GB
3 contributors
History:
7 commits
OpenSound
nielsr
HF Staff
Add pipeline tag and library name (
#1
)
e3a063a
verified
6 months ago
ar_CapTTS-SE
1
7 months ago
ar_PT
1
7 months ago
nar_duration_predictor
1
7 months ago
.gitattributes
1.52 kB
initial commit
7 months ago
README.md
1.79 kB
Add pipeline tag and library name (#1)
6 months ago
clap-630k-best.pt
1.86 GB
xet
1
7 months ago
nar_AccCapTTS.pt
7.37 GB
xet
add
7 months ago
nar_CapTTS.pt
7.37 GB
xet
1
7 months ago
nar_EmoCapTTS.pt
7.37 GB
xet
1
7 months ago
nar_PT.pt
2.46 GB
xet
1
7 months ago
nar_pretrain.yaml
1.37 kB
add
7 months ago
vocab.txt
599 Bytes
add
7 months ago