Pretrained Models on Huggingface

Model License

  • Apache License 2.0

Model Zoo

We provide several pretrained models trained on different datasets. Details of the models and datasets can be found on ModelScope.

Speech Recognition Models

Paraformer Models

Model Name | Language | Training Data | Vocab Size | Parameters | Offline/Online | Notes
Paraformer-large | CN & EN | Alibaba Speech Data (60,000 hours) | 8404 | 220M | Offline | Duration of input wav <= 20 s
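The Notes column above limits input audio for Paraformer-large to 20 seconds. As a minimal sketch of how one might check that limit before offline recognition, the snippet below reads a WAV header with Python's standard `wave` module. The function names and the `MAX_DURATION_S` constant are hypothetical helpers for illustration, not part of any model API, and the sketch assumes uncompressed PCM WAV input.

```python
import wave

# Limit taken from the Notes column for Paraformer-large (assumed to be in seconds).
MAX_DURATION_S = 20.0


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        # Duration = number of audio frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def fits_offline_limit(path, max_duration_s=MAX_DURATION_S):
    """Return True if the file is short enough for the 20 s input limit."""
    return wav_duration_seconds(path) <= max_duration_s
```

Longer recordings would need to be split into shorter segments first, for example with a voice activity detection model, before being passed to the recognizer.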

UniASR Models

Conformer Models

RNN-T Models

Multi-talker Speech Recognition Models

MFCCA Models

Voice Activity Detection Models

Model Name | Training Data | Parameters | Sampling Rate (Hz) | Notes
FSMN-VAD | Alibaba Speech Data (5,000 hours) | 0.4M | 16000 | -
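The table above lists a sampling rate of 16000 for FSMN-VAD. Assuming this means the model expects 16 kHz input, a quick header check like the sketch below can catch mismatched files early. The function name and constant are hypothetical, and the check covers only uncompressed PCM WAV files readable by Python's standard `wave` module.

```python
import wave

# Value from the Sampling Rate column for FSMN-VAD (assumed to be in Hz).
EXPECTED_SAMPLE_RATE = 16000


def has_expected_sample_rate(path, expected=EXPECTED_SAMPLE_RATE):
    """Return True if the WAV file's sampling rate matches the expected rate."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate() == expected
```

Files that fail the check would need to be resampled with an audio tool of your choice before detection.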

Punctuation Restoration Models

Model Name | Training Data | Parameters | Vocab Size | Offline/Online | Notes
CT-Transformer | Alibaba Text Data | 70M | 272727 | Offline | Offline punctuation model

Language Models

Speaker Verification Models

Speaker Diarization Models

Timestamp Prediction Models