Dockerized REST API service for Persian/Farsi speech-to-text conversion. Utilizes m3hrdadfi/wav2vec2-large-xlsr-persian model.
Go to file
2023-12-15 05:16:36 +03:30
.env.docker Add '.env.docker' 2023-12-15 05:12:00 +03:30
.env.sample Feat: ReST API is done using FastAPI 2023-12-11 03:55:43 +03:30
.gitignore Add 'model/' to '.gitignore' 2023-12-15 05:11:04 +03:30
Dockerfile Add 'Dockerfile' 2023-12-15 05:12:16 +03:30
main.py Feat: ReST API is done using FastAPI 2023-12-11 03:55:43 +03:30
README.md Update README.md 2023-12-15 05:16:36 +03:30
requirements.txt Downgrade python version to 3.10 2023-12-15 05:11:32 +03:30

Persian Speech-to-text

Build docker image

docker build . -t rbehzadan/speech2text-fa:1.0.0

Run docker container

docker run --rm --name speech2text-da -p 8000:8000 -it rbehzadan/speech2text-fa:1.0.0

Testing

With curl

curl -X POST "http://localhost:8000/transcribe" -H "accept: application/json" -H "Content-Type: multipart/form-data" -F "audio_file=@sample.wav"

With HTTPie

http -f POST http://localhost:8000/transcribe audio_file@sample.wav