Skip to content

whisper.cpp 1.20 produces different inference than OpenAI whisper and with higher WER #472

Open
@jordimas

Description

@jordimas

Hello!

First, thanks for writing such a great tool.

Whisper.cpp: version 1.20
Open AI: version openai-whisper-20230124
Model used: medium

Audio file used: https://github.com/jordimas/whisper-cpp-error/raw/main/15GdH9-curt.mp3
Open AI transcription: https://raw.githubusercontent.com/jordimas/whisper-cpp-error/main/15GdH9-curt/15GdH9-curt.mp3.txt
Whisper.cpp: transcription: https://raw.githubusercontent.com/jordimas/whisper-cpp-error/main/15GdH9-curt.wav.txt

I will expect Whisper.cpp to produce the same output under the same model and input than OpenAI Whisper.

In terms of WER against reference the txt human transcribed file: OpenAI whisper -WER: 28.08, Whisper.cpp : WER 35.86

If there is anything that I can do to help, let me know

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    decodingDecoding related issues

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions