ggml-medium.bin
ggml-medium.bin is a speech recognition model, specifically formatted for use with whisper.cpp. It is built on Whisper, OpenAI’s state-of-the-art model trained on 680,000 hours of multilingual and multitask supervised data.
Core Specifications
Model Type: Automatic Speech Recognition (ASR).
File Format:
In the rapidly evolving landscape of on-device artificial intelligence, file extensions like .bin are commonplace, but few have garnered as much quiet respect among hobbyists and developers as the ggml-medium.bin file. If you have dabbled with running large language models (LLMs) or whisper.cpp (the automatic speech recognition system) on a CPU, you have almost certainly encountered this specific file.
The GGML ecosystem thrives on offering a spectrum of model sizes. Here’s how the Whisper medium model compares:
Before GGML, running high-parameter LLMs typically required expensive NVIDIA GPUs with substantial VRAM. Georgi Gerganov, the creator of the whisper.cpp and llama.cpp projects, demonstrated that by using 4-bit and 5-bit quantization techniques, these massive models could be compressed and run efficiently on the unified memory architecture of Apple M1/M2 chips.
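To make the idea of 4-bit quantization concrete, here is a minimal Python sketch of block-wise quantization: each block of weights shares one scale, and every weight is stored as a small signed integer. This is an illustration only, not the on-disk layout used by GGML. The block size of 32, the 16-bit scale, and the function names are all assumptions chosen for the example.

```python
from typing import List, Tuple

# Illustrative block length. An assumption for this sketch, not read from any model file.
BLOCK_SIZE = 32


def quantize_block(values: List[float]) -> Tuple[float, List[int]]:
    """Map one block of floats to a shared scale and signed 4-bit integers in [-8, 7]."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Pick the scale so the largest magnitude lands on +/-7; avoid dividing by zero.
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    quants = [max(-8, min(7, round(v / scale))) for v in values]
    return scale, quants


def dequantize_block(scale: float, quants: List[int]) -> List[float]:
    """Recover approximate floats by multiplying each integer by the block scale."""
    return [scale * q for q in quants]


def bits_per_weight(block_size: int = BLOCK_SIZE, scale_bits: int = 16) -> float:
    """Average storage cost: 4 bits per value plus one shared scale per block."""
    return 4 + scale_bits / block_size


if __name__ == "__main__":
    import random

    random.seed(0)
    weights = [random.uniform(-1, 1) for _ in range(BLOCK_SIZE)]
    scale, quants = quantize_block(weights)
    restored = dequantize_block(scale, quants)
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"bits per weight: {bits_per_weight():.2f}")
    print(f"worst-case error: {worst:.4f} (scale {scale:.4f})")
```

Under these assumptions each weight costs about 4.5 bits instead of 16, roughly a 3.5x reduction, and the reconstruction error for any single weight is at most half the block scale. That trade of a small, bounded error for a much smaller memory footprint is what lets large models fit in ordinary system memory.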