Dec. 10, 2023, 8:42 p.m. | Andrew Jones

DEV Community dev.to

Whisper is OpenAI's speech-to-text transcription model. Developers submit audio and an optional styling prompt, and get transcribed text in response.


However, the official OpenAI Node.js SDK API docs only show one way to use Whisper - reading an audio file with fs.



async function main() {
  const transcription = await openai.audio.transcriptions.create({
    file: fs.createReadStream("audio.mp3"),
    model: "whisper-1", …
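
When the audio is already in memory rather than on disk, one possible alternative is to post it to the transcription REST endpoint as a multipart form. The sketch below is only an illustration under stated assumptions, and not necessarily the approach the article goes on to describe: it assumes Node.js 18+ (for the global `fetch`, `FormData`, and `Blob`), reads the API key from an `OPENAI_API_KEY` environment variable, and uses two hypothetical helper names, `buildTranscriptionForm` and `transcribe`.

```javascript
// Sketch: transcribe in-memory audio without reading a file through fs.
// Assumes Node.js 18+ so that fetch, FormData, and Blob are available globally.

// Hypothetical helper: builds the multipart form body for a transcription request.
function buildTranscriptionForm(audioBytes, { filename = "audio.mp3", prompt } = {}) {
  const form = new FormData();
  // The third argument attaches a filename to the uploaded bytes.
  form.append("file", new Blob([audioBytes]), filename);
  form.append("model", "whisper-1");
  // The styling prompt is optional, so only send it when one was given.
  if (prompt) form.append("prompt", prompt);
  return form;
}

// Hypothetical helper: posts the form and resolves to the transcribed text.
async function transcribe(audioBytes, options = {}) {
  const response = await fetch("https://api.openai.com/v1/audio/transcriptions", {
    method: "POST",
    // Assumption: the API key lives in the OPENAI_API_KEY environment variable.
    headers: { Authorization: `Bearer ${process.env.OPENAI_API_KEY}` },
    body: buildTranscriptionForm(audioBytes, options),
  });
  if (!response.ok) {
    throw new Error(`Transcription request failed: ${response.status}`);
  }
  // With the default JSON response format, the transcript is in the "text" field.
  const { text } = await response.json();
  return text;
}
```

A call might look like `const text = await transcribe(audioBuffer, { prompt: "..." })`, where `audioBuffer` is a `Buffer` or `Uint8Array` obtained from an upload or a download. Keeping the form construction in its own function makes it possible to check the request fields without making a network call.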

