Meet PromptingWhisper: Using Prompt Engineering to Adapt the Whisper Model to Unseen Tasks, the Proposed Prompts Enhances Performance by 10% to 45% on Three Zero-Shot Tasks


GPT-4: Researchers have adapted OpenAI’s Whisper model, an automatic speech recognition system, to perform unseen tasks using simple prompts. The study, called PromptingWhisper, investigates the zero-shot task generalization abilities of the model and focuses on three specific tasks: audio-visual speech recognition, code-switched speech recognition, and speech translation. By designing task-specific prompts, the researchers significantly improved performance across the tasks, with gains ranging from 10% to 45%. The study highlights Whisper’s robustness to different prompts, accent biases, and multilingual understanding.
Read more at MarkTechPost…

Discover more from Emsi's feed

Subscribe now to keep reading and get access to the full archive.

Continue reading