Python & OPENAI- Media to Transcripts and Summaries with a Laptop & Microphone
OpenAI Whisper Media-to-Text #Python, #Linux, #FFmpeg
Whisper Loop AI and API Toolkit Part 1
OpenAi Whisper https://openai.com/research/whisper
There are three parts in this series. This one is the most useful and cost effective compared to other solutions.
Imagine having a one hour professional development session, recorded to video. Then taking that video and rapidly converting it to a transcript and then a bullet point summary of all the important information.
This project is using Linux and Python, however, you can do it with a normal Windows or Apple laptop.
The AI is OpenAI API, and it is using the older model (3.5). Many of the techniques are used to reduce the number of tokens that go to the AI. With the expanded models, I would leave the up-front filtering to keep the tokens down, but, I would expand the response and reduce the current cap set for output.
All the scripts are here:
https://github.com/domain7tech/whisperLoop
This post is free, but the solution is loaded with techniques for processing multiple types of data and files with AI. Even working with an AI assistant, this would take most people 15-30 days to complete. This is also fast, and batch processes hours of audio files from video files. Consider joining as a subscriber to support this work. 2 hours of your time, for weeks of work, and techniques you can use for years to come.