11/29/2023 0 Comments Transcription ai gethubWe will be sure to adhere to any/all licensing terms and in the event that we cannot bundle ffmpeg with Stage Whisper, we will make it as easy to obtain as possible for the end-user. Whisper is MIT licensed, but some of its dependencies (FFmpeg) are licensed under different terms. Join our Discord server to discuss the project's planning and development.Īny code that we distribute will be open sourced and follow the license terms of any of the projects that we are using.Want to contribute? Check out our good first issues and our contributing guide.Find a bug? Open an issue so that we can see how we can fix it.Request features or ask questions on the project discussions on GitHub.We are currently working on implementing major improvements and hope to release a beta version soon. The app will be available for MacOS, Windows, and Linux. We have a working prototype that uses the Electron and Mantine frameworks to create an app that allows users to input audio files, transcribe them using Whisper, and then manage and edit the resulting transcriptions. The project is currently in the early stages of development. We'd love to collaborate with anyone who has ideas about how we could more easily package Whisper and make it easy to use for non-technical users. Who is and (Christina Warren) created the project, and and (Sarah Kaiser) are leading the development with (Adam Newton-Blows) leading frontend development. Peter came up with the project name, Stage Whisper. Our goal is to package Whisper in an easier to use way so that less technical users can take advantage of this neural net. The only problem, as pointed out, is that not all journalists (or others who could benefit from this type of transcription tool) are comfortable with the command line and installing the dependencies required to run Whisper. There's any number of ways to get all these dependencies installed on your workstation, but here is one example of how you might install all of the above on a Mac (skip any step for something you have already installed):Įarlier this year, OpenAI released Whisper, its automatic speech recognition (ASR) system that is trained on "680,000 hours of multilingual and multitask supervised data collected from the web." You can learn more by reading the paper or looking at the examples on OpenAI's website.Īs Dan Nguyen noted on Twitter, this could be a "godsend for newsrooms." It is currently possible to separately work on the Electron interface or the Python backend, so if you are planning to only work on one or the other, you only have to install the requirements specific to that component. For now, though, you will need the following installed on your machine to develop Stage Whisper. The eventual 1.0 release of Stage Whisper will (ideally) not require any additional software. A Python backend that interfaces with OpenAI's Whisper library.Stage Whisper consists of two connected components: Stage Whisper uses OpenAI's Whisper machine learning model to produce very accurate transcriptions of audio files, and also allows users to store and edit transcriptions using a simple and intuitive graphical user interface. This is the main repo for Stage Whisper - a free, open-source, and easy-to-use audio transcription app.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |