This is a simple speech recognition application built with Python and Tkinter. It utilizes the AssemblyAI API for transcription of audio files.
Before running the application, you need to have Python installed on your system. You can download Python from the official website.
-
Clone the repository:
git clone https://github.com/iarslankhalid/speech-recognition-app.git
-
Navigate to the project directory:
cd speech-recognition-app
-
Create a virtual environment (optional but recommended):
python -m venv venv
-
Activate the virtual environment:
- On Windows:
venv\Scripts\activate
- On macOS and Linux:
source venv/bin/activate
- On Windows:
-
Install the required libraries:
pip install -r requirements.txt
To use the AssemblyAI API for transcription, you need to create an account on the AssemblyAI website. Follow these steps:
- Go to the AssemblyAI website.
- Sign up for an account or log in if you already have one.
- Once logged in, navigate to the API section and copy your API key.
- Open file named
api_secrets.py
in the project directory. - Inside
api_secrets.py
, create a variable namedAPI_KEY_ASSEMBLYAI
and assign your API key to it.
API_KEY_ASSEMBLYAI = "your-api-key-goes-here"
- Run the
app.py
file using Python.python app.py
- The application window will open.
- To transcribe an audio file:
- Click on the "Select Audio" button to choose a file from your computer.
- Alternatively, you can click on the "Record Audio" button to record audio directly.
- Once the file is selected or recording is done, click on the "Transcribe" button.
- The transcription will appear in the text box once it's ready.
This project is licensed under the MIT License - see the LICENSE file for details.
If you have any questions or suggestions, feel free to contact me:
- Email: [email protected]
- LinkedIn: LinkedIn Profile
- GitHub: Github Profile