Alen-121/transcription

YouTube Transcript Summarizer

This project showcases a system that extracts and analyzes the text of YouTube video transcriptions. A fine-tuned model summarizes the part of the video that falls within a duration specified by the user.

Features

  • Provides a summary of the given transcription
  • Provides the content for a user-specified duration
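
As a rough illustration of the second feature, the sketch below picks out the transcript text that overlaps a requested time window. It is not taken from this repository: the segment shape (`text`, `start`, `duration` in seconds) and the function name `select_window` are assumptions made for the example.

```python
from typing import Dict, List

# Assumed (hypothetical) segment shape:
# {"text": str, "start": float seconds, "duration": float seconds}
Segment = Dict[str, object]


def select_window(segments: List[Segment], start_s: float, end_s: float) -> str:
    """Return the text of every segment that overlaps [start_s, end_s)."""
    if end_s <= start_s:
        raise ValueError("end_s must be greater than start_s")
    picked = []
    for seg in segments:
        seg_start = float(seg["start"])
        seg_end = seg_start + float(seg["duration"])
        # Keep the segment if any part of it falls inside the window.
        if seg_start < end_s and seg_end > start_s:
            picked.append(str(seg["text"]))
    return " ".join(picked)


example = [
    {"text": "Welcome to the video.", "start": 0.0, "duration": 4.0},
    {"text": "First topic.", "start": 4.0, "duration": 6.0},
    {"text": "Thanks for watching.", "start": 10.0, "duration": 3.0},
]

print(select_window(example, 4.0, 10.0))
```

The selected text could then be passed to the summarization model; how the project itself does this may differ.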

Notes & issues:

  • This project isn't complete because of a lack of data.
  • I first trained the model on the transcription (.txt) and then fine-tuned it using the JSON file that contains the durations. The fine-tuned model still needs a lot more training.
  • The accuracy may be low.
  • I first tried to train the model with certain words such as end, ending, finish, close, closure, and termination to teach the model the context.
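
To make the cue-word idea concrete, here is a minimal sketch that flags transcript lines containing any of the words listed above. It is only a plain keyword search, not the training procedure used in this project, and the names `CUE_WORDS` and `find_cue_lines` are hypothetical.

```python
import re
from typing import List

# Cue words taken from the note above.
CUE_WORDS = ["end", "ending", "finish", "close", "closure", "termination"]

# \b anchors keep "end" from matching inside words such as "friend".
CUE_PATTERN = re.compile(r"\b(" + "|".join(CUE_WORDS) + r")\b", re.IGNORECASE)


def find_cue_lines(lines: List[str]) -> List[int]:
    """Return the indices of lines that contain at least one cue word."""
    return [i for i, line in enumerate(lines) if CUE_PATTERN.search(line)]


sample = [
    "My friend said hello",
    "That is the end of the talk",
    "Let's finish here",
]

print(find_cue_lines(sample))
```

A search like this only spots literal matches; it does not capture context, which is presumably why a trained model was the goal.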

Frameworks and Libraries

Additional frameworks and libraries used in this project:

  • LangChain

Installation

1. Clone the repository:

git clone https://github.com/Alen-121/transcription.git

2. Navigate to the project directory and install dependencies:

cd transcription
pip install -r requirements.txt

3. Run the project:

python transcript.py

Environment Variables

To run this project, add the following environment variable to your .env file:

API_KEY: your Google AI Studio API key, which you can create at https://aistudio.google.com/app/apikey
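
The sketch below shows one way to read `API_KEY` from a .env file using only the Python standard library. It is an assumption about how the key might be loaded, not the code in `transcript.py` (which may use a helper library instead); `load_env_file` and `get_api_key` are hypothetical names.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env") -> None:
    """Copy KEY=VALUE pairs from a .env file into os.environ.

    Values already present in the environment are left unchanged.
    """
    env_path = Path(path)
    if not env_path.exists():
        return
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        os.environ.setdefault(key.strip(), value.strip().strip('"').strip("'"))


def get_api_key() -> str:
    """Return API_KEY, or fail with a clear message if it is missing."""
    key = os.environ.get("API_KEY", "")
    if not key:
        raise RuntimeError("API_KEY is not set; add it to .env or the environment")
    return key
```

Calling `load_env_file()` once at startup and then `get_api_key()` where the key is needed keeps the secret out of the source code.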

🚀 About Me

Hello! I'm Alen Sunny. I'm passionate about learning ML, DL, and LLMs, and I'm always eager to learn and grow in my field.

🔗 Links

LinkedIn

About

Find the important parts of a transcription
