Youtube Summariser: AI assistant for analyzing YouTube videos on Telegram
.jpg)
Process description
This automation is an intelligent system for working with YouTube videos via the Telegram interface. The system automatically receives the transcription of any YouTube video, translates it into Russian if necessary, creates short retellings, answers questions about the video content, and can save the full transcription in Google Docs for further work.
API keys and services:
- Telegram Bot API - for the user interface
- OpenAI API Key - for AI agent (gpt-4.1-mini)
- Supadata API - to get YouTube video transcriptions
- Google Translate API - for translating transcriptions into Russian
- Google Docs OAuth2 - for creating and editing documents
System architecture by blocks
SECTION 1: RECEIVING AND PROCESSING REQUESTS
1.1 Telegram Trigger — Entry Point
Purpose: Tracks all incoming messages from users with YouTube video URLs
Settings:
- Updates: message
- Credentials: Your tg bot
1.2 AI Agent - Central Processor
Purpose: An intelligent agent that coordinates the entire video process
System prompt:
You're a useful AI assistant who answers questions based on YouTube video transcriptions.
VIDEO CONTEXT:
1. To get a video transcription, take just the video URL and use it in the Transcription tool in placeholder {url}.
2. After you receive the text from the video, determine what language it is in, if not Russian, then use the Translate tool to translate the entire text into Russian.
3. Next, offer the user a retelling of the video clip and simply answer his questions on the topic.
4. At the user's request, to pack all the information into a document, use the Create Document tool to create a new document and immediately after that, use the Paste Info tool to add the Russian transcription into the document. After you upload the entire text to the document, announce it to the user and share it.
Instructions:
1. Answer questions based ONLY on the video transcript.
2. Link to your conversation history to maintain context.
3. If you can't find the answer in the transcript, say, “I couldn't find this information in this video.”
4. Enter timestamps if they are in the transcript.
5. When requesting a resume, create comprehensive resumes that cover key points.
6. Keep the original context and meaning of the video.
Connected components:
- OpenAI Chat Model (gpt-4.1-mini) - language model
- Simple Memory1 - personal memory for every chat
- Transcription - getting a transcription
- Translate - Russian translation
- Create a document - creating a Google document
- Paste Info - inserting text into a document
SECTION 2: VIDEO PROCESSING TOOLS
2.1 Transcription — Getting a transcription
Purpose: Extracts text transcription from YouTube videos via Supadata API
Settings:
- Tool Description: “A tool for obtaining video transcription”
- URL: https://api.supadata.ai/v1/transcript?url={url})&text=true
- Headers:
- x-api-key: your api key
- Placeholder:
- name: url
- description: “Full link to the video”
- type: string
2.2 Translate — Transcription translation
Purpose: Translates transcription into Russian via Google Translate API
Settings:
- Tool Description: “A tool for translating video transcription”
- Method: POST
- URL: https://translation.googleapis.com/language/translate/v2?key=ВАШ_АПИ_КЛЮЧ
- Body:
json
{
“q”: “{text}”,
“source”: “en”,
“target”: “ru”,
“format”: “text”
}
- Placeholder:
- name: text
- description: “Full video transcription”
SECTION 3: WORKING WITH DOCUMENTS
3.1 Create a document - Create a Google document
Purpose: Creates a new Google document to save transcription
Settings:
- Folder ID: default
- Title: filled in by an AI agent via $fromAI
- Credentials: Google Docs account 2
3.2 Paste Info — Paste information
Purpose: Inserts the translated transcription into a Google-generated document
Settings:
- Operation: update
- Document URL: filled in by an AI agent
- Action: insert
- Text: transcription in Russian
- Credentials: Google Docs account 3
SECTION 4: MEMORY AND RESPONSE COMPONENTS
4.1 Simple Memory1 - Contextual memory
Purpose: Saves a conversation history for each user
Settings:
- Session ID Type: CustomKey
- Session Key: $json.message.chat.id
- Context Window: 10 posts
4.2 OpenAI Chat Model — Language Model
Purpose: The main model for an AI agent
Settings:
- Model: gpt-4.1-mini
- Credentials: Your OpenAI account
4.3 Send a text message - Send a response
Purpose: Sends the AI agent's responses back to the user on Telegram
Settings:
- Chat ID: $ ('Telegram Trigger') .item.json.message.chat.id
- Text: $json.output
- AppendAttribution: false
Node connection diagram
Main stream:
Telegram Trigger → AI Agent → Send a text message
AI component connections:
OpenAI Chat Model ─
Simple Memory1 ────→ AI Agent
Transcription ─────
Translate ─────────
Create a document ─
Paste Info ────────
System workflow
Step 1: Getting a transcription
- A user sends a YouTube video URL
- AI Agent extracts a clean URL
- Calls Transcription with a URL as a parameter
- Gets full video transcription
Step 2: Translation (if required)
- AI Agent detects the transcription language
- If the language is not Russian, it calls Translate
- Receives a full Russian translation
Step 3: Analysis and Response
- AI Agent creates a short video retelling
- Highlights key points
- Sends a resume to the user
- I am ready to answer questions about the content
Step 4: Save to document (optional)
- When requested by the user, calls Create a document
- Creates a new Google document with the title
- Through Paste Info, inserts the full transcription
- Sends a link to the document to the user
Required services and their settings
Telegram Bot setup:
- Create a bot via @BotFather
- Get the token and save it as “Video Transcript” credentials
- Set up message permissions
- Set the bot description
Supadata API setup:
- Sign up for supadata.ai
- Get an API key
- Activate access to the transcript endpoint
- Check your request limits
Setting up Google Translate API:
- Activate Google Cloud Translation API
- Get an API key
- Set up quotas and billing
- Check language support
Setting up Google Docs:
- Set up OAuth2 for two Google Docs accounts
- Give permissions to create and edit documents
- Choose a default folder to save
- Check document sharing rights
OpenAI setup:
- Get the OpenAI API key
- Configure gpt-4.1-mini model
- Check your token limits
- Set up billing
System capabilities
Key features:
- Extracting transcriptions from any YouTube videos
- Automatic translation into Russian
- Creating short paraphrases with key points
- Answers to questions by video content
- Save to Google Docs for further work
- Specifying timestamps if available
AI capabilities:
- Contextual understanding video content
- Highlighting the main ideas from long videos
- Personalized answers to questions
- Preserving history dialogue
Language options:
- Multilanguage support through translation
- Preserving meaning when translating
- Adaptation for a Russian-speaking audience
Use cases
Scenario 1: Educational video analysis
👤 User: https://youtube.com/watch?v=example
🤖 AI: Receiving video transcription...
🤖 AI: Machine Learning Fundamentals Video (45 minutes)
📝 Summary:
1. Introduction to ML (0:00-5:00)
2. Algorithm types (5:00-15:00)
3. Practical examples (15:00-35:00)
4. Conclusions (35:00-45:00)
Key ideas:
- Machine learning is divided into 3 types...
- Key algorithms include...
I'm ready to answer your questions!
Scenario 2: Working with foreign content
👤 User: [English video link]
🤖 AI: I'm getting a transcription...
🤖 AI: English language discovered, I'm translating...
🤖 AI: Video translated! Here's a summary in Russian...
Scenario 3: Save for study
👤 User: Save all transcription to a document
🤖 AI: Creating a document...
🤖 AI: Uploading the full transcription...
🤖 AI: ✅ The document is ready! Here's a link: [Google Docs URL]
Now you can work with text offline!
Scenario 4: Detailed questions
👤 User: What was said about neural networks?
🤖 AI: The video about neural networks mentioned:
- At 12:30 p.m. - definition and structure
- At 18:45 - application examples
- At 25:00 - comparison with other methods
[detailed answer]
System application
For students:
- Lecture notes from YouTube
- Exam preparation based on videos
- Studying foreign courses with translation
- Quick search for information in long videos
For researchers:
- Conference analysis and webinars
- Collecting quotes from a video interview
- Creating reviews based on videos
- Archiving important information
For business:
- Competitor analysis based on their videos
- Employee training through video courses
- Documenting video meetings
- Monitoring trends through video content
For content creators:
- Exploring topics for new videos
- Popular content analysis
- Creating subtitles and descriptions
- Searching for ideas from other videos
The result of the system
What happens is:
- Instant access to the content of any YouTube video
- Save time - no need to watch long videos
- The language barrier has been removed via auto translation
- Structured information instead of a stream of speech
- Knowledge archive on Google Docs
Performance metrics:
- Processing speed: 10-30 seconds per video
- Transcription accuracy: 95% +
- Translation quality: 85-90%
- Save time: up to 90% of video length
- Completeness of answers: 80-85% of questions are covered
System advantages:
- Versatility: works with any YouTube videos
- Multilingualism: automatic translation
- Intellectuality: understanding the context
- Accessibility: simple interface via Telegram
- Safety: archiving in Google Docs
ROI and practical value:
- Save time: 45 minutes of video = 5 minutes of reading
- Increased productivity: +300% learning rate
- Lowering the language barrier: access to global content
- Improving learning: better memorization through text
- Scalability: processing an unlimited number of videos

