1k+ участников

Вступить в Telegram

n8n

•

02.10.2025

Youtube Summariser: AI assistant for analyzing YouTube videos on Telegram

•

8 articles

Process description

This automation is an intelligent system for working with YouTube videos via the Telegram interface. The system automatically receives the transcription of any YouTube video, translates it into Russian if necessary, creates short retellings, answers questions about the video content, and can save the full transcription in Google Docs for further work.

API keys and services:

Telegram Bot API - for the user interface
OpenAI API Key - for AI agent (gpt-4.1-mini)
Supadata API - to get YouTube video transcriptions
Google Translate API - for translating transcriptions into Russian
Google Docs OAuth2 - for creating and editing documents

System architecture by blocks

SECTION 1: RECEIVING AND PROCESSING REQUESTS

1.1 Telegram Trigger — Entry Point

Purpose: Tracks all incoming messages from users with YouTube video URLs
Settings:

Updates: message
Credentials: Your tg bot

1.2 AI Agent - Central Processor

Purpose: An intelligent agent that coordinates the entire video process

System prompt:

You're a useful AI assistant who answers questions based on YouTube video transcriptions.

VIDEO CONTEXT:

1. To get a video transcription, take just the video URL and use it in the Transcription tool in placeholder {url}.

2. After you receive the text from the video, determine what language it is in, if not Russian, then use the Translate tool to translate the entire text into Russian.

3. Next, offer the user a retelling of the video clip and simply answer his questions on the topic.

4. At the user's request, to pack all the information into a document, use the Create Document tool to create a new document and immediately after that, use the Paste Info tool to add the Russian transcription into the document. After you upload the entire text to the document, announce it to the user and share it.

Instructions:

1. Answer questions based ONLY on the video transcript.

2. Link to your conversation history to maintain context.

3. If you can't find the answer in the transcript, say, “I couldn't find this information in this video.”

4. Enter timestamps if they are in the transcript.

5. When requesting a resume, create comprehensive resumes that cover key points.

6. Keep the original context and meaning of the video.

Connected components:

OpenAI Chat Model (gpt-4.1-mini) - language model
Simple Memory1 - personal memory for every chat
Transcription - getting a transcription
Translate - Russian translation
Create a document - creating a Google document
Paste Info - inserting text into a document

SECTION 2: VIDEO PROCESSING TOOLS

2.1 Transcription — Getting a transcription

Purpose: Extracts text transcription from YouTube videos via Supadata API

Settings:

Tool Description: “A tool for obtaining video transcription”
URL: https://api.supadata.ai/v1/transcript?url={url})&text=true
Headers:
- x-api-key: your api key
Placeholder:
- name: url
- description: “Full link to the video”
- type: string

2.2 Translate — Transcription translation

Purpose: Translates transcription into Russian via Google Translate API

Settings:

Tool Description: “A tool for translating video transcription”
Method: POST
URL: https://translation.googleapis.com/language/translate/v2?key=ВАШ_АПИ_КЛЮЧ
Body:

json

{

“q”: “{text}”,

“source”: “en”,

“target”: “ru”,

“format”: “text”

}

Placeholder:
- name: text
- description: “Full video transcription”

SECTION 3: WORKING WITH DOCUMENTS

3.1 Create a document - Create a Google document

Purpose: Creates a new Google document to save transcription

Settings:

Folder ID: default
Title: filled in by an AI agent via $fromAI
Credentials: Google Docs account 2

3.2 Paste Info — Paste information

Purpose: Inserts the translated transcription into a Google-generated document

Settings:

Operation: update
Document URL: filled in by an AI agent
Action: insert
Text: transcription in Russian
Credentials: Google Docs account 3

SECTION 4: MEMORY AND RESPONSE COMPONENTS

4.1 Simple Memory1 - Contextual memory

Purpose: Saves a conversation history for each user

Settings:

Session ID Type: CustomKey
Session Key: $json.message.chat.id
Context Window: 10 posts

4.2 OpenAI Chat Model — Language Model

Purpose: The main model for an AI agent

Settings:

Model: gpt-4.1-mini
Credentials: Your OpenAI account

4.3 Send a text message - Send a response

Purpose: Sends the AI agent's responses back to the user on Telegram

Settings:

Chat ID: $ ('Telegram Trigger') .item.json.message.chat.id
Text: $json.output
AppendAttribution: false

Node connection diagram

Main stream:

Telegram Trigger → AI Agent → Send a text message

AI component connections:

OpenAI Chat Model ─

Simple Memory1 ────→ AI Agent

Transcription ─────

Translate ─────────

Create a document ─

Paste Info ────────

System workflow

Step 1: Getting a transcription

A user sends a YouTube video URL
AI Agent extracts a clean URL
Calls Transcription with a URL as a parameter
Gets full video transcription

Step 2: Translation (if required)

AI Agent detects the transcription language
If the language is not Russian, it calls Translate
Receives a full Russian translation

Step 3: Analysis and Response

AI Agent creates a short video retelling
Highlights key points
Sends a resume to the user
I am ready to answer questions about the content

Step 4: Save to document (optional)

When requested by the user, calls Create a document
Creates a new Google document with the title
Through Paste Info, inserts the full transcription
Sends a link to the document to the user

Required services and their settings

Telegram Bot setup:

Create a bot via @BotFather
Get the token and save it as “Video Transcript” credentials
Set up message permissions
Set the bot description

Supadata API setup:

Sign up for supadata.ai
Get an API key
Activate access to the transcript endpoint
Check your request limits

Setting up Google Translate API:

Activate Google Cloud Translation API
Get an API key
Set up quotas and billing
Check language support

Setting up Google Docs:

Set up OAuth2 for two Google Docs accounts
Give permissions to create and edit documents
Choose a default folder to save
Check document sharing rights

OpenAI setup:

Get the OpenAI API key
Configure gpt-4.1-mini model
Check your token limits
Set up billing

System capabilities

Key features:

Extracting transcriptions from any YouTube videos
Automatic translation into Russian
Creating short paraphrases with key points
Answers to questions by video content
Save to Google Docs for further work
Specifying timestamps if available

AI capabilities:

Contextual understanding video content
Highlighting the main ideas from long videos
Personalized answers to questions
Preserving history dialogue

Language options:

Multilanguage support through translation
Preserving meaning when translating
Adaptation for a Russian-speaking audience

Use cases

Scenario 1: Educational video analysis

👤 User: https://youtube.com/watch?v=example

🤖 AI: Receiving video transcription...

🤖 AI: Machine Learning Fundamentals Video (45 minutes)

📝 Summary:

1. Introduction to ML (0:00-5:00)

2. Algorithm types (5:00-15:00)

3. Practical examples (15:00-35:00)

4. Conclusions (35:00-45:00)

Key ideas:

- Machine learning is divided into 3 types...

- Key algorithms include...

I'm ready to answer your questions!

Scenario 2: Working with foreign content

👤 User: [English video link]

🤖 AI: I'm getting a transcription...

🤖 AI: English language discovered, I'm translating...

🤖 AI: Video translated! Here's a summary in Russian...

Scenario 3: Save for study

👤 User: Save all transcription to a document

🤖 AI: Creating a document...

🤖 AI: Uploading the full transcription...

🤖 AI: ✅ The document is ready! Here's a link: [Google Docs URL]

Now you can work with text offline!

Scenario 4: Detailed questions

👤 User: What was said about neural networks?

🤖 AI: The video about neural networks mentioned:

- At 12:30 p.m. - definition and structure

- At 18:45 - application examples

- At 25:00 - comparison with other methods

[detailed answer]

System application

For students:

Lecture notes from YouTube
Exam preparation based on videos
Studying foreign courses with translation
Quick search for information in long videos

For researchers:

Conference analysis and webinars
Collecting quotes from a video interview
Creating reviews based on videos
Archiving important information

For business:

Competitor analysis based on their videos
Employee training through video courses
Documenting video meetings
Monitoring trends through video content

For content creators:

Exploring topics for new videos
Popular content analysis
Creating subtitles and descriptions
Searching for ideas from other videos

The result of the system

What happens is:

Instant access to the content of any YouTube video
Save time - no need to watch long videos
The language barrier has been removed via auto translation
Structured information instead of a stream of speech
Knowledge archive on Google Docs

Performance metrics:

Processing speed: 10-30 seconds per video
Transcription accuracy: 95% +
Translation quality: 85-90%
Save time: up to 90% of video length
Completeness of answers: 80-85% of questions are covered

System advantages:

Versatility: works with any YouTube videos
Multilingualism: automatic translation
Intellectuality: understanding the context
Accessibility: simple interface via Telegram
Safety: archiving in Google Docs

ROI and practical value:

Save time: 45 minutes of video = 5 minutes of reading
Increased productivity: +300% learning rate
Lowering the language barrier: access to global content
Improving learning: better memorization through text
Scalability: processing an unlimited number of videos

write to us and we will show you the way to efficiency

Olga

@olga_startduck

Youtube Summariser: AI assistant for analyzing YouTube videos on Telegram

Process description

API keys and services:

System architecture by blocks

SECTION 1: RECEIVING AND PROCESSING REQUESTS

1.1 Telegram Trigger — Entry Point

1.2 AI Agent - Central Processor

SECTION 2: VIDEO PROCESSING TOOLS

2.1 Transcription — Getting a transcription

2.2 Translate — Transcription translation

SECTION 3: WORKING WITH DOCUMENTS

3.1 Create a document - Create a Google document

3.2 Paste Info — Paste information

SECTION 4: MEMORY AND RESPONSE COMPONENTS

4.1 Simple Memory1 - Contextual memory

4.2 OpenAI Chat Model — Language Model

4.3 Send a text message - Send a response

Node connection diagram

Main stream:

AI component connections:

System workflow

Step 1: Getting a transcription

Step 2: Translation (if required)

Step 3: Analysis and Response

Step 4: Save to document (optional)

Required services and their settings

Telegram Bot setup:

Supadata API setup:

Setting up Google Translate API:

Setting up Google Docs:

OpenAI setup:

System capabilities

Key features:

AI capabilities:

Language options:

Use cases

Scenario 1: Educational video analysis

Scenario 2: Working with foreign content

Scenario 3: Save for study

Scenario 4: Detailed questions

System application

For students:

For researchers:

For business:

For content creators:

The result of the system

What happens is:

Performance metrics:

System advantages:

ROI and practical value:

Читайте также

write to us and we will show you the way to efficiency