yt-neural-miner
v4.4.6
Neural Miner: Extract Metadata, Audio, Video & Emotions from YouTube using AI pipelines.
Neural Miner
Neural Miner is an advanced, multi-modal AI pipeline designed to mine deep contextual understanding from YouTube videos. It orchestrates a suite of neural networks to extract metadata, transcribe audio, analyze visual storytelling, and derive emotional context—all synced to a structured database.
Features
Metadata Engine
- Extracts rich metadata including Title, Duration, Cast, Singers, and Summaries.
- Uses Llama 3 (via Ollama) to intelligently parse and structure unstructured video descriptions.
- Auto-corrects missing or malformed fields.
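For illustration, a metadata.json produced in local mode might look like the sketch below; the exact field names and shapes are assumptions, not guaranteed output:
# VIDEO_ID is a placeholder for the actual YouTube video ID
cat output/VIDEO_ID/metadata.json
# {
#   "title": "Example Song (Official Video)",
#   "duration": "4:32",
#   "cast": ["Actor A", "Actor B"],
#   "singers": ["Singer A"],
#   "summary": "One-paragraph summary parsed from the video description."
# }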
Audio Engine (Whisper + Romanization)
- Transcription: High-accuracy speech-to-text using OpenAI's Whisper.
- Romanization: Automatically detects non-English segments (e.g., Hindi, Spanish) and converts them to Romanized Text (colloquial spelling) using Llama 3 for better searchability (see the example after this list).
- Noise Filtering: Intelligently removes hallucinations and spam phrases (e.g., "Subscribe now").
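As a sketch of what Romanization produces (the exact colloquial spelling depends on Llama 3's output):
# Whisper transcript segment (Hindi, Devanagari):
#   दिल है छोटा सा, छोटी सी आशा
# Romanized form written to transcript.txt:
#   dil hai chhota sa, chhoti si asha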
Video Engine (Vision Language Model)
- Uses Qwen2-VL-2B-Instruct to "watch" the video.
- Extracts key visual frames and generates a detailed Visual Narrative describing scenery, lighting, actions, and character interactions.
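To run only this engine on a video, use the -p flag documented under Options below:
miner run "https://www.youtube.com/watch?v=VIDEO_ID" -p video --mode local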
Emotion Engine
- Analyzes the combined context of Lyrics, Visuals, and Metadata.
- Derives precise Emotional Tags (e.g., Melancholic, Energetic, Romantic) to categorize content by mood.
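For example, a resulting emotions.json might hold a simple list of tags; the format shown here is an assumption:
cat output/VIDEO_ID/emotions.json
# ["Melancholic", "Romantic"]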
Database Sync
- Seamlessly pushes all extracted data to a PostgreSQL database.
- Smart conflict handling: Updates existing records without overwriting critical IDs.
- Vector-ready: Generates embeddings for semantic search (using BAAI/bge-m3).
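A minimal end-to-end sketch, assuming a reachable local PostgreSQL instance and an illustrative database name:
# create a database, then run the full pipeline straight into it
createdb miner_db
miner run "https://www.youtube.com/watch?v=VIDEO_ID" --mode db --db "postgresql://user:pass@localhost:5432/miner_db"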
Prerequisites
Before installing, ensure you have the following dependencies set up:
- Node.js (v16+)
- Python (v3.10+) with pip.
- FFmpeg installed and added to your system PATH.
- Ollama running locally with the required models:
ollama pull llama3
ollama pull qwen2vl
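You can verify each dependency from a terminal before proceeding:
node --version     # should report v16 or newer
python3 --version  # should report 3.10 or newer
ffmpeg -version    # confirms FFmpeg is on your PATH
ollama list        # should list llama3 and qwen2vl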
Installation
Install the package globally via npm:
npm install -g yt-neural-miner
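A quick sanity check, assuming the CLI exposes the conventional help flag:
miner --help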
Usage
Neural Miner provides a robust CLI with two main modes: Run and Sync.
1. Run Pipeline
Downloads the video and runs the selected analysis engines.
miner run "[https://www.youtube.com/watch?v=VIDEO_ID](https://www.youtube.com/watch?v=VIDEO_ID)"Options:
| Option | Description | Default |
| :-------------- | :------------------------------------------------------------------------ | :---------- |
| -p, --process | Select specific engines (metadata, audio, video, emotions, all) | all |
| --mode | Storage mode (local or db) | Interactive |
| --keep | Keep local files after DB upload | false |
| --cookies | Path to cookies.txt for restricted videos | null |
Example:
# Run only Audio & Metadata, save locally
miner run "[https://youtu.be/xyz](https://youtu.be/xyz)" -s audio metadata --mode local2. Sync Existing Data
2. Sync Existing Data
If you have processed videos locally and want to push the cached data to your database later:
miner sync "[https://www.youtube.com/watch?v=VIDEO_ID](https://www.youtube.com/watch?v=VIDEO_ID)" --db "postgresql://user:pass@localhost:5432/mydb"Output Structure
Output Structure
When running in local mode, artifacts are organized by Video ID:
output/
└── <video_id>/
├── video.mp4 # Source Video File
├── audio.mp3 # Extracted Audio Track
├── metadata.json # Structured Metadata (JSON)
├── transcript.txt # Cleaned & Romanized Transcript
├── video_narrative.txt # Frame-by-frame Visual Analysis
└── emotions.json # List of Derived Emotional Tags
Configuration
You can provide your Database URL in three ways (prioritized order):
1. CLI Flag:
miner run URL --db "postgresql://..."
2. Interactive Prompt: The CLI will ask you for the URL if it is missing.
3. Environment Variable: Set MINER_DB_URL in your system environment or a .env file in the execution directory.
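For example, to set it via the environment (the connection string is illustrative):
export MINER_DB_URL="postgresql://user:pass@localhost:5432/mydb"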
Author
Dheer Jain
- GitHub: calcifer-3118
License
This project is licensed under the ISC License.
