@thewulf7/openclaw-avatarcam

v1.0.0

Published

a month ago

Generate video messages with a lip-syncing VRM avatar for OpenClaw

0High
0Medium
0Low

thewulf7

openclaw avatar vrm lip-sync video telegram video-note electron

openclaw-avatarcam

Generate video messages with a lip-syncing VRM avatar for OpenClaw.

Installation

npm install -g openclaw-avatarcam

System Dependencies

| Platform | Command | |----------|---------| | macOS | brew install ffmpeg | | Linux | sudo apt-get install -y xvfb xauth ffmpeg | | Windows | Install ffmpeg and add to PATH | | Docker | See Docker Setup |

Note: macOS and Windows don't need xvfb — they have native display support.

Usage

# Basic usage with color background
avatarcam --audio voice.mp3 --output video.mp4 --background "#00FF00"

# With image background
avatarcam --audio voice.mp3 --output video.mp4 --background ./bg.png

# With custom avatar
avatarcam --audio voice.mp3 --output video.mp4 --avatar ./my-avatar.vrm

Options

| Option | Required | Description | |--------|----------|-------------| | --audio <path> | Yes | Input audio file (mp3, wav, etc.) | | --output <path> | Yes | Output video file (.mp4) | | --avatar <path> | No | VRM avatar model (default: built-in) | | --background <val> | No | Color hex or image path (default: #00FF00) |

Output Format

Resolution: 384x384 (square, optimized for Telegram video notes)
Frame rate: 30fps constant
Max duration: 60 seconds
Codec: H.264 + AAC
Container: MP4

OpenClaw Integration

This package includes an OpenClaw skill in skill/SKILL.md.

To install the skill:

# Copy to your OpenClaw skills folder
cp -r $(npm root -g)/openclaw-avatarcam/skill ~/.openclaw/skills/video-message

Or configure manually in openclaw.json:

{
  "skills": {
    "entries": {
      "video-message": {
        "avatar": "default.vrm",
        "background": "#00FF00"
      }
    }
  }
}

Docker Setup

Add to OPENCLAW_DOCKER_APT_PACKAGES:

build-essential procps curl file git ca-certificates xvfb xauth libgbm1 libxss1 libatk1.0-0 libatk-bridge2.0-0 libgdk-pixbuf2.0-0 libgtk-3-0 libasound2 libnss3 ffmpeg

Custom Avatars

Place .vrm files in the models directory or specify with --avatar:

avatarcam --audio voice.mp3 --output video.mp4 --avatar ~/my-avatars/custom.vrm

Technical Details

Electron renders VRM avatar with lip sync at 1280x720
WebM captured via canvas.captureStream(30)
FFmpeg processes: crop → fps normalize → scale → encode
Output: 384x384 MP4 ready for Telegram sendVideoNote

Headless Rendering

The CLI auto-detects headless environments:

Uses xvfb-run when $DISPLAY is not set (Linux only)
macOS uses native Quartz display
GPU stall warnings are safe to ignore

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme