forked from Selig/openclaw-skill
6 custom skills (assign-task, dispatch-webhook, daily-briefing, task-capture, qmd-brain, tts-voice) with technical documentation. Compatible with Claude Code, OpenClaw, Codex CLI, and OpenCode.
1.4 KiB
1.4 KiB
Image and Media Support
Overview
The WhatsApp channel via Baileys Web supports media handling with specific rules for sending, gateway processing, and agent replies.
Key Features
CLI Command Structure
The documentation specifies: openclaw message send --media <path-or-url> [--message <caption>] for transmitting media with optional accompanying text.
Media Processing Pipeline The system handles various file types differently:
- Images undergo resizing and recompression to JPEG format with a maximum dimension of 2048 pixels
- Audio files are converted to voice notes with the
pttflag enabled - Documents preserve filenames and support larger file sizes
- MP4 files can enable looped playback on mobile clients using the
gifPlaybackparameter
Size Constraints
Outbound limits vary by media category:
- Images are capped at approximately 6 MB following recompression
- Audio and video files max out at 16 MB
- Documents can reach up to 100 MB
- Media understanding operations have separate thresholds (10 MB for images, 20 MB for audio, 50 MB for video)
Inbound Processing
When messages arrive with attachments, the system downloads media to temporary storage and exposes templating variables for command processing. Audio transcription enables slash command functionality, while image and video descriptions preserve caption text for parsing.