Files
Selig 4c966a3ad2 Initial commit: OpenClaw Skill Collection
6 custom skills (assign-task, dispatch-webhook, daily-briefing,
task-capture, qmd-brain, tts-voice) with technical documentation.
Compatible with Claude Code, OpenClaw, Codex CLI, and OpenCode.
2026-03-13 10:58:30 +08:00

1.4 KiB

Image and Media Support

Overview

The WhatsApp channel via Baileys Web supports media handling with specific rules for sending, gateway processing, and agent replies.

Key Features

CLI Command Structure The documentation specifies: openclaw message send --media <path-or-url> [--message <caption>] for transmitting media with optional accompanying text.

Media Processing Pipeline The system handles various file types differently:

  • Images undergo resizing and recompression to JPEG format with a maximum dimension of 2048 pixels
  • Audio files are converted to voice notes with the ptt flag enabled
  • Documents preserve filenames and support larger file sizes
  • MP4 files can enable looped playback on mobile clients using the gifPlayback parameter

Size Constraints

Outbound limits vary by media category:

  • Images are capped at approximately 6 MB following recompression
  • Audio and video files max out at 16 MB
  • Documents can reach up to 100 MB
  • Media understanding operations have separate thresholds (10 MB for images, 20 MB for audio, 50 MB for video)

Inbound Processing

When messages arrive with attachments, the system downloads media to temporary storage and exposes templating variables for command processing. Audio transcription enables slash command functionality, while image and video descriptions preserve caption text for parsing.