Skip to main content
Analyze image content using OpenAI’s GPT-4 Vision API, understanding objects, text, colors, and other elements in images.

Dependencies

DependencyDescription
OPENAI_API_KEYOpenAI API key
curl, base64System commands (usually pre-installed)
Configuration:
  • Configure OPENAI_API_KEY via the env_config tool
  • Or set open_ai_api_key in config.json

Supported Models

  • gpt-4.1-mini (recommended, cost-effective)
  • gpt-4.1

Usage

Once configured, send an image to the Agent to automatically trigger image recognition.