Skip to content
BeClaude

images-agnes-ai

New
9GitHub TrendingGeneralby FrancoFang667788

This skill should be used when the user asks to "generate an image", "create a picture", "draw", "text to image", "image to image", "img2img", "edit image", "transform image style", "describe image", "recognize image", "image understanding", "generate video", "text to video", "image to video", "animate image", or any image/video generation and recognition task using Agnes AI API.

Summary

This skill enables Claude Code to generate, edit, and recognize images and videos using the Agnes AI API.

  • It supports text-to-image, image-to-image, image recognition, and video generation tasks, making it easy to integrate multimodal AI capabilities into your development workflow.

Overview

Agnes AI Multimodal Generation

Generate images, recognize image content, and generate videos via Agnes AI API.

API Configuration

  • Base: https://apihub.agnes-ai.com
  • Auth: Authorization: Bearer <API_KEY>
  • Content-Type: application/json

API Key: environment variable AGNES_API_KEY or file ~/.agnes-ai-key.

Available Models

ModelCapabilityEndpoint
agnes-image-2.0-flashImage generation (t2i, i2i)/v1/images/generations
agnes-image-2.1-flashImage generation (latest)/v1/images/generations
agnes-2.0-flashImage recognition / vision/v1/chat/completions
agnes-video-v2.0Video generation (t2v, i2v)/v1/videos

1. Text-to-Image

bash
scripts/agnes-image.py --prompt "A futuristic city at sunset" --size 1024x1024
json
{
  "model": "agnes-image-2.0-flash",
  "prompt": "A futuristic city at sunset",
  "n": 1, "size": "1024x1024", "seed": 42
}

2. Image-to-Image

bash
scripts/agnes-image.py --prompt "turn into watercolor" --image "https://example.com/photo.png"
scripts/agnes-image.py --prompt "add a hat" --image "/path/to/local.png"
json
{
  "model": "agnes-image-2.0-flash",
  "prompt": "turn into watercolor style",
  "n": 1, "size": "1024x1024",
  "extra_body": {
    "image": ["https://example.com/photo.png"],
    "response_format": "url"
  }
}

3. Image Recognition

Analyze/describe image content using the vision model (agnes-2.0-flash).

bash
scripts/agnes-image.py recognize --image "https://example.com/photo.png" --prompt "Describe this image"

Endpoint: POST /v1/chat/completions

json
{
  "model": "agnes-2.0-flash",
  "messages": [
    {
      "role": "user",
      "content": [
        {"type": "text", "text": "Describe this image in detail."},
        {"type": "image_url", "image_url": {"url": "https://example.com/photo.png"}}
      ]
    }
  ],
  "max_tokens": 1024
}

Response: standard OpenAI-compatible chat completion with choices[0].message.content.

4. Video Generation (Async)

Video generation is asynchronous: submit a task, then poll for status.

Text-to-Video

bash
scripts/agnes-video.py --prompt "A cat walking on a beach at sunset" --width 1152 --height 768
json
{
  "model": "agnes-video-v2.0",
  "prompt": "A cat walking on a beach at sunset",
  "height": 768, "width": 1152,
  "num_frames": 121, "frame_rate": 24
}

Image-to-Video (animate single image)

bash
scripts/agnes-video.py --prompt "The woman turns around" --image "https://example.com/photo.png"
json
{
  "model": "agnes-video-v2.0",
  "prompt": "The woman turns around",
  "image": "https://example.com/photo.png",
  "num_frames": 121, "frame_rate": 24
}

Multi-Image / Keyframe Video

bash
scripts/agnes-video.py --prompt "Smooth transition" --images img1.png img2.png
scripts/agnes-video.py --prompt "Smooth transition" --images img1.png img2.png --mode keyframes
json
{
  "model": "agnes-video-v2.0",
  "prompt": "Smooth transition between scenes",
  "extra_body": {
    "image": ["https://example.com/img1.png", "https://example.com/img2.png"],
    "mode": "keyframes"
  },
  "num_frames": 121, "frame_rate": 24
}

Check Video Status

bash
scripts/agnes-video.py status <task_id>

GET /v1/videos/{task_id} — returns status: queuedprocessingcompleted.


Parameters Quick Reference

Image Generation

ParameterRequiredDefaultDescription
modelYesagnes-image-2.0-flashModel
promptYes-Text description
nNo1Number of images
sizeNo1024x1024Dimensions (512x512, 1024x768, etc.)
seedNorandomReproducibility seed
extra_body.imageimg2img-Input image URLs (array)
extra_body.response_formatNourl"url" or "b64_json" (must be in extra_body)

Video Generation

ParameterRequiredDefaultDescription
modelYesagnes-video-v2.0Model
promptYes-Text description
heightNo768Video height
widthNo1152Video width
num_framesNo121Frame count (~5s at 24fps)
frame_rateNo24FPS
imagei2v-Single input image URL
extra_body.imagemulti-Multiple image URLs (array)
extra_body.modeNo-"keyframes" for keyframe animation

Important Notes

  • response_format at top level causes 400 for images; place inside extra_body
  • Image generation: synchronous, timeout >= 60s
  • Video generation: asynchronous, poll GET /v1/videos/{task_id} for completion
  • Input images accept URLs or Data URI Base64 (local files auto-converted)

Scripts

  • `scripts/agnes-image.py` — Image generation (t2i, i2i) and recognition
  • `scripts/agnes-video.py` — Video generation (t2v, i2v, multi-image, keyframes) and status polling

API Documentation

Install & Usage

1
Create the skills directory
mkdir -p .claude/skills
2
Download the skill file
mkdir -p .claude/skills && curl -o .claude/skills/images-agnes-ai.md https://raw.githubusercontent.com/FrancoFang667788/agnes-ai-cli/main/SKILL.md
3
Invoke in Claude Code
/images-agnes-ai

Use Cases

Generate an image from a text description for a blog post or social media.
Transform an existing image into a different style, such as watercolor or sketch.
Analyze and describe the content of an image for accessibility or metadata extraction.
Create a short video from a text prompt or an input image.
Edit an image by adding objects or changing elements based on a prompt.
Generate multiple variations of an image for design exploration.

Usage Examples

1

/images-agnes-ai --prompt 'A serene mountain landscape at dawn' --size 1024x1024

2

/images-agnes-ai --prompt 'turn into oil painting' --image 'https://example.com/photo.jpg'

3

Describe this image: /path/to/image.png

View source on GitHub
api

Security Audits

LicenseUnknownSourceWarnRepositoryPass

Frequently Asked Questions

What is images-agnes-ai?

This skill enables Claude Code to generate, edit, and recognize images and videos using the Agnes AI API. It supports text-to-image, image-to-image, image recognition, and video generation tasks, making it easy to integrate multimodal AI capabilities into your development workflow.

How to install images-agnes-ai?

To install images-agnes-ai: create the skills directory (mkdir -p .claude/skills), then run: mkdir -p .claude/skills && curl -o .claude/skills/images-agnes-ai.md https://raw.githubusercontent.com/FrancoFang667788/agnes-ai-cli/main/SKILL.md. Finally, /images-agnes-ai in Claude Code.

What is images-agnes-ai best for?

images-agnes-ai is a skill categorized under General. It is designed for: api. Created by FrancoFang667788.

What can I use images-agnes-ai for?

images-agnes-ai is useful for: Generate an image from a text description for a blog post or social media.; Transform an existing image into a different style, such as watercolor or sketch.; Analyze and describe the content of an image for accessibility or metadata extraction.; Create a short video from a text prompt or an input image.; Edit an image by adding objects or changing elements based on a prompt.; Generate multiple variations of an image for design exploration..