## Image Generation - gemini/gemini

### Skill: gemini
#### generate_image_gemini
Description: Generates an image using the Gemini Image API.
            Supported Models (aliases are internal):
            The model parameter allows selection between available image generation models.
            - "gemini-2.5-flash-image" (recommended default for stable, fast response).
            - "gemini-3-pro-image-preview".
            - "gemini-3-flash"

            Aliases for these models are 'nano-banana 2.5' and 'nano-banana 3 Pro' respectively.
            Please use 'gemini-2.5-flash-image' unless the user specifically requests the Gemini 3 model.

        Args:
            model: The image generation model to use (see supported models above). Defaults to "gemini-2.5-flash-image". Supports: 'gemini-3-flash', 'gemini-3-pro-image-preview', 'gemini-2.5-flash-image'
            prompt: A detailed text description for the image to be generated.
            image_name: The filename for the output image, can be a relative path. Defaults to "gemini_output_images.png".
            output_folder: The optional folder path where the image will be saved (use the user's personal directory). If None, uses a server default.
            aspect_ratio: The aspect ratio of the generated image (e.g., '16:9', '1:1', '4:3'). Defaults to '16:9'.
            image_size: The size/resolution of the generated image (e.g., '1K', '2K', '4K'). Defaults to '1K'.

        Return:
            Dict: Result dictionary containing image path, message, and success status.
            output_result["image_path"]: str
            output_result["image_url"]: str
            output_result["message"]: str
            output_result["success"]: bool
## Parameters
- model: string (default: gemini-2.5-flash-image)
- prompt: string (default: A detailed, cinematic image of a futuristic city.)
- image_name: string (default: gemini_output_images.png)
- output_folder: string/null
- aspect_ratio: string (default: 16:9)
- image_size: string (default: 1K)
##### CLI
```
onekey agent gemini/gemini generate_image_gemini '{"model": "gemini-2.5-flash-image", "prompt": "sunrise over mountains", "aspect_ratio": "16:9", "image_size": "1K"}'
```
##### RESTFUL
```
export DEEPNLP_ONEKEY_ROUTER_ACCESS=your_access_key
curl -v -X POST "https://agent.deepnlp.org/agent_router" \
  -H "Content-Type: application/json" \
  -H "X-OneKey: $DEEPNLP_ONEKEY_ROUTER_ACCESS" \
  -d '{"unique_id":"gemini/gemini","api_id":"generate_image_gemini","data":{"model": "gemini-2.5-flash-image", "prompt": "sunrise over mountains", "aspect_ratio": "16:9", "image_size": "1K"}}'
```
##### MCP
```
onekey mcp gemini
```
Add to client config
```
{
  "mcpServers":{
    "deepnlp-onekey-gemini":{
      "url":"https://agent.deepnlp.org/mcp?server_name=gemini&onekey=${DEEPNLP_ONEKEY_ROUTER_ACCESS}"
    }
  }
}
```
##### Skills
```
npx agtm add aiagenta2z/onekey-gateway --skill gemini -g
npx skills add https://github.com/aiagenta2z/onekey-gateway --skill gemini
```
##### python/typescript
```python
from ai_agent_marketplace import OneKeyAgentRouter
import os
router = OneKeyAgentRouter(onekey=os.getenv('DEEPNLP_ONEKEY_ROUTER_ACCESS','BETA_TEST_KEY_MARCH_2026'))
router.invoke(unique_id="gemini/gemini", api_id="generate_image_gemini", data={"model": "gemini-2.5-flash-image", "prompt": "sunrise over mountains", "aspect_ratio": "16:9", "image_size": "1K"})
```

#### generate_image_nano_banana
Description: Get Public Available Stock Symbols from Global Marketplace

        Args:
            model: The image generation model to use. Defaults to "gemini-2.5-flash-image". Supported Models such as follows Google Gemini Doc, such as 'gemini-3-flash', "gemini-3-pro-image-preview", "gemini-2.5-flash-image", note that nano-banana is the alias name of the Gemini Image Model. Nano banana 3 Pro refers to Gemini 3 pro preview, and Nono Banana 2.5 refers to Gemini 2.5. Unless specified by user to use Gemini 3 model preview, general 'Neno Banana' image models, please use 'gemini-2.5-flash-image' for more stable and fast response.
            prompt: A detailed text description for the image to be generated.
            image_name: The filename for the output image, can be a relative path, such as "./new_gemini_image.png", etc. Defaults to "gemini_output_images.png".
            output_folder: The optional folder path where the image will be saved. Please use the users' personal directory for this path. If None, uses a default location to the root folder of the server/image
            aspect_ratio: The aspect ratio of the generated image (e.g., '16:9', '1:1', '4:3'), defaults to '16:9'.
            image_size: The size/resolution of the generated image (e.g., '1K', '2K', '4K'), defaults to '1K'.

        Return:
            Dict:  output_result is the result dict of MCP returned
            output_result["image_path"] = full_path: str
            output_result["message"] = output_message: str
            output_result["success"] = success: bool
## Parameters
- model: string (default: gemini-2.5-flash-image)
- prompt: string (default: A detailed, cinematic image of a futuristic city.)
- image_name: string (default: gemini_output_images.png)
- output_folder: string/null
- aspect_ratio: string (default: 16:9)
- image_size: string (default: 1K)
##### CLI
```
onekey agent gemini/gemini generate_image_nano_banana '{"model": "gemini-2.5-flash-image", "prompt": "robot reading book", "aspect_ratio": "16:9", "image_size": "1K"}'
```
##### RESTFUL
```
export DEEPNLP_ONEKEY_ROUTER_ACCESS=your_access_key
curl -v -X POST "https://agent.deepnlp.org/agent_router" \
  -H "Content-Type: application/json" \
  -H "X-OneKey: $DEEPNLP_ONEKEY_ROUTER_ACCESS" \
  -d '{"unique_id":"gemini/gemini","api_id":"generate_image_nano_banana","data":{"model": "gemini-2.5-flash-image", "prompt": "robot reading book", "aspect_ratio": "16:9", "image_size": "1K"}}'
```
##### MCP
```
onekey mcp gemini
```
Add to client config
```
{
  "mcpServers":{
    "deepnlp-onekey-gemini":{
      "url":"https://agent.deepnlp.org/mcp?server_name=gemini&onekey=${DEEPNLP_ONEKEY_ROUTER_ACCESS}"
    }
  }
}
```
##### Skills
```
npx agtm add aiagenta2z/onekey-gateway --skill gemini -g
npx skills add https://github.com/aiagenta2z/onekey-gateway --skill gemini
```
##### python/typescript
```python
from ai_agent_marketplace import OneKeyAgentRouter
import os
router = OneKeyAgentRouter(onekey=os.getenv('DEEPNLP_ONEKEY_ROUTER_ACCESS','BETA_TEST_KEY_MARCH_2026'))
router.invoke(unique_id="gemini/gemini", api_id="generate_image_nano_banana", data={"model": "gemini-2.5-flash-image", "prompt": "robot reading book", "aspect_ratio": "16:9", "image_size": "1K"})
```

#### generate_image_nano_banana_with_reference
Description: Get Public Available Stock Symbols from Global Marketplace

        Args:
            model: The image generation model to use. Defaults to "gemini-2.5-flash-image". Supported Models such as follows Google Gemini Doc, such as "gemini-3-pro-image-preview", "gemini-2.5-flash-image", note that nano-banana is the alias name of the Gemini Image Model. Nano banana 3 Pro refers to Gemini 3 pro preview, and Nono Banana 2.5 refers to Gemini 2.5. Unless specified by user to use Gemini 3 model preview, general 'Neno Banana' image models, please use 'gemini-2.5-flash-image' for more stable and fast response.
            prompt: A detailed text description for the image to be generated.
            image_name: The filename for the output image, can be a relative path, such as "./new_gemini_image.png", etc. Defaults to "gemini_output_images.png".
            output_folder: The optional folder path where the image will be saved. Please use the users' personal directory for this path. If None, uses a default location to the root folder of the server/image
            aspect_ratio: The aspect ratio of the generated image (e.g., '16:9', '1:1', '4:3'), defaults to '16:9'.
            image_size: The size/resolution of the generated image (e.g., '1K', '2K', '4K'), defaults to '1K'.

        Return:
            Dict:  output_result is the result dict of MCP returned
            output_result["image_path"] = full_path: str
            output_result["message"] = output_message: str
            output_result["success"] = success: bool
## Parameters
- model: string (default: gemini-3-pro-image-preview)
- prompt: string (default: Please change this image to a winter coat style)
- images: array
- image_name: string (default: output_winter_coat.jpg)
- output_folder: string (default: ./output)
- aspect_ratio: string (default: 1:1)
- image_size: string (default: 1K)
##### CLI
```
onekey agent gemini/gemini generate_image_nano_banana_with_reference '{"model": "gemini-3-pro-image-preview", "prompt": "winter coat style", "images": ["https://avatars.githubusercontent.com/u/242328252"], "aspect_ratio": "1:1"}'
```
##### RESTFUL
```
export DEEPNLP_ONEKEY_ROUTER_ACCESS=your_access_key
curl -v -X POST "https://agent.deepnlp.org/agent_router" \
  -H "Content-Type: application/json" \
  -H "X-OneKey: $DEEPNLP_ONEKEY_ROUTER_ACCESS" \
  -d '{"unique_id":"gemini/gemini","api_id":"generate_image_nano_banana_with_reference","data":{"model": "gemini-3-pro-image-preview", "prompt": "winter coat style", "images": ["https://avatars.githubusercontent.com/u/242328252"], "aspect_ratio": "1:1"}}'
```
##### MCP
```
onekey mcp gemini
```
Add to client config
```
{
  "mcpServers":{
    "deepnlp-onekey-gemini":{
      "url":"https://agent.deepnlp.org/mcp?server_name=gemini&onekey=${DEEPNLP_ONEKEY_ROUTER_ACCESS}"
    }
  }
}
```
##### Skills
```
npx agtm add aiagenta2z/onekey-gateway --skill gemini -g
npx skills add https://github.com/aiagenta2z/onekey-gateway --skill gemini
```
##### python/typescript
```python
from ai_agent_marketplace import OneKeyAgentRouter
import os
router = OneKeyAgentRouter(onekey=os.getenv('DEEPNLP_ONEKEY_ROUTER_ACCESS','BETA_TEST_KEY_MARCH_2026'))
router.invoke(unique_id="gemini/gemini", api_id="generate_image_nano_banana_with_reference", data={"model": "gemini-3-pro-image-preview", "prompt": "winter coat style", "images": ["https://avatars.githubusercontent.com/u/242328252"], "aspect_ratio": "1:1"})
```

#### ocr_extract_text_from_image
Description: Perform Optical Character Recognition (OCR) to extract all text from a given image URL.

    Args:
        image_url: The public URL of the image containing text to be transcribed.
        model: The model used for vision analysis. Defaults to Gemini 3 Flash.
## Parameters
- model: string (default: gemini-3-flash-preview)
- images: array
##### CLI
```
onekey agent gemini/gemini ocr_extract_text_from_image '{"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview"}'
```
##### RESTFUL
```
export DEEPNLP_ONEKEY_ROUTER_ACCESS=your_access_key
curl -v -X POST "https://agent.deepnlp.org/agent_router" \
  -H "Content-Type: application/json" \
  -H "X-OneKey: $DEEPNLP_ONEKEY_ROUTER_ACCESS" \
  -d '{"unique_id":"gemini/gemini","api_id":"ocr_extract_text_from_image","data":{"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview"}}'
```
##### MCP
```
onekey mcp gemini
```
Add to client config
```
{
  "mcpServers":{
    "deepnlp-onekey-gemini":{
      "url":"https://agent.deepnlp.org/mcp?server_name=gemini&onekey=${DEEPNLP_ONEKEY_ROUTER_ACCESS}"
    }
  }
}
```
##### Skills
```
npx agtm add aiagenta2z/onekey-gateway --skill gemini -g
npx skills add https://github.com/aiagenta2z/onekey-gateway --skill gemini
```
##### python/typescript
```python
from ai_agent_marketplace import OneKeyAgentRouter
import os
router = OneKeyAgentRouter(onekey=os.getenv('DEEPNLP_ONEKEY_ROUTER_ACCESS','BETA_TEST_KEY_MARCH_2026'))
router.invoke(unique_id="gemini/gemini", api_id="ocr_extract_text_from_image", data={"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview"})
```

#### list_items_from_image
Description: Analyze an image and return a list of all identified objects, items, or subjects.

    Args:
        image_url: The public URL of the image to analyze.
        model: The model used for object detection.
        output_json: If True, returns a structured list. If False, returns a text description.
## Parameters
- model: string (default: gemini-3-flash-preview)
- images: array
- output_json: boolean (default: True)
##### CLI
```
onekey agent gemini/gemini list_items_from_image '{"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview", "output_json": true}'
```
##### RESTFUL
```
export DEEPNLP_ONEKEY_ROUTER_ACCESS=your_access_key
curl -v -X POST "https://agent.deepnlp.org/agent_router" \
  -H "Content-Type: application/json" \
  -H "X-OneKey: $DEEPNLP_ONEKEY_ROUTER_ACCESS" \
  -d '{"unique_id":"gemini/gemini","api_id":"list_items_from_image","data":{"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview", "output_json": true}}'
```
##### MCP
```
onekey mcp gemini
```
Add to client config
```
{
  "mcpServers":{
    "deepnlp-onekey-gemini":{
      "url":"https://agent.deepnlp.org/mcp?server_name=gemini&onekey=${DEEPNLP_ONEKEY_ROUTER_ACCESS}"
    }
  }
}
```
##### Skills
```
npx agtm add aiagenta2z/onekey-gateway --skill gemini -g
npx skills add https://github.com/aiagenta2z/onekey-gateway --skill gemini
```
##### python/typescript
```python
from ai_agent_marketplace import OneKeyAgentRouter
import os
router = OneKeyAgentRouter(onekey=os.getenv('DEEPNLP_ONEKEY_ROUTER_ACCESS','BETA_TEST_KEY_MARCH_2026'))
router.invoke(unique_id="gemini/gemini", api_id="list_items_from_image", data={"images": ["https://avatars.githubusercontent.com/u/242328252"], "model": "gemini-3-flash-preview", "output_json": true})
```