Is nanaban actually free to use for image generation?

The CLI itself is MIT-licensed and free. For image generation: GPT Image 2 is free for ChatGPT Plus and Pro subscribers via Codex OAuth — generations decrement your existing subscription image quota, not an OpenAI API balance. The Gemini Nano Banana and GPT-5 Image backends cost cents per image but are optional.

How does nanaban get me free GPT Image 2 access?

It reads the OAuth token written by `codex login` (stored in ~/.codex/auth.json) and proxies image generation through ChatGPT's backend. Each image counts against your ChatGPT Plus or Pro image quota — no OpenAI API key, no separate billing. Your existing subscription covers it.

Do I need to install Codex CLI to use nanaban?

Only if you want the free GPT Image 2 backend. For the Gemini Nano Banana or OpenRouter GPT-5 backends, you just need the relevant API key set via `nanaban auth set` or `nanaban auth set-openrouter`. Codex CLI is the unlock for the free path specifically.

Can nanaban do vertical 9:16 images for Reels and Shorts?

Yes — through the Gemini Nano Banana 2 backend. GPT Image 2 and the GPT-5 variants are limited to 1:1, 2:3, and 3:2. Nano Banana 2 supports all 14 ratios including 9:16, 16:9, 1:4, 4:1, 1:8, and 8:1. Pin it with `--model nb2` or `--via gemini-direct`.

Does nanaban support image editing or just generation?

Both. Run `nanaban edit photo.png "make it a pencil sketch"` for image-to-image editing with a natural-language instruction. For style-locked generation, pass a reference image with `-r style.png` and nanaban will apply that image's color, composition, and texture to a fresh generation.

Can my Claude Code or Codex agent call nanaban as a tool?

Yes. Run `nanaban skill install` once and nanaban registers itself as a Skill for Claude Code, Codex, and Gemini. The `--json` flag returns structured output with file path, model, cost, duration, dimensions, and a fallback audit trail — exactly the contract an LLM agent needs.

What happens when my ChatGPT image quota runs out?

If you have OpenRouter or Gemini API keys configured, nanaban auto-falls back to those backends silently. The agent JSON mode surfaces the actual transport used so an LLM caller can see when the fallback fired. If you have no fallback configured, nanaban returns a RATE_LIMITED error with machine-readable hints.

How does nanaban compare to using the ChatGPT app directly?

Same backend, same quota — but terminal-native. Where the ChatGPT app forces you to click, copy, and download, nanaban gives you a one-command CLI that auto-names files, pipes to other tools, and works in scripts and CI. Use the app for one-off creative exploration; use nanaban for anything that touches more than one image.

Is using my ChatGPT subscription quota through nanaban allowed?

Nanaban uses the same OAuth token that the official Codex CLI uses, hitting the same backend your ChatGPT app does. You are using your own paid quota through a different client — analogous to using a third-party email client with your Gmail account. Read OpenAI's terms for the latest position, but the mechanism is fully above-board.

Which install method should I use?

Homebrew on macOS (`brew install paperfoot/tap/nanaban`) is the simplest. The standalone binary from GitHub releases is the next best — no Node.js runtime needed. npm (`npm install -g nanaban`) is fine if you already have Node 18+. All three give you the same CLI; pick whatever fits your existing setup.

Free AI Image Generation in the Terminal

The cheapest way to generate AI images in 2026 is not the OpenAI API. It is not Midjourney. It is your existing ChatGPT Plus subscription — accessed through your terminal, with no browser, no API key, and no per-image bill. The trick is a small open-source CLI called nanaban, which reads your Codex OAuth token and proxies image generation through ChatGPT's backend so every image counts against your ChatGPT Plus / Pro quota instead of an API balance.

Same idea, broader picture. Nanaban unifies three image backends behind one command — GPT Image 2 (free via ChatGPT Plus/Pro), Google Gemini's Nano Banana 2 and Pro (cents per image), and OpenAI's GPT-5 Image and GPT-5 Image Mini via OpenRouter. You type a prompt, you get a file. Auto-named from the prompt. 14 aspect ratios. Script-friendly stdout. Works inside Claude Code, Codex, and Gemini agents as a Skill.

This guide walks through the whole setup. How nanaban works, how to wire up the free path, the four real workflows it unlocks, and where the trade-offs are. By the end you will have a terminal where typing five words produces a PNG in three seconds for zero marginal cost.

Why Terminal Image Generation Matters

For most people, "AI image generation" means a web app — ChatGPT, Midjourney, OpenArt, Gemini. Type a prompt, wait, download. That works for one-off creative asks. It breaks for everything else.

Use case	Browser pain	Terminal advantage
Batch generate 50 variations	50 manual clicks + downloads	One while-loop over a prompts.txt file
LLM agent calls image gen as a tool	Impossible without an API	Standard subprocess call with --json output
CI pipeline generates social assets	Cannot do — no browser in CI	One CLI command in the workflow file
Test prompt variations rapidly	20 seconds per iteration	3 seconds per iteration in a terminal
Pipe an image into another tool	Download → upload → repeat	Pipe straight to `xargs open` or `imagemagick`

The browser is the wrong tool for any workflow that touches more than one image. The terminal is where image generation belongs once you treat it as part of a real pipeline.

The Free Path in One Sentence

Install nanaban. Run codex login once. Run nanaban "your prompt". Every image you generate comes out of your ChatGPT Plus / Pro image quota — not an API bill, not Midjourney credits, not anything else you pay for separately.

That is it. If you already pay for ChatGPT Plus or Pro, you have already paid for this. Nanaban just gives you a terminal-native way to use the image quota you are already getting.

Pro tip: The mechanism here is fully legitimate. Nanaban uses the Codex OAuth token that the official Codex CLI writes to ~/.codex/auth.json when you run codex login. It hits the same backend your ChatGPT app does. You are not jailbreaking anything — you are using your own paid quota through a different client.

What is Nanaban?

Nanaban is an open-source CLI built by Boris Djordjevic at Paperfoot AI / 199 Biotechnologies. It is MIT-licensed, ships as a Homebrew tap, an npm package, and standalone bun-compiled binaries for macOS and Linux. One command, one image, automatic filename, three-second turnaround in normal use.

What sets it apart from a thin OpenAI wrapper:

Three backends, one command. GPT Image 2 (OpenAI, free via Codex), Nano Banana 2 / Pro (Google Gemini), and GPT-5 Image / Mini (OpenAI via OpenRouter).
Auto-detected transport. If you are logged into Codex, it picks GPT Image 2 by default. If not, it falls back to whichever provider key you have configured.
Auto-naming from the prompt. "a fox in a snowy forest at dawn" becomes fox_snowy_forest_dawn.png. Collisions auto-increment.
14 aspect ratios on Nano Banana 2. Including extreme panoramic (1:8, 8:1) and tall portrait (1:4, 4:1) that GPT Image 2 cannot do.
Script-friendly output. Stdout is the file path only; metadata to stderr. Pipe straight to xargs, jq, or anything else.
Agent mode. The --json flag returns structured output with cost, duration, dimensions, and a fallback audit trail — exactly what an LLM agent needs to consume the result programmatically.
Image editing. nanaban edit photo.png "make it a pencil sketch" runs an image-to-image edit.
Style references. Pass -r style.png to apply that image's visual language (color, composition, texture) to a fresh generation.
Skill installation. nanaban skill install registers it with Claude Code, Codex, and Gemini so an agent can call it directly.

Source on GitHub at paperfoot/nanaban-cli. Full skill page on PromptsRush at /marketplace/skills/nanaban-cli.

Setup — From Zero to First Image in 90 Seconds

Step 1 — Install Nanaban

Homebrew is the easiest path on macOS:

brew install paperfoot/tap/nanaban

npm if you already have Node 18+:

npm install -g nanaban

Or grab a standalone binary from the GitHub releases — bun-compiled, no Node.js runtime required. The macOS Apple Silicon one:

curl -L https://github.com/paperfoot/nanaban-cli/releases/latest/download/nanaban-darwin-arm64 -o /usr/local/bin/nanaban && chmod +x /usr/local/bin/nanaban

Step 2 — Authenticate (the free path)

For the free GPT Image 2 backend, install Codex CLI and run:

codex login

This writes an OAuth token to ~/.codex/auth.json. Nanaban reads it on demand. From here on, every nanaban generation routes through ChatGPT's backend and decrements your ChatGPT Plus / Pro image quota.

If you do not have a ChatGPT subscription, skip this step and use one of the alternative backends:

nanaban auth set-openrouter # for OpenRouter (Gemini, GPT-5 Image)
nanaban auth set # for Gemini direct

Step 3 — Generate your first image

nanaban "cyberpunk tokyo street neon rain"

Three seconds later there is a file in your current directory called something like cyberpunk_tokyo_street_neon_rain.png. Open it.

That is the entire setup. Five lines, three of which are a one-time install. Everything from here is optimisation.

The Three Backends — When to Use Which

Backend	Cost per image	Best for	Aspect ratios
GPT Image 2 (via Codex OAuth)	$0 (uses ChatGPT Plus/Pro quota)	Strong text rendering, agentic planning, high fidelity	1:1, 2:3, 3:2
Nano Banana 2 (Gemini)	$0.067	Fast batch work, extreme aspect ratios, panoramas	All 14 (including 1:8, 8:1)
Nano Banana Pro (Gemini)	$0.136	Higher detail when budget allows	Standard 10
GPT-5 Image (OpenRouter)	$0.193	UI rendering, on-image typography	1:1, 2:3, 3:2
GPT-5 Image Mini (OpenRouter)	$0.041	Budget OpenAI alternative	1:1, 2:3, 3:2

The honest recommendation for most users: leave the default backend as GPT Image 2 (free), and pin Nano Banana 2 with --via gemini-direct or --model nb2 when you need a panoramic ratio that GPT Image 2 cannot produce.

If you do not have a ChatGPT subscription, Nano Banana 2 is the next-best default at $0.067 per image — cheap enough that a hundred generations cost less than a coffee.

The Workflows Nanaban Actually Unlocks

Workflow 1 — Batch generation from a prompts file

The single biggest reason to leave the browser. Write fifty prompts in a text file, generate all fifty:

cat prompts.txt | while read p; do nanaban "$p"; done

On the free Codex backend this costs $0 (within your ChatGPT image quota). On Nano Banana 2 it costs about $3.35. Total time: under five minutes for fifty images.

Workflow 2 — Pipe to your editor or browser

Stdout is the file path only. Pipe it to whatever you want next:

nanaban "a red circle" --json | jq -r .file | xargs open

That command generates the image, parses the JSON output for the file path, and opens it in your default viewer. One line.

Workflow 3 — Image editing

Edit an existing image with a natural-language instruction:

nanaban edit headshot.png "make it a pencil sketch"

Or apply a style reference to a fresh generation:

nanaban "portrait of a woman" -r vermeer_painting.png

The reference image's color, composition, and texture get applied to the generation. This is the cleanest path we have seen for style-locked product shots — one reference image, dozens of consistent product variations.

Workflow 4 — LLM agent tool use

The big one for agent builders. Run:

nanaban skill install

This registers nanaban as a callable Skill in Claude Code, Codex, and Gemini. Any of those agents can now call nanaban as a tool — generate an image, get a file path back, use that file path in the next step.

For programmatic use, the --json flag returns a clean structured response:

{
  "file": "fox_snowy_forest_dawn.png",
  "model": "gpt-image-2",
  "transport": "codex-oauth",
  "cost_usd": 0,
  "duration_ms": 2840,
  "dimensions": "1024x1024",
  "fallbacks_attempted": []
}

For a Claude Code agent building a thumbnail, this is exactly the contract you want — a known file path, a known cost, a known size, and an audit trail of which backend actually served the request.

Aspect Ratios and Sizes

The biggest practical difference between backends. If you are working in vertical 9:16 for Shorts or horizontal 16:9 for YouTube, you need Nano Banana 2.

Aspect ratio	Nano Banana 2	Nano Banana Pro	GPT Image 2 / GPT-5
1:1 (square)	Yes	Yes	Yes
2:3 / 3:2 (standard photo)	Yes	Yes	Yes
9:16 / 16:9 (social, video)	Yes	Yes	No
1:4 / 4:1 (tall / wide)	Yes	No	No
1:8 / 8:1 (extreme panorama)	Yes	No	No

Resolution caps: GPT Image 2 and GPT-5 variants are 1024px only. Nano Banana 2 goes from 0.5K up to 4K — useful when you want a generation you can use at billboard size without upscaling.

Use the --ar flag to set the ratio and --size for the resolution:

nanaban "panoramic alpine landscape sunset" --ar 4:1 --size 4k

The Honest Trade-offs

The free path is real but not magic.

Your ChatGPT image quota is finite. Plus and Pro have daily image limits — generous, but not unlimited. Heavy batch work will exhaust them. When it does, configure an OpenRouter or Gemini key as fallback and nanaban will silently route there.
GPT Image 2 is locked to 1024px and three ratios. For wide / tall / 4K work, you need Nano Banana 2, which costs cents per image.
Negative prompts are Gemini-only. If you need explicit "do not include X" guidance, that path is Nano Banana 2 or Pro.
You are using your subscription quota. Heavy nanaban use will eat into the same image budget you use through the ChatGPT app. Plan accordingly.
OAuth tokens expire. When yours does, re-run codex login. Nanaban surfaces an AUTH_EXPIRED error code so an agent caller can handle the case cleanly.

None of these break the use case. They are just the lines you should know exist before you build a pipeline that depends on the free path.

How This Stacks Up Against Other Image Tools

Tool	Cost	Terminal-native	Best for
Nanaban (Codex backend)	$0 (with ChatGPT Plus/Pro)	Yes	Anyone with a ChatGPT subscription doing scripted image work
Nanaban (Gemini backend)	$0.067/img	Yes	Panoramic, vertical 9:16, batch work without ChatGPT
Midjourney	$10-30/mo	No (Discord)	Premium creative work, distinctive style
OpenAI DALL-E API	$0.04-0.17/img	Yes (via API)	Production app integration with billing tracking
OpenArt	Free tier + paid	No	Browser-based with multi-model access (Flux, Midjourney style)
ComfyUI + local models	$0 (hardware required)	No (mostly)	Air-gapped pro workflows with full control

The honest split: nanaban for pipelines, scripts, agents, and any workflow that touches more than one image; OpenArt or Midjourney for browser-based creative exploration; ComfyUI for the air-gapped advanced lane. They are not competitors — they cover different jobs.

A Real Pipeline — Generating Blog Thumbnails From Post Titles

To make this concrete, here is the actual five-line pipeline we use to draft thumbnails for new blog posts. The input is a CSV of post titles; the output is a folder of thumbnail candidates.

tail -n +2 posts.csv | cut -d',' -f1 | while read title; do
nanaban "blog thumbnail, $title, bold typography, dark background, indigo accent" --ar 16:9 --model nb2 -o "thumbs/$(echo $title | tr ' ' '_' | tr -d '\"').png"
done

Reading from a 12-row CSV. Nano Banana 2 at $0.067 per image. Total cost: about $0.80. Total time: under two minutes. Browser equivalent: a half-hour of clicking.

For an LLM-driven version of the same workflow, the Claude Code agent calls nanaban --json as a tool, gets the file path back, and decides whether to regenerate based on the response. The agent loop is genuinely well-served by the structured JSON contract.

The Verdict

If you have a ChatGPT Plus or Pro subscription, you should install nanaban today. Five minutes of setup, and every image you generate from your terminal afterward costs you nothing on top of what you already pay. The free GPT Image 2 path alone justifies the install.

Even without a ChatGPT subscription, the multi-backend design makes nanaban a strong default — Nano Banana 2 at $0.067 per image is one of the cheapest serious image generators available, and the script-friendly interface is unmatched. If your workflow touches the terminal at all, this is the tool to wire in.

For the full install instructions, supported features, and FAQ, the skill page is at /marketplace/skills/nanaban-cli.

Keep Reading

Nanaban CLI on the PromptsRush directory — the skill page with the download and full SKILL.md.
How to Use Claude Code for FREE — same principle applied to coding instead of images.
AI Skills vs Prompts: What's the Difference? — the conceptual primer on what a Skill actually is.
Best Nano Banana 2 Fashion Photography Prompts — prompts to pair with the nb2 backend.
OpenArt Review 2026 — the browser-based image platform that pairs with nanaban for creative exploration.
How to Create a UI Design Skill Using design.md — pair this with nanaban so your agent generates on-brand images.
Prompt Library — the searchable prompt collection across image, video, and writing.
All AI Models — model catalogue with pricing and capabilities.