studioglobal
Trending Discover
AnswersPublished13 sources

GPT Image 2 vs. Nano Banana Pro: Which image API should you use?

No public source here proves a universal winner: GPT Image 2 is the safer default for exact text and structured commercial layouts, while Nano Banana Pro has the stronger direct signal for photoreal lighting and skin... Pricing is closer than model vs model hot takes suggest: OpenAI lists gpt image 2 image output at...

13K0
Nano Banana Pro children's picture book double-page spread with fox and bear, birch forest and northern lights, warm gouache painting style with pressed paper grain
Nano Banana Pro children's picture book double-page spread with fox and bear, birch forest and northern lights, warm gouache painting styleNano Banana Pro children's picture book double-page spread with fox and bear, birch forest and northern lights, warm gouache painting style with pressed paper grain

Choosing between GPT Image 2 and Nano Banana Pro is less about naming a permanent champion than choosing the model whose strengths match your workload. The strongest sources here are official docs for model availability, rate limits, pricing, and generation parameters; the quality claims come from small hands-on tests and review-style comparisons [3][6][9][10][13][14][25][26]. Taken together, the evidence supports a practical split: start with GPT Image 2 for text-heavy and layout-heavy commercial assets, and start with Nano Banana Pro when photoreal lighting, skin texture, and Gemini-native image workflows matter most [3][6][10][26].

Evidence caveat: Pro, 2, and public benchmarks are not identical

AVB’s direct comparison tested GPT Image 2.0 against Nano Banana Pro, identified there as gemini-3-pro-image, across 10 prompts on April 22, 2026 [6]. Several other useful public head-to-heads in this source set compare GPT Image 2 with Nano Banana 2 rather than Nano Banana Pro [3][9][10]. Google’s developer docs describe Nano Banana image generation through the Gemini API, including aspect-ratio and 2K resolution parameters [26].

That naming mismatch matters. Model routes, provider policies, default settings, and product surfaces can change results. Treat Nano Banana 2 comparisons as adjacent evidence for Gemini/Nano Banana image workflows, not as a perfect substitute for your exact Nano Banana Pro endpoint [3][6][10][26].

Official docs are strongest for API facts: OpenAI lists gpt-image-2-2026-04-21 and tiered rate limits for GPT Image 2 [13], and OpenAI’s pricing page lists token prices for gpt-image-2 [14]. Google’s Gemini pricing page lists image-output pricing and a 1024×1024 token estimate [25]. Public quality benchmarks are weaker: the cited tests are small prompt sets, blind tests, or comparison posts rather than a standardized independent benchmark suite [3][6][9][10]. Some comparison pages also make very precise claims, such as text-accuracy percentages or leaderboard-style rankings, without enough methodology in the provided snippets to treat those numbers as decisive [5][8].

Quick decision matrix

WorkloadBetter first testWhy
Exact English text, labels, menus, UI copy, posters, product labelsGPT Image 2Genspark reports a narrow GPT Image 2 edge on precise text and technical terminology, and AVB reports GPT Image 2.0 wins on in-image typography, manga dialogue panels, a bilingual menu, and a silkscreen gig poster [3][6].
Layout-heavy ads, packaging, mockups, or commercial editsGPT Image 2Vidguru’s 10-test blind benchmark says GPT-Image 2 won five rounds and tied the other five, with the biggest gap in image-editing fidelity, material logic, and layout-heavy commercial work [10].
Photoreal portraits, UGC-style images, lifestyle ads, cinematic lightingNano Banana ProAVB reports Nano Banana Pro wins on photorealism, skin texture, and lighting in hyperreal portrait, UGC selfie, and athletic ad prompts [6].
CJK typography polish or dramatic lightingTest Nano Banana first, then verify on your endpointGenspark’s Nano Banana 2 comparison reports a narrow edge on CJK typography polish and dramatic lighting, but this is adjacent evidence rather than a direct Nano Banana Pro result [3].
Product shots, e-commerce mockups, marketing infographics, anatomy diagramsBenchmark bothGenspark reports the two models are effectively tied in these categories when prompted properly [3].
Technical diagrams and labeled schematicsBenchmark bothAnalytics Vidhya describes its annotated-diagram task as the closest contest, with both models rendering requested labels and data points accurately in that test [9].
OpenAI-centered API stack, tiered rate limits, or Batch API economicsGPT Image 2OpenAI documents the GPT Image 2 model, tiered rate limits, token pricing, and lower Batch API pricing [13][14][15].
Gemini-centered workflow with documented Nano Banana parametersNano Banana Pro / Gemini image workflowGoogle’s Nano Banana image-generation docs show Gemini API usage with parameters such as aspect ratio and 2K resolution [26].

Benchmark findings

Text, typography, and structured layouts: GPT Image 2 has the clearest edge

Text rendering is the most consistent GPT Image 2 advantage in the provided comparisons. Genspark reports that GPT Image 2 has a narrow edge on precise text and technical terminology [3]. In AVB’s direct 10-prompt test, GPT Image 2.0 won the in-image typography, manga dialogue-panel, bilingual-menu, and silkscreen-gig-poster prompts [6]. Vidguru’s blind benchmark also favors GPT-Image 2 for layout-heavy commercial work, image-editing fidelity, and material logic [10].

The practical read: if a malformed headline, bad menu item, broken UI label, or inaccurate product callout makes the image unusable, GPT Image 2 is the more defensible first API to test [3][6][10].

Photorealism and lighting: Nano Banana Pro’s strongest signal

Nano Banana Pro’s best direct evidence is in photoreal and lighting-heavy outputs. AVB reports that Nano Banana Pro won on photorealism, skin texture, and lighting in the hyperreal portrait, UGC selfie, and athletic ad prompts [6]. Genspark’s adjacent Nano Banana 2 comparison also reports a narrow edge for dramatic lighting [3].

That makes Nano Banana Pro a strong first candidate for editorial portraits, lifestyle campaigns, UGC-style ads, and cinematic creative where mood and naturalistic lighting matter more than exact text overlays [3][6].

Product shots, e-commerce, and infographics look close

For common commercial categories, the public evidence does not show a clean winner. Genspark reports that GPT Image 2 and Nano Banana 2 are effectively tied on photorealistic product shots, e-commerce mockups, marketing infographics, and anatomy diagrams when prompted properly [3]. For these workloads, the deciding factors are likely to be your prompt style, reference-image workflow, editing loop, latency in your stack, and cost per accepted output.

Technical diagrams are too close for a generic verdict

Diagram performance depends heavily on the style and scoring rubric. Analytics Vidhya describes its annotated-diagram task as the closest contest in its comparison: Nano Banana 2 produced a rigorous two-view engineering-style diagram, GPT Image 2 produced a visually strong blueprint-style result, and both rendered the requested labels and data points accurately in that test [9]. If you need exact dimensions, callouts, or engineering conventions, do not rely on a generic model ranking; test your actual diagram templates.

Prompt completion and refusals: one small test favored GPT Image 2

AVB reports that GPT Image 2.0 rendered all 10 prompts in its test, while Nano Banana Pro rendered 9 of 10 and refused a prominent-person CV prompt on policy grounds [6]. That is useful signal, but it should not be generalized too far. Refusal behavior can vary by prompt wording, API route, safety policy, product surface, and time.

Pricing: the headline image-output price is similar

OpenAI’s pricing page lists gpt-image-2 image input at $8.00 per 1M tokens, cached image input at $2.00 per 1M tokens, and image output at $30.00 per 1M tokens [14]. OpenAI’s materials also list GPT Image 2 text input at $5.00 per 1M tokens, cached text input at $1.25 per 1M tokens, and text output at $10.00 per 1M tokens [14][21].

Google’s Gemini pricing page lists image output at $30 per 1,000,000 tokens and says output images up to 1024×1024 consume 1,290 tokens, equivalent to $0.039 per image [25].

So the public pricing docs do not support a simple claim that one model is always cheaper. Real cost can diverge based on prompt length, image inputs, reference images, resolution, edit loops, retries, refusals, caching, and routing [14][21][25][26]. OpenAI’s pricing docs also show lower Batch API rates for GPT Image 2, and OpenAI says the Batch API can save 50% on inputs and outputs for asynchronous jobs run over 24 hours [14][15]. If your workload is bulk generation rather than interactive creation, that can materially affect cost.

Production parameters and routing

OpenAI’s GPT Image 2 model page lists tiered TPM and IPM rate limits, with Free not supported and higher tiers scaling upward by usage tier [13]. Google’s Nano Banana image-generation docs show Gemini API examples with inline image inputs, aspect ratio, and 2K resolution parameters [26].

If you use third-party routing, verify limits against that provider rather than assuming they match the first-party API. For example, Fal’s GPT Image 2 page lists custom dimensions that must be multiples of 16, a maximum single edge of 3840px, a maximum aspect ratio of 3:1, and a total pixel range from 655,360 to 8,294,400; it also notes that routing and quota behavior can depend on how the provider is configured [17].

Which API should you use?

Choose GPT Image 2 if you need:

  • Precise text, labels, UI copy, menus, posters, or technical terminology [3][6].
  • Layout-heavy commercial assets such as ads, packaging, product mockups, and structured brand graphics [10].
  • OpenAI API integration with documented model access, tiered rate limits, and token pricing [13][14].
  • Batch-friendly economics for asynchronous high-volume image jobs [14][15].

Choose Nano Banana Pro if you need:

  • Photoreal portraits, lifestyle imagery, UGC-style creative, skin texture, or lighting-heavy ad concepts [6].
  • A Gemini/Nano Banana workflow with documented API parameters such as aspect ratio and 2K resolution [26].
  • CJK typography polish or dramatic lighting, provided your own Nano Banana Pro test confirms the adjacent Nano Banana 2 signal from Genspark [3].
  • A pricing model where Google’s documented 1024×1024 estimate of 1,290 output tokens, or $0.039 per image, fits your budgeting workflow [25].

Run a private benchmark before committing

The public comparisons are useful for shortlisting, not for final vendor selection. A stronger production test should use your own prompts, references, brand rules, and failure criteria. Vidguru’s benchmark describes a useful baseline approach: first-take generations with identical prompts and identical references where relevant, scored on prompt adherence, commercial usability, text accuracy, physical logic, and reference fidelity [10].

For a practical internal benchmark, include 30–50 prompts across your real categories: product shots, ads, UI screens, diagrams, multilingual text, reference-image edits, brand layouts, and policy-sensitive edge cases. Score each output on text accuracy, prompt adherence, layout and spatial logic, reference fidelity, photorealism or style match, editability across follow-up prompts, artifact rate, latency, refusal rate, and cost per accepted image.

The current evidence is enough to choose a starting point: GPT Image 2 for text and structure, Nano Banana Pro for photoreal lighting and Gemini workflows. It is not enough to replace a workload-specific benchmark.

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Search & fact-check with Studio Global AI

Key takeaways

  • No public source here proves a universal winner: GPT Image 2 is the safer default for exact text and structured commercial layouts, while Nano Banana Pro has the stronger direct signal for photoreal lighting and skin...
  • Pricing is closer than model vs model hot takes suggest: OpenAI lists gpt image 2 image output at $30 per 1M tokens, and Google lists Gemini image output at $30 per 1M tokens with 1024×1024 images at 1,290 tokens, or...
  • For production, run your own prompt set and compare text accuracy, layout adherence, reference fidelity, refusals, latency, retries, and cost per accepted image.

Supporting visuals

Nano Banana Pro silkscreen risograph gig poster with convertible and cactus silhouettes, fluorescent red and deep navy ink overlap, hand numbered edition text
Nano Banana Pro silkscreen risograph gig poster with convertible and cactus silhouettes, fluorescent red and deep navy ink overlap, hand numNano Banana Pro silkscreen risograph gig poster with convertible and cactus silhouettes, fluorescent red and deep navy ink overlap, hand numbered edition text
Nano Banana Pro hyperreal editorial portrait of woman in her early sixties with skin texture pores fine lines and subsurface scattering visible in lips
Nano Banana Pro hyperreal editorial portrait of woman in her early sixties with skin texture pores fine lines and subsurface scattering visiNano Banana Pro hyperreal editorial portrait of woman in her early sixties with skin texture pores fine lines and subsurface scattering visible in lips

People also ask

What is the short answer to "GPT Image 2 vs. Nano Banana Pro: Which image API should you use?"?

No public source here proves a universal winner: GPT Image 2 is the safer default for exact text and structured commercial layouts, while Nano Banana Pro has the stronger direct signal for photoreal lighting and skin...

What are the key points to validate first?

No public source here proves a universal winner: GPT Image 2 is the safer default for exact text and structured commercial layouts, while Nano Banana Pro has the stronger direct signal for photoreal lighting and skin... Pricing is closer than model vs model hot takes suggest: OpenAI lists gpt image 2 image output at $30 per 1M tokens, and Google lists Gemini image output at $30 per 1M tokens with 1024×1024 images at 1,290 tokens, or...

What should I do next in practice?

For production, run your own prompt set and compare text accuracy, layout adherence, reference fidelity, refusals, latency, retries, and cost per accepted image.

Which related topic should I explore next?

Continue with "MRSA Management in Nursing Homes: Evidence for a Team-Based Approach" for another angle and extra citations.

Open related page

What should I compare this against?

Cross-check this answer against "Should You Retake FRACDS (GDP) Before Orthodontics?".

Open related page

Continue your research

Research conversation

You

Research question

Search for GPT Image 2 vs. Nano Banana Pro benchmarks and compare them as comprehensively as possible.

Studio Global AI36 sources

Cited answer

GPT Image 2 and Nano Banana Pro appear very close overall, with GPT Image 2 showing the stronger pattern for exact text, structured layouts, and prompt compliance, while Nano Banana Pro/Gemini 3 Pro Image tends to be stronger for speed, 2K/4K workflows, CJK typography polish, lighting, and Google ecosystem integration. There is no single independent, standardized benchmark suite I found, so the comparison below is based on official API/pricing docs plus several small hands-on prompt tests; treat winner claims as directional, not definitive.

Evidence quality caveat

  • Official docs are strongest for model availability, pricing, and API capabilities: OpenAI lists gpt-image-2-2026-04-21 and pricing for gpt-image-2 in its API docs [1][2].
  • Google’s Gemini API docs cover Nano Banana image generation and pricing, including image output priced at $30 per 1,000,000 tokens and 1024×1024 output equivalent to $0.039 per image [4][5].
  • The benchmark evidence is weaker: most public comparisons are blog-style, small-N prompt tests rather than reproducible third-party benchmarks, and some are vendor/SEO sites [6][13][14].
  • Insufficient evidence for a statistically reliable “overall winner.”

Head-to-head summary

DimensionGPT Image 2Nano Banana Pro / Gemini 3 Pro ImagePractical read
Text renderingUsually favored in hands-on comparisons for precise text and technical terminology [14]Often strong, with some reports favoring it for CJK typography polish [14]Pick GPT Image 2 for English-heavy posters, labels, UI mockups, diagrams.
Prompt adherencePublic tests report strong compliance; one 10-prompt test said GPT Image 2 rendered all 10 prompts [13]Same test said Nano Banana Pro rendered 9/10 and refused one prominent-person CV prompt [13]GPT may be less refusal-prone in some commercial prompt sets, but policy behavior varies.
Technical diagramsComparisons describe GPT Image 2 as strong for spatial logic and technical/diagram-like outputs [15]Some tests describe Nano Banana as producing rigorous annotated diagrams [9]Close; test your exact diagram style.
Photorealism/product shotsHands-on comparisons describe both as very close for product shots, mockups, infographics, and anatomy diagrams [14]Same: very close, with no universal winner in many commercial categories [14]Either can work; choose by ecosystem and cost.
Lighting / cinematic styleCompetitive, but less consistently singled outHands-on comparison gives Nano Banana 2 a narrow edge for dramatic lighting [14]Nano Banana Pro may be preferable for cinematic/editorial imagery.
Resolution / productionThird-party access page for GPT Image 2 lists custom dimensions with max single edge 3840 px and total pixels up to 8,294,400 [10]Google’s image docs show Nano Banana generation options including aspect ratio and 2K resolution [5]Both are production-capable; Nano Banana is more visibly marketed around 2K/4K workflows.
SpeedNot consistently established from official docs in the search resultsSome non-official comparisons claim very fast 3–5 second generation for Nano Banana 2 [13]Insufficient evidence from official sources; benchmark in your stack.
API pricingOpenAI docs list gpt-image-2 image input at $8.00 per 1M tokens, cached image input at $2.00, image output at $30.00, text input at $5.00, cached text input at $1.25, and text output at $10.00 [2]Google docs list image output at $30 per 1M tokens, with 1024×1024 images using 1,290 tokens, equivalent to $0.039 per image [4]Similar output-token headline; actual per-image cost depends on resolution, inputs, routing, and batch use.
EcosystemBest if your workflow is already OpenAI/ChatGPT/API-centered [1][2]Best if your workflow is Gemini, Google AI Studio/API, Search/Workspace/Vertex-style tooling [4][5]Ecosystem fit may matter more than small quality deltas.

Benchmark findings from public comparisons

  • A 10-prompt hands-on test reported GPT Image 2 completed 10/10 prompts, while Nano Banana Pro completed 9/10 and refused a prompt involving Elon Musk’s CV due to prominent-person policy concerns [13].
  • A hands-on comparison found GPT Image 2 had a narrow edge on precise text and technical terminology, while Nano Banana 2 had a narrow edge on CJK typography polish and dramatic lighting [14].
  • The same comparison said the two models were close on photorealistic product shots, e-commerce mockups, marketing infographics, and anatomy diagrams [14].
  • Another comparison source characterizes GPT Image 2 as stronger for spatial relationships and Nano Banana 2 as Google DeepMind’s image model combining generation/editing strengths, but it is a software-comparison listing rather than a rigorous benchmark [15].
  • Some SEO-style comparison pages make very specific claims such as GPT Image 2 “99.2% text accuracy,” but I would not treat those as reliable without methodology, dataset, and reproducible examples [11].

Where GPT Image 2 looks better

  • Use GPT Image 2 when your image depends on exact English text, labels, UI copy, branded layouts, technical captions, or instruction-heavy prompts, because public hands-on comparisons give it an edge on precise text and technical terminology [14].
  • Use GPT Image 2 for workflows where OpenAI API integration and token pricing are already understood, since OpenAI’s official docs list the model and pricing directly [1][2].
  • Use GPT Image 2 if policy refusal sensitivity is a concern for benign commercial tests, since one small benchmark reported GPT Image 2 completed all 10 prompts while Nano Banana Pro refused one [13].

Where Nano Banana Pro looks better

  • Use Nano Banana Pro when you need Google/Gemini-native workflows, because Google’s official docs cover Nano Banana image generation through the Gemini API [5].
  • Use Nano Banana Pro for 2K-oriented generation workflows, since Google’s docs show Nano Banana image generation parameters including aspect ratio and 2K resolution [5].
  • Use Nano Banana Pro when CJK typography polish, cinematic lighting, or dramatic style matter most, because one hands-on comparison found a narrow edge for Nano Banana in those areas [14].
  • Use Nano Banana Pro when simple per-image pricing around 1024×1024 is attractive, because Google documents 1024×1024 output as 1,290 tokens, equivalent to $0.039 per image [4].

Pricing comparison

  • OpenAI’s official pricing for gpt-image-2 is token-based: image input $8.00 per 1M tokens, cached image input $2.00, image output $30.00, text input $5.00, cached text input $1.25, and text output $10.00 [2].
  • Google’s official Gemini pricing says image output is $30 per 1M tokens, and an output image up to 1024×1024 consumes 1,290 tokens, equivalent to $0.039 per image [4].
  • The headline output price is similar at $30 per 1M image-output tokens, but real job cost can diverge because prompts, reference images, resolution, editing steps, retries, and provider routing all affect total cost [2][4][5].

Recommendation

  • Choose GPT Image 2 if your top priorities are text accuracy, structured designs, instruction following, technical diagrams, brand layouts, and OpenAI integration.
  • Choose Nano Banana Pro if your top priorities are Gemini integration, fast creative iteration, 2K/4K-style production workflows, cinematic lighting, and multilingual/CJK visual text.
  • For production, run a private benchmark with 30–50 prompts from your actual workload and score: text accuracy, prompt adherence, editability, artifact rate, latency, refusal rate, and cost per accepted image. Public benchmark evidence is too limited to replace that.

Sources

  • [3] GPT Image 2 vs Nano Banana 2: Hands-On 2026 Comparisongenspark.ai

    If you only read one paragraph: GPT Image 2 has a narrow, real edge on precise text and technical terminology. Nano Banana 2 has a narrow, real edge on CJK typography polish and dramatic lighting. On photorealistic product shots, e-commerce mockups, marketi...

  • [5] GPT Image 2 vs. Nano Banana 2: The Ultimate 2026 AI ... - GlobalGPTglbgpt.com

    logo GPT Image 2 vs. Nano Banana 2: The Ultimate 2026 AI Image Comparison Guide avatar GPT Image 2 vs. Nano Banana 2: The Ultimate 2026 AI Image Comparison Guide GPT Image 2 leads in spatial logic and 99.2% text accuracy, while Nano Banana 2 excels in 4K pr...

  • [6] GPT Image 2.0 vs Nano Banana Pro: 10 Prompts Tested 2026 | AVBaivideobootcamp.com

    TL;DR: We ran the same 10 prompts through GPT Image 2.0 (gpt-image-2) and Nano Banana Pro (gemini-3-pro-image) on April 22, 2026. GPT 2.0 rendered 10 of 10. Nano Banana Pro rendered 9 of 10 and refused the Elon Musk CV prompt with the message "This prompt m...

  • [8] GPT-Image-2 vs Nano Banana Pro: Which is stronger? 7 ...help.apiyi.com

    Skip to content Apiyi.com Blog Apiyi.com Blog Best AI API Router Services Apiyi.com Blog Apiyi.com Blog Best AI API Router Services Image Generation API Model Selection & Comparison GPT-Image-2 vs Nano Banana Pro: Which is stronger? 7-dimensional deep showd...

  • [9] Is GPT Image 2 the Best Image Generation Model? - Analytics Vidhyaanalyticsvidhya.com

    Image 14: Annotated Diagrams Observation: Task 5 was the closest contest of the comparison. Nano Banana 2 produced a technically rigorous two-view engineering diagram with bold annotation lines, precise measurement callouts, and a detailed Wing Warp schemat...

  • [10] Nano Banana 2 vs GPT-Image 2: Our 10-Test Blind Benchmark After OpenAI's API Launch | Vidguruvidguru.ai

    About This Test This benchmark was conducted by Vidguru AI Lab on April 23, 2026 using the Vidguru web platform. All generations were first-take only, with identical prompts and identical references where relevant. Scores focused on prompt adherence, commer...

  • [13] GPT Image 2 Model | OpenAI APIdevelopers.openai.com

    gpt-image-2-2026-04-21 Rate limits Rate limits ensure fair and reliable access to the API by placing specific caps on requests or tokens used within a given time period. Your usage tier determines how high these limits are set and automatically increases as...

  • [14] Pricing | OpenAI APIdevelopers.openai.com

    Model Modality Input Cached input Output --- --- gpt-image-2 Image $8.00 $2.00 $30.00 Text $5.00 $1.25 gpt-image-1.5 Image $8.00 $2.00 $32.00 Text $5.00 $1.25 $10.00 gpt-image-1-mini Image $2.50 $0.25 $8.00 Text $2.00 $0.20 All models Batch For image genera...

  • [15] API Pricing - OpenAIopenai.com

    Price $10.00 / 1k calls Search content tokens are free. Containers Run code and tools in secure, scalable environments alongside your models. Price Now: 1 GB for $0.03 / 64GB for $1.92 per container Starting March 31, 2026: 1 GB for $0.03 / 64GB for $1.92 p...

  • [17] GPT Image 2 API | Text to Image - Fal.aifal.ai

    // Use the returned URL in your request []( Custom image dimensions must be multiples of 16 on both edges Maximum single edge is 3840px; maximum aspect ratio is 3:1 Total pixel count must be between 655,360 and 8,294,400 When running client-side code, never...

  • [21] Introducing gpt-image-2 - available today in the API and Codexcommunity.openai.com

    Modality Input Cached Input Output --- --- Image $8.00 $2.00 $30.00 Text $5.00 $1.25 $10.00 Full details and rate limits are available on the model page. Use gpt-image-2 in the API for production image generation workflows, or in Codex when you want to crea...

  • [25] Gemini Developer API pricingai.google.dev

    [] Image output is priced at $30 per 1,000,000 tokens. Output images up to 1024x1024px consume 1290 tokens and are equivalent to $0.039 per image. Gemini 2.0 Flash-Lite gemini-2.0-flash-lite Warning: Gemini 2.0 Flash-Lite is deprecated and will be shut down...

  • [26] Nano Banana image generation - Google AI for Developersai.google.dev

    import { GoogleGenAI } from "@google/genai"; import as fs from "node:fs"; async function main() { const ai = new GoogleGenAI({}); const prompt = 'An office group photo of these people, they are making funny faces.'; const aspectRatio = '5:4'; const resoluti...