Skip to content
Agentifact
ToolsBlueprintsBugsTrending
Submit a Tool+
  1. Tools
  2. /Image Generation
RelatedBlueprintsBugsReplacements

Category

Image Generation

49 toolsAvg score 64

Text-to-image APIs, diffusion model endpoints, and creative AI platforms for agents that generate or manipulate visual content.

Filters

We only list tools that meet minimum quality standards.

49 tools

Sort:
Together AI Image Generation logo

Together AI Image Generation

FULL AUTO
82
Trust score

Together AI is an AI cloud platform offering 200+ models for text, code, and image generation via a unified API. For images, Together hosts FLUX.1 Schnell (with 3 months free access), FLUX.1 Pro, and Ideogram 3.0, priced per megapixel with step-based adjustments. The platform emphasizes developer experience with fast inference, serverless deployment, and a single API key covering all modalities. AI agent builders who want a one-stop API for both LLM and image generation tasks will find Together AI's unified approach particularly convenient.

AGENT
85
TRUST
92
INTEROP
72
SECURE
75
DOCS
85
Verified Mar 2026REST
View details →
Fireworks AI logo

Fireworks AI

FULL AUTO
81
Trust score

Fireworks AI is a fast inference platform offering serverless access to open-source and fine-tuned models, including image generation via SDXL, FLUX, and custom checkpoints. The platform delivers up to 4x faster inference than alternatives using NVIDIA Blackwell GPUs and supports batch processing at a 40% discount over real-time endpoints. Pricing is usage-based with no monthly minimums. Developers building AI agents that require high-throughput image generation — especially alongside language model calls — benefit from Fireworks' multi-modal coverage under a single account.

AGENT
92
TRUST
82
INTEROP
85
SECURE
82
DOCS
65
Verified Mar 2026MCPREST
View details →
RunwayML API logo

RunwayML API

FULL AUTO
80
Trust score

Runway is a leading AI video and image generation company offering a developer API for Gen-4 image generation ($0.08 per image), Gen-4 Turbo video (5 credits/second), and a full suite of video editing and generation models. Credits are purchased at $0.01 each in the developer portal. The Runway API is purpose-built for professional media production, offering camera control, multi-shot consistency, and high-resolution output. For AI agent developers building automated video production systems, content marketing pipelines, or film-quality creative tools, Runway provides the most cinematically capable API available.

AGENT
85
TRUST
85
INTEROP
70
SECURE
75
DOCS
85
rate limits per usage tier
Verified Mar 2026REST
View details →
OpenAI GPT-4o Image Generation API logo

OpenAI GPT-4o Image Generation API

FULL AUTO
80
Trust score

OpenAI's image generation API, now powered by the gpt-image-1 model (formerly DALL-E 3), enables developers to generate and edit high-quality images from text prompts via a simple REST API. The model excels at photorealistic scenes, creative illustrations, and accurate text rendering within images. Pricing starts at $0.04–$0.19 per image depending on quality and size. For AI agent builders, it provides a reliable, OpenAI-native image generation capability that integrates seamlessly alongside GPT-4o in multi-modal pipelines.

AGENT
95
TRUST
85
INTEROP
70
SECURE
65
DOCS
85
Verified Mar 2026REST
View details →
Leonardo.AI API logo

Leonardo.AI API

FULL AUTO
79
Trust score

Leonardo.AI offers a comprehensive image and video generation API designed for developers building creative applications, game studios, and content platforms. The API supports text-to-image, image-to-image, LoRA fine-tuning, upscaling, and their proprietary Alchemy v4 pipeline. Every new API account starts with $5 in free credits, with pay-as-you-go pricing thereafter. Leonardo's Phoenix model architecture and built-in support for training custom styles make it particularly useful for AI agents that need stylistically consistent or branded visual output.

AGENT
75
TRUST
85
INTEROP
85
SECURE
65
DOCS
85
Verified Mar 2026MCPREST
View details →
Ideogram API logo

Ideogram API

NEEDS APPROVAL
79
Trust score

Ideogram is a text-to-image API renowned for its industry-leading typography rendering — it can accurately place readable text inside generated images, something most other models struggle with. The API serves millions of images daily and is popular for marketing materials, posters, social media graphics, and branded design assets. Ideogram 3.0 introduced character reference features for consistent facial and character traits across generations. Developers building AI agents for design automation, ad creative, or marketing content will find Ideogram's text accuracy and layout control uniquely valuable.

AGENT
85
TRUST
85
INTEROP
70
SECURE
72
DOCS
85
image links expire and must be downloaded to persistAPI key generation may take time and window must not be refreshed during process
Verified Mar 2026REST
View details →
Remove.bg API logo

Remove.bg API

FULL AUTO
77
Trust score

Remove.bg is a dedicated background removal API that uses AI to detect and extract subjects from any image with high accuracy. The API offers a free tier of 50 calls per month, with paid plans scaling from €3 to €89 per month, plus enterprise pricing for bulk volumes over 100,000 images per year. The integration is as simple as a single POST request, and the tool supports bulk processing for pipelines. For AI agent builders automating product photography, avatar generation, or composite image creation, Remove.bg is the fastest and most reliable background removal solution.

AGENT
85
TRUST
85
INTEROP
65
SECURE
65
DOCS
85
rate limit exceeded returns HTTP 429 with headers for limit/remaining/resetlimited to images with foreground like people/products/animals/cars
Verified Mar 2026REST
View details →
Google Imagen API (Vertex AI) logo

Google Imagen API (Vertex AI)

FULL AUTO
77
Trust score

Google's Imagen API, accessible via Vertex AI, provides text-to-image generation, image editing, and upscaling powered by Google DeepMind's Imagen models (currently Imagen 3 and Imagen 4). The API integrates into Google Cloud infrastructure and supports safety filters and watermarking via SynthID. Pricing is approximately $0.02–$0.04 per standard image. Developers using Google Cloud for their AI agent infrastructure will benefit from native integration, consistent access controls, and Google's enterprise-grade SLAs for high-volume production use.

AGENT
85
TRUST
85
INTEROP
70
SECURE
75
DOCS
72
Verified Mar 2026REST
View details →
Black Forest Labs Flux API logo

Black Forest Labs Flux API

FULL AUTO
77
Trust score

Black Forest Labs is the creator of the FLUX family of image generation models, including FLUX.1 Pro, FLUX.1 Dev, FLUX.1 Schnell, and the latest FLUX.2 series. FLUX models are widely regarded as producing the best open-weight image quality available, with strong prompt adherence and photorealism. API access is available directly via the BFL platform as well as through inference providers like fal.ai, Replicate, Together AI, and Cloudflare. Commercially licensed variants (flux-1.1-pro) make it suitable for production use in AI agent applications.

AGENT
82
TRUST
85
INTEROP
60
SECURE
75
DOCS
85
rate limit under burst load above 10-50 RPM
Verified Mar 2026REST
View details →
Recraft AI API logo

Recraft AI API

FULL AUTO
76
Trust score

Recraft AI provides a professional image and vector generation API that natively outputs true SVG vector graphics from text prompts — not raster-to-vector traces, but clean scalable paths directly from the model. Recraft V4 (released February 2026) is available in standard (1MP) and Pro (4MP) variants, and the API supports inpainting, outpainting, background removal, and over 100 style presets for visual consistency. Trusted by teams at Netflix, Microsoft, and HubSpot, Recraft is the go-to API for AI agents generating brand assets, icons, illustrations, or any design output that requires scalable vector format.

AGENT
75
TRUST
75
INTEROP
75
SECURE
82
DOCS
72
Verified Mar 2026MCPREST
View details →
Adobe Firefly API logo

Adobe Firefly API

FULL AUTO
76
Trust score

Adobe Firefly Services provides over 30 generative AI and creative APIs built on Adobe's commercially safe AI models, which were trained exclusively on licensed Adobe Stock images and public domain content. The API includes text-to-image, generative fill, generative expand, background removal, and Firefly Custom Models for brand-consistent generation. Enterprise plans include IP indemnification for generated content. For AI agents deployed in enterprise marketing or content workflows, Firefly is the gold standard for legal safety and Adobe Creative Cloud integration.

AGENT
65
TRUST
85
INTEROP
75
SECURE
82
DOCS
75
Verified Mar 2026REST
View details →
Stability AI API logo

Stability AI API

FULL AUTO
75
Trust score

Stability AI's developer platform provides API access to their state-of-the-art Stable Diffusion models, including Stable Image Ultra, Stable Diffusion 3.5 Large, and Stable Image Core. The credit-based pricing system starts at $0.01 per credit, with image generation costing 3.5–8 credits depending on model and resolution. As the leading open-weights model family in the industry, Stability AI's API is a go-to for developers building image generation into agents, pipelines, or products that require fine-grained control over generation parameters.

AGENT
75
TRUST
85
INTEROP
65
SECURE
75
DOCS
75
timeout-after-specified-secondserror-on-invalid-api-key
Verified Mar 2026REST
View details →
Getimg.ai API logo

Getimg.ai API

FULL AUTO
74
Trust score

Getimg.ai is a comprehensive image generation API hub that provides access to Stable Diffusion variants, SDXL, ControlNet, and other open-source models through a single, well-documented REST endpoint. API pricing is calculated per 1 million pixel-steps, ranging from $0.0006 to $0.015 depending on model complexity, making it one of the more cost-efficient options for high-volume generation. The platform also supports real-time generation and image editing endpoints. For AI agent developers who want fine-grained control over Stable Diffusion parameters without managing their own GPU infrastructure, Getimg.ai is a solid choice.

AGENT
85
TRUST
82
INTEROP
65
SECURE
65
DOCS
75
Verified Mar 2026REST
View details →
Luma AI Dream Machine API logo

Luma AI Dream Machine API

FULL AUTO
73
Trust score

Luma AI's Dream Machine API provides state-of-the-art image and video generation in a single integrated workflow, built by former Google researchers. The API supports text-to-image, text-to-video, image-to-video, video extension, and camera motion control via natural language. Video generation is priced at $0.32 per million pixels generated. With over 25 million users and models like Ray2 that produce coherent motion and ultra-realistic detail, Luma is a compelling choice for AI agents that need to generate cinematic images or short video clips as part of creative content pipelines.

AGENT
85
TRUST
82
INTEROP
70
SECURE
65
DOCS
65
Verified Mar 2026REST
View details →
Krea AI logo

Krea AI

73
Trust score

Krea AI is a real-time image generation platform that updates visuals in under 50ms as users draw or modify prompts, using Latent Consistency Models for near-zero-latency creative feedback. The platform hosts over 150 models and serves a community of over 10 million users. Krea provides an SDK for developers who want to embed real-time generation into their own applications, such as custom design tools or interactive creative experiences. For AI agent builders designing interactive generative workflows — where instant visual feedback is required — Krea's real-time architecture sets it apart from batch-style inference APIs.

AGENT
85
TRUST
75
INTEROP
80
SECURE
85
DOCS
40
credit exhaustion limits usagefree tier highly restrictive
Verified Mar 2026MCPREST
View details →
Diffusers (Hugging Face) logo

Diffusers (Hugging Face)

FULL AUTO
73
Trust score

Hugging Face Diffusers is the de facto Python library for state-of-the-art pretrained diffusion models, providing inference pipelines, interchangeable noise schedulers, and model components as modular building blocks for image, video, and audio generation. The library abstracts over the full Stable Diffusion family, FLUX, SDXL, and dozens of specialized models, with integrations for ControlNet, LoRA adapters, IP-Adapter, and inpainting workflows in just a few lines of code. Diffusers is open-source under the Apache 2.0 license and is free to use. For AI agent developers, Diffusers is the foundational Python SDK for building programmatic, composable generation pipelines rather than using GUI tools.

AGENT
72
TRUST
82
INTEROP
60
SECURE
65
DOCS
85
Verified Mar 2026
View details →
Adobe Firefly logo

Adobe Firefly

NEEDS APPROVAL
73
Trust score

Adobe Firefly is Adobe's commercially safe generative AI platform for image, video, audio, and vector graphic creation, with all training data properly licensed to minimize legal risk for commercial users. As of early 2026 Firefly offers unlimited image and video generations on select plans and integrates third-party models including GPT Image Generation and Runway Gen-4. Plans range from a free tier to Firefly Premium at $199.99/month. Firefly is natively embedded in Photoshop, Illustrator, and other Creative Cloud apps, making it essential for AI builders integrating into professional creative workflows.

AGENT
75
TRUST
85
INTEROP
60
SECURE
82
DOCS
65
throttling applied after monthly credit depletionreduced speeds on standard features when credits exhausted
Verified Mar 2026REST
View details →
Photoroom API logo

Photoroom API

FULL AUTO
72
Trust score

Photoroom's API provides AI-powered product photography tools including background removal, AI background generation, object isolation, and image compositing — all designed for e-commerce scale. The Studio model generates photorealistic product scene backgrounds, and the API is a standard REST interface compatible with any DAM or product catalog system. Brands using Photoroom's AI backgrounds have reported over 50% higher sell-through rates. For AI agents handling product listing automation, dropshipping workflows, or catalog management, Photoroom provides production-grade image transformation at scale.

AGENT
75
TRUST
85
INTEROP
45
SECURE
70
DOCS
85
free trial revocation on suspected abuse
Verified Mar 2026REST
View details →
Civitai logo

Civitai

71
Trust score

Civitai is the largest open-source AI model sharing platform and community for Stable Diffusion and FLUX, hosting over 50,000 community-uploaded models including checkpoints, LoRAs, embeddings, hypernetworks, and textual inversions. The platform also provides a web-based image generation interface for running any hosted model in-browser without local hardware. A distinctive feature is that preview images include embedded prompt metadata, making it straightforward to reproduce any community-shared output. Civitai is free to use with optional paid features, and serves as the primary distribution channel for fine-tuned models in the open-source AI image ecosystem — essential for developers sourcing specialized models for agent pipelines.

AGENT
65
TRUST
65
INTEROP
85
SECURE
60
DOCS
78
inconsistent moderationarbitrary account bans+1 more
Verified Mar 2026MCPREST
View details →
Topaz Labs AI Enhancement API logo

Topaz Labs AI Enhancement API

FULL AUTO
70
Trust score

Topaz Labs offers a professional-grade AI image and video enhancement API trusted by Google, NASA, Nike, and Tesla, serving over 3 million users. The API provides upscaling, face recovery, noise reduction, sharpening, and image colorization via both synchronous and asynchronous endpoints with Autopilot mode that automatically selects the best enhancement settings. Two model classes are available: Standard (fast and fidelity-preserving) and Generative (highest quality, creative output). For AI agent pipelines that need batch enhancement of generated or raw images for production use, Topaz Labs is the industry standard.

AGENT
85
TRUST
65
INTEROP
65
SECURE
65
DOCS
72
Verified Mar 2026REST
View details →
Segmind API logo

Segmind API

FULL AUTO
70
Trust score

Segmind is a developer-first platform for automating image and video generation workflows, best known for its Flux fine-tuning API that allows training custom image models on as few as 10 reference images. The API supports text-to-image, image-to-image, and model fine-tuning with webhook notifications for async job tracking. Fine-tuned models can be used to generate brand characters, product visuals, and game assets with consistent style. For AI agent developers building personalized creative tools or branded content pipelines, Segmind's fine-tuning infrastructure is a standout capability.

AGENT
72
TRUST
65
INTEROP
65
SECURE
65
DOCS
85
rate limit under burst load above 60 RPM on Flexible plansession timeout not explicitly documented
Verified Mar 2026REST
View details →
Upscayl logo

Upscayl

69
Trust score

Upscayl is the leading free and open-source AI image upscaler for Windows, macOS, and Linux, built on Real-ESRGAN and Vulkan GPU acceleration for cross-platform hardware compatibility. The desktop application can enhance images up to 16x their original resolution by using AI models to intelligently reconstruct fine detail, and supports batch processing for multiple images simultaneously. All processing runs entirely locally with no cloud uploads required, ensuring full data privacy. Upscayl is available under the AGPLv3 license and is essential for AI agent pipelines that generate images and need a local, cost-free upscaling step before delivery.

AGENT
85
TRUST
75
INTEROP
40
SECURE
60
DOCS
85
Verified Mar 2026
View details →
Kling AI API logo

Kling AI API

FULL AUTO
69
Trust score

Kling AI, developed by Kuaishou (one of the world's largest video platforms), offers an official developer API for text-to-video, image-to-video, lip sync, motion brush, and AI effects generation. Kling 3.0 (launched February 2026) uses a unified multimodal framework to generate synchronized video and audio in a single pass, supporting up to native 4K resolution at 60 FPS and 15-second clips. The official API is accessible at the Kling AI developer center. For AI agents building video content automation, social media generation, or cinematic production pipelines, Kling is among the highest-quality video generation APIs available.

AGENT
85
TRUST
85
INTEROP
65
SECURE
65
DOCS
45
concurrent-session-limit-of-590-day-credit-expiry
Verified Mar 2026REST
View details →
ComfyUI logo

ComfyUI

68
Trust score

ComfyUI is the most powerful and modular open-source diffusion model GUI and backend, using a visual node-based flowchart interface to design and execute Stable Diffusion pipelines without writing code. Its graph architecture only re-executes nodes that change between runs, enabling efficient iterative workflows, and smart memory management allows running large models on GPUs with as little as 1GB VRAM. ComfyUI supports all major model formats and extensions including ControlNet, IP-Adapter, LoRA, FLUX, and custom node packages from a large community ecosystem. For developers building agent workflows around image generation, ComfyUI's API backend mode provides programmatic workflow execution over HTTP, making it a powerful self-hosted generation server.

AGENT
72
TRUST
65
INTEROP
75
SECURE
65
DOCS
65
Verified Mar 2026REST
View details →
Page 1 of 3

Explore by category

MCP ServersHITL ProvidersA2A AgentsFrameworks57Workflow TemplatesProtocols29
Agentifact

The trust index for the agent economy. Every tool scored on agent-readiness, trust, interoperability, security, and documentation quality.

Explore
  • Tools
  • Blueprints
  • Bugs
  • Builders
  • Trending
  • Replacements
Reference
  • Skills
  • Integrations
  • Lexicon
  • Sources
  • Guides
Community
  • Voices
  • Benchmarks
  • Stack Layers
Company
  • About
  • Methodology
  • Submit a Tool
  • Contact
  • Disclosure
  • Privacy
  • Terms
Quick filtersNew This WeekFree Tools
© 2026 Agentifact. Independent editorial. Scores verified against live infrastructure.
PrivacyTermsSitemap