Black Forest Labs' FLUX.2 spans the full spectrum of image generation, from sub-second inference with [klein] to the highest quality with [max]. Generate photorealistic images with precise control over colors, poses, and composition, or edit existing images by referencing multiple sources at once (up to 10 in the playground, up to 8 via the API). Choose [klein] for real-time, high-volume generation, [pro] for production at scale, [flex] for fine-grained control, or [max] for maximum quality and grounding search.
Want to try first? Test FLUX.2 [max], [pro], and [flex] in our playground. [klein] is available via our API and on Hugging Face.

What Can You Do?

Combine elements from multiple images while maintaining identity across complex scenes. Create ad variants with consistent faces, product mockups in any context, or fashion editorials where models stay consistent.

Which Model to Choose?

| | [klein] | [max] | [pro] | [flex] | [dev] |
|---|---|---|---|---|---|
| Best for | Real-time, high-volume | Highest quality, final assets | Production at scale | Quality with control | Local development |
| Multi-reference | Up to 4 | Up to 8 (API), 10 (playground) | Up to 8 (API), 10 (playground) | Up to 8 (API), 10 (playground) | Recommended max 6 |
| Controls | Standard | Standard | Standard | Adjustable steps & guidance | Full customization |
| Grounding search | No | Yes | No | No | No |
| Pricing | from $0.014 / image | from $0.07 / MP | from $0.03 / MP | $0.06 / MP | Free (non-commercial) |

FLUX.2 [klein] delivers sub-second inference with open weights. 4B runs on consumer GPUs (~13GB VRAM). Apache 2.0 for 4B, FLUX NCL for 9B. See model details below.
FLUX.2 [max] includes grounding search: when prompted, it searches the web for real-time information, so it can visualize trending products, current events, or the latest styles without manually sourced reference material.
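
The multi-reference limits in the comparison table can be checked client-side before a request is sent. The sketch below is illustrative only: the dictionary keys and the `check_reference_count` helper are hypothetical names, not official identifiers, and the numbers are the API figures from the table ([dev] uses the recommended maximum rather than a hard limit).

```python
# Multi-reference limits copied from the comparison table (API figures).
# Keys and helper name are illustrative assumptions, not official identifiers.
MAX_REFERENCE_IMAGES = {
    "klein": 4,
    "max": 8,
    "pro": 8,
    "flex": 8,
    "dev": 6,  # recommended maximum rather than a hard limit
}


def check_reference_count(model: str, num_references: int) -> bool:
    """Return True if num_references fits within the table's limit for model."""
    if model not in MAX_REFERENCE_IMAGES:
        raise ValueError(f"unknown model: {model!r}")
    if num_references < 0:
        raise ValueError("num_references must be non-negative")
    return num_references <= MAX_REFERENCE_IMAGES[model]


if __name__ == "__main__":
    print(check_reference_count("klein", 4))  # at the limit for [klein]
    print(check_reference_count("klein", 5))  # one over the limit for [klein]
```

A check like this only mirrors the documented limits; the service itself remains the authority on what a request may contain.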

Compare FLUX.2 Models

At a Glance

[klein]

Sub-second inference. Our fastest models with open weights. The 4B model runs on consumer GPUs (~13GB VRAM). From $0.014/image via API, or run locally with Apache 2.0 (4B) / FLUX NCL (9B).

[max]

Maximum performance. Highest editing consistency across tasks. Vast world knowledge. Strongest prompt following and faithful style representation.

[pro]

Top performance at an affordable price. A high-quality, production-grade model for image editing and generation.

[flex]

Specialized for typography. Best for text rendering and preserving small details.

FLUX.2 [klein] Models

Figure: diverse output examples from FLUX.2 [klein].
Open weights available: [klein] 4B is fully open under Apache 2.0. [klein] 9B is available under the FLUX Non-Commercial License. Download from Hugging Face.

API Models

| | [klein] 4B | [klein] 9B |
|---|---|---|
| Best for | High volume, local deployment | Balanced quality and speed |
| Inference steps | 4 (step-distilled) | 4 (step-distilled) |
| Speed | Sub-second | Sub-second |
| API pricing | $0.014 + $0.001/MP | $0.015 + $0.002/MP |
| License | Apache 2.0 | FLUX Non-Commercial License |
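
To see how the two-part [klein] API pricing adds up, here is a small worked sketch. It assumes the pricing row reads as a fixed base fee plus a per-megapixel fee ($0.014 + $0.001/MP for 4B, $0.015 + $0.002/MP for 9B), that one megapixel is 1,000,000 output pixels, and that fractional megapixels are billed proportionally. The actual billing rules (rounding, input images) are not stated here, so treat the results as rough estimates. `KLEIN_PRICING` and `estimate_cost` are hypothetical names.

```python
from decimal import Decimal

# (base fee, per-megapixel fee) in USD, as read from the API pricing row.
# The billing model itself is an assumption; see the notes above.
KLEIN_PRICING = {
    "4b": (Decimal("0.014"), Decimal("0.001")),
    "9b": (Decimal("0.015"), Decimal("0.002")),
}


def estimate_cost(variant: str, width: int, height: int, images: int = 1) -> Decimal:
    """Estimate the USD cost of `images` outputs of size width x height."""
    base, per_mp = KLEIN_PRICING[variant]
    megapixels = Decimal(width * height) / Decimal(1_000_000)
    return (base + per_mp * megapixels) * images


if __name__ == "__main__":
    # One 1000x1000 image (exactly 1 MP) on each variant.
    print(estimate_cost("4b", 1000, 1000))
    print(estimate_cost("9b", 1000, 1000))
```

`Decimal` is used instead of `float` so that small per-image amounts do not pick up binary rounding noise when multiplied across large batches.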

Open Weights (Community)

The Base variants are undistilled foundation models with full training signal—ideal for fine-tuning, LoRA training, research, and custom pipelines. Higher output diversity than distilled models.

| | [klein] Base 4B | [klein] Base 9B |
|---|---|---|
| Best for | Fine-tuning, research, custom pipelines | Maximum quality, research |
| Output diversity | High | Highest |
| Step-distilled | No (full capacity) | No (full capacity) |
| License | Apache 2.0 | FLUX Non-Commercial License |
| Availability | Hugging Face | Hugging Face |

Base models are available as open weights for local development and research. They are not offered on the public API.
FLUX.2 [klein] does not include prompt upsampling. Write detailed, descriptive prompts for best results. See our Prompting Guide for techniques.

Preview Endpoints

Preview endpoints are where our latest improvements land first. They reflect our most recent advances in quality and speed.

| Endpoint | Description |
|---|---|
| flux-2-pro-preview | Our latest FLUX.2 [pro] model. |
| flux-2-pro | A fixed snapshot of FLUX.2 [pro]. This endpoint will not change, making it suitable for workflows that require reproducibility. |
| flux-2-klein-9b-preview | Our latest FLUX.2 [klein] 9B model with KV caching for improved performance. |
| flux-2-klein-9b | A fixed snapshot of FLUX.2 [klein] 9B. Choose this when you need reproducibility. |

Which endpoint should I use? For most use cases, the preview endpoints (flux-2-pro-preview, flux-2-klein-9b-preview) give you the best results. Choose the non-preview endpoints when you need a pinned model — for example, if your workflow depends on consistent outputs across runs or you have compliance requirements around model stability.
The flux-2-pro and flux-2-klein-9b endpoints are unchanged. If you are already using them, no action is required.
Both preview and non-preview endpoints share the same API contract — the request and response format is identical. Only the underlying model weights differ.
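
Because the preview and pinned endpoints share one contract, switching between them can come down to changing the endpoint name. The sketch below illustrates that idea under stated assumptions: the base URL `https://api.bfl.ai/v1`, the `x-key` header, and the `prompt` body field are not stated on this page and should be checked against the API reference. `pick_endpoint` and `build_request` are hypothetical helpers, and no request is actually sent.

```python
import json
import urllib.request

# Assumed base URL; not stated on this page.
API_BASE = "https://api.bfl.ai/v1"

# Endpoint names from the preview table: family -> (preview, pinned snapshot).
ENDPOINTS = {
    "pro": ("flux-2-pro-preview", "flux-2-pro"),
    "klein-9b": ("flux-2-klein-9b-preview", "flux-2-klein-9b"),
}


def pick_endpoint(family: str, reproducible: bool = False) -> str:
    """Return the pinned endpoint when reproducibility matters, else the preview."""
    preview, pinned = ENDPOINTS[family]
    return pinned if reproducible else preview


def build_request(endpoint: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request; header and body shape are assumptions."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{endpoint}",
        data=body,
        headers={"Content-Type": "application/json", "x-key": api_key},
        method="POST",
    )


if __name__ == "__main__":
    # Same request-building code for both choices; only the endpoint name differs.
    for reproducible in (False, True):
        req = build_request(pick_endpoint("pro", reproducible), "a red bicycle", "YOUR_API_KEY")
        print(req.get_method(), req.full_url)
```

Keeping the choice in a single function means a workflow can move from a preview endpoint to a pinned snapshot (or back) without touching the request-building code.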

Getting Started

Try in Playground

Test FLUX.2 [max], [pro], and [flex] in your browser. No setup required.

Download [klein] Weights

Get [klein] weights from Hugging Face for local inference.

Text-to-Image API

Generate images from text prompts.

Image Editing API

Edit images with multi-reference support.

Prompting Guide

Master prompting techniques — basics, style, JSON, editing, and use cases.

Local Development

Download [dev] weights for local inference.