Flux better than Stable Diffusion
FLUX by Black Forest Labs provides three models for text-to-image and image-to-image transformations, requires local installation, supports interactive image generation, and integrates with Hugging Face Diffusers library.
Read original articleThe GitHub repository for FLUX, developed by Black Forest Labs, provides minimal inference code for text-to-image and image-to-image transformations using Flux latent rectified flow transformers. It offers three models: `FLUX.1 [pro]`, a base model accessible via API; `FLUX.1 [dev]`, a guidance-distilled variant; and `FLUX.1 [schnell]`, a guidance and step-distilled variant. Installation can be done locally by cloning the repository and setting up a Python virtual environment. Users can interactively sample or generate single images by executing specific Python commands. Additionally, the models are integrated with the Hugging Face Diffusers library, facilitating their use in various applications. Access to the pro model is available through an API, which requires user registration and an API key. For further details, users can refer to the FLUX GitHub repository.
- FLUX offers three distinct models for different use cases.
- Installation requires cloning the repository and setting up a Python virtual environment.
- Users can generate images through interactive sampling or single sample commands.
- The models are compatible with the Hugging Face Diffusers library.
- API access for the pro model necessitates registration and an API key.
Related
AuraFlow v0.1: a open source alternative to Stable Diffusion 3
AuraFlow v0.1 is an open-source large rectified flow model for text-to-image generation. Developed to boost transparency and collaboration in AI, it optimizes training efficiency and achieves notable advancements.
Black Forest Labs – FLUX.1 open weights SOTA text to image model
Black Forest Labs has launched to develop generative deep learning models for media, securing $31 million in funding. Their FLUX.1 suite includes three model variants, outperforming competitors in image synthesis.
The open weight Flux text to image model is next level
Black Forest Labs has launched Flux, the largest open-source text-to-image model with 12 billion parameters, available in three versions. It features enhanced image quality and speed, alongside the release of AuraSR V2.
Forget Midjourney – Flux is the new king of AI image generation
Flux, an open-source AI image generator by Black Forest Labs, competes with Midjourney and Stable Diffusion, offering three versions and a developing text-to-video model for enhanced media production.
Show HN: Flux AI Image Generator Webapp
Flux AI Image Generator by Black Forest Labs converts text to high-quality images using a 12-billion parameter model. It offers three versions for diverse applications and emphasizes user-friendly features.
iirc, a lot of the devs who left Stable Diffusion went on to found/join Black Forest Labs.
FWIW I haven’t tried it for that reason alone. Curious to hear from people plugged into leaderboard competitions — how does this rank objectively? I feel like image models are super hard to evaluate though, to be fair. All I can find is instructions, but no centralized results
Eg https://huggingface.co/docs/diffusers/en/conceptual/evaluati...
Related
AuraFlow v0.1: a open source alternative to Stable Diffusion 3
AuraFlow v0.1 is an open-source large rectified flow model for text-to-image generation. Developed to boost transparency and collaboration in AI, it optimizes training efficiency and achieves notable advancements.
Black Forest Labs – FLUX.1 open weights SOTA text to image model
Black Forest Labs has launched to develop generative deep learning models for media, securing $31 million in funding. Their FLUX.1 suite includes three model variants, outperforming competitors in image synthesis.
The open weight Flux text to image model is next level
Black Forest Labs has launched Flux, the largest open-source text-to-image model with 12 billion parameters, available in three versions. It features enhanced image quality and speed, alongside the release of AuraSR V2.
Forget Midjourney – Flux is the new king of AI image generation
Flux, an open-source AI image generator by Black Forest Labs, competes with Midjourney and Stable Diffusion, offering three versions and a developing text-to-video model for enhanced media production.
Show HN: Flux AI Image Generator Webapp
Flux AI Image Generator by Black Forest Labs converts text to high-quality images using a 12-billion parameter model. It offers three versions for diverse applications and emphasizes user-friendly features.