Tag

AI image generation

1 views collected around this technical thread.

Code Mala Tang
Code Mala Tang
Jun 4, 2025 · Artificial Intelligence

Flux Kontext: How Open‑Weight AI Image Editing Beats GPT‑Image‑1

Flux Kontext, Black Forest Labs' new open‑weight AI image editing suite, enables fast, low‑cost contextual generation and editing with features such as role consistency, local edits, style transfer, and superior benchmark performance compared to GPT‑Image‑1, Imagen 4, and other leading models.

AI image generationFlux Kontextbenchmark performance
0 likes · 12 min read
Flux Kontext: How Open‑Weight AI Image Editing Beats GPT‑Image‑1
Architecture and Beyond
Architecture and Beyond
Nov 16, 2024 · Artificial Intelligence

ComfyUI Architecture Overview: Initialization, Node System, Execution Flow, Cache Mechanism and Usage Limits

This article provides a comprehensive technical overview of ComfyUI, an open‑source, node‑based Stable Diffusion UI, detailing its modular initialization steps, node system design, execution pipeline, hierarchical cache strategies, resource management, error handling, API interfaces, and practical usage limits.

AI image generationCache systemComfyUI
0 likes · 25 min read
ComfyUI Architecture Overview: Initialization, Node System, Execution Flow, Cache Mechanism and Usage Limits
Architecture and Beyond
Architecture and Beyond
Nov 2, 2024 · Artificial Intelligence

Step-by-Step Guide to Training a LoRA Model with Flux1_dev on ComfyUI

This tutorial walks programmers through preparing a GPU cloud environment, installing ComfyUI, downloading Flux1_dev models, integrating a custom LoRA, labeling generated images, and finally training the LoRA using ai‑toolkit, providing detailed commands, configuration tips, and practical cost estimates.

AI image generationComfyUIFlux
0 likes · 12 min read
Step-by-Step Guide to Training a LoRA Model with Flux1_dev on ComfyUI
Alimama Tech
Alimama Tech
Aug 16, 2024 · Artificial Intelligence

SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling

SPLAM (Sub‑Path Linear Approximation Model) accelerates diffusion‑model image synthesis by linearly approximating short sub‑paths of the probability‑flow ODE, allowing high‑quality generation in as few as four steps, outperforming prior fast‑sampling methods on COCO benchmarks and being deployed in Alibaba Mama’s recommendation system.

AI image generationSPLAMdiffusion models
0 likes · 11 min read
SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling
Ximalaya Technology Team
Ximalaya Technology Team
Feb 1, 2024 · Artificial Intelligence

Understanding AI Image Generation: Diffusion Models, CLIP, and Control Techniques

This guide explains how AI image generators such as Stable Diffusion and DALL·E 3 turn text prompts into pictures by using diffusion models, CLIP‑aligned embeddings, and optional controls like negative prompts, fine‑tuned LoRA checkpoints and ControlNet conditioning, highlighting their differences, workflow, and practical customization.

AI image generationClipControlNet
0 likes · 18 min read
Understanding AI Image Generation: Diffusion Models, CLIP, and Control Techniques
DaTaobao Tech
DaTaobao Tech
Jan 26, 2024 · Artificial Intelligence

Reference Object Guided AI Image Generation: Advances, Methods, and Home Furnishing Applications

The article surveys recent advances in reference‑object‑guided AI image generation, detailing diffusion‑based models such as Dreambooth and Blip‑diffusion, evaluating their trade‑offs, and demonstrating how combining these techniques with 3D reconstruction can realistically insert catalog furniture into users’ rooms, despite viewpoint and depth challenges.

AI image generationAIGCBlip-diffusion
0 likes · 9 min read
Reference Object Guided AI Image Generation: Advances, Methods, and Home Furnishing Applications
Baidu Geek Talk
Baidu Geek Talk
Nov 7, 2023 · Artificial Intelligence

Interview on AI Image Generation (Text-to-Image) Technology and Baidu Search Applications

In a recent InfoQ Geek Talk, Baidu Search chief architect Tianbao discussed the rapid evolution of AI text‑to‑image technology—highlighting Chinese‑language data preparation, prompt‑engineering challenges, evaluation methods combining human feedback and metrics, and future video‑generation prospects—while announcing openings for visual algorithm engineers.

AI image generationAIGCBaidu
0 likes · 24 min read
Interview on AI Image Generation (Text-to-Image) Technology and Baidu Search Applications
DaTaobao Tech
DaTaobao Tech
Oct 13, 2023 · Artificial Intelligence

Understanding Stable Diffusion: Core Principles and Technical Architecture

The article demystifies Stable Diffusion by explaining its low‑cost latent‑space design and conditioning mechanisms, comparing it to autoregressive, VAE, flow‑based and GAN models, detailing the iterative noise‑to‑image process, token‑based text‑to‑image control, version differences, common generation issues, and providing implementation code examples.

AI image generationCross-AttentionStable Diffusion
0 likes · 15 min read
Understanding Stable Diffusion: Core Principles and Technical Architecture
DaTaobao Tech
DaTaobao Tech
Aug 11, 2023 · Artificial Intelligence

Practical Guide to Stable Diffusion WebUI: Prompt Engineering, LoRA, VAE, and ControlNet

This practical guide walks users through installing Stable Diffusion WebUI, explains the differences between base, LoRA, VAE, and ControlNet models, shows how to derive prompts with CLIP or DeepBooru, and provides detailed text‑to‑image and image‑to‑image examples for effective prompt engineering.

AI image generationControlNetLoRA
0 likes · 12 min read
Practical Guide to Stable Diffusion WebUI: Prompt Engineering, LoRA, VAE, and ControlNet
DaTaobao Tech
DaTaobao Tech
Jun 16, 2023 · Artificial Intelligence

Introduction to Stable Diffusion: Concepts, Prompts, and Advanced Techniques

The article introduces Stable Diffusion, explains key terms and parameters, guides model checkpoint merging and fine‑tuning with embeddings, LoRA, and hypernetworks, details ControlNet pose control, sampling choices, prompt engineering techniques—including weighting and negative prompts—and explores advanced uses such as inpainting, Pix2Pix, custom training, highlighting personal and commercial applications and the technology’s growing impact across industries.

AI image generationControlNetStable Diffusion
0 likes · 18 min read
Introduction to Stable Diffusion: Concepts, Prompts, and Advanced Techniques
Top Architect
Top Architect
May 8, 2023 · Artificial Intelligence

Understanding Stable Diffusion: Architecture, Training, and Practical Applications

This article provides a comprehensive overview of Stable Diffusion, covering its latent diffusion architecture, training data and procedures, model components such as autoencoder, CLIP text encoder and UNet, as well as practical usage examples including text‑to‑image generation, image‑to‑image, inpainting, and advanced extensions like ControlNet and SD‑2.x.

AI image generationStable Diffusiondiffusion models
0 likes · 52 min read
Understanding Stable Diffusion: Architecture, Training, and Practical Applications
Selected Java Interview Questions
Selected Java Interview Questions
Apr 24, 2023 · Artificial Intelligence

Step-by-Step Guide to Using MidJourney AI Image Generation on Discord

This guide explains how to set up a Discord account, create or join a MidJourney server, meet the prerequisites, and use the /imagine command along with various upscale and variation options to generate AI images, including tips on image size and aspect ratio.

AI image generationArtificial IntelligenceDiscord
0 likes · 5 min read
Step-by-Step Guide to Using MidJourney AI Image Generation on Discord
Tencent Cloud Developer
Tencent Cloud Developer
Apr 10, 2023 · Artificial Intelligence

How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More

This guide explains how AI creates photorealistic images, tracing the shift from VAEs and GANs to diffusion models, detailing latent diffusion, ControlNet conditioning, CLIP text‑image alignment, and lightweight fine‑tuning methods like DreamBooth and LoRA, plus practical tips for higher‑resolution results.

AI image generationControlNetLoRA
0 likes · 22 min read
How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More
Tencent Cloud Developer
Tencent Cloud Developer
Apr 4, 2023 · Artificial Intelligence

Step-by-Step Guide to Building Your Own Realistic AI Image Generation Website with Stable Diffusion

This step‑by‑step tutorial shows how to set up a Stable Diffusion web UI, install the required Python environment and GPU‑enabled PyTorch, add Chinese localization and optional LoRA or Deforum extensions, generate realistic human images, create animated videos, and add speech with D‑ID, all ready for deployment on your own AI website.

AI image generationDeforumPython
0 likes · 9 min read
Step-by-Step Guide to Building Your Own Realistic AI Image Generation Website with Stable Diffusion
Tencent Cloud Developer
Tencent Cloud Developer
Nov 14, 2022 · Artificial Intelligence

Building an AI‑Powered Image Generation Mini‑Program with Go Backend and Tencent Cloud

The article walks through building a WeChat mini‑program that turns user‑typed text into cartoon‑style images by using Go to query Sogou’s picture search API, passing the first result to Tencent Cloud’s FaceCartoonPic service, and exposing the workflow through a simple HTTP endpoint.

AI image generationGo backendSogou API
0 likes · 15 min read
Building an AI‑Powered Image Generation Mini‑Program with Go Backend and Tencent Cloud