Tagged articles
7 articles
Xiaomi Tech
May 29, 2026 · Artificial Intelligence

ControlFoley: An Open‑Source Model for Fully Controllable Video Sound Generation

ControlFoley, released by Xiaomi's large‑model team, is an open‑source framework for generating video‑aligned sound effects. Creators can explicitly control content, style, and timing through text prompts, video dubbing, or reference audio, and the model reports state‑of‑the‑art (SOTA) results on multiple benchmarks.

ControlFoley · Multimodal · Open Source
15 min read
Bighead's Algorithm Notes
Mar 22, 2026 · Artificial Intelligence

DigMA: Controllable Generation of Financial Market Data – A Deep Dive

This article reviews DigMA, a model that uses a diffusion‑guided meta‑agent to generate high‑fidelity, controllable order‑flow data for financial markets. It covers the problem formulation, architecture, training on Chinese stock datasets, and extensive experiments, including a reinforcement‑learning‑based high‑frequency trading evaluation, and reports superior accuracy and ultra‑low‑latency generation.

Financial Market Simulation · Meta‑Agent · controllable generation
16 min read
AIWalker
Aug 19, 2025 · Artificial Intelligence

DynamicFace: Controllable High‑Quality Face Swapping for Images and Video

DynamicFace introduces a diffusion‑based framework that uses composable 3D facial priors to explicitly decouple identity, pose, expression, illumination, and background, achieving superior identity preservation, motion consistency, and visual fidelity in both image and video face‑swapping tasks.

3D facial priors · controllable generation · diffusion models
13 min read
Xiaohongshu Tech REDtech
Aug 18, 2025 · Artificial Intelligence

DynamicFace: Composable 3D Facial Priors for High‑Quality, Consistent Face Swaps

DynamicFace introduces a controllable face‑swapping framework that leverages composable 3D facial priors, dual‑stream identity injection, and a FusionTVO module to achieve superior image and video quality, identity preservation, and temporal consistency, outperforming existing state‑of‑the‑art methods on benchmark datasets.

3D facial priors · AI · controllable generation
13 min read
Xiaohongshu Tech REDtech
Sep 2, 2024 · Artificial Intelligence

How AIGC Transforms Advertising Material Creation on Xiaohongshu

This article analyzes how large‑model AIGC reshapes the production, evaluation, and deployment of advertising creatives on Xiaohongshu. It details the business motivations, the technical pipeline, controllable generation, reward‑model filtering, and experimental results that balance commercial efficiency with community tone.

AIGC · Advertising · Large Language Model
14 min read
360 Tech Engineering
Apr 17, 2024 · Artificial Intelligence

HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation

The 360 AI Research Institute introduces HiCo, a hierarchical controllable diffusion model that enables fine‑grained layout control across up to eight image regions, integrates seamlessly with existing Stable Diffusion ecosystems, and demonstrates superior performance on the GRIT‑VAL benchmark for layout‑aware image synthesis.

AI drawing · HiCo · controllable generation
8 min read