Tagged articles

7 articles

May 29, 2026 · Artificial Intelligence

ControlFoley: An Open‑Source Model for Fully Controllable Video Sound Generation

ControlFoley, released by Xiaomi's large‑model team, is an open‑source framework that lets creators generate video‑aligned sound effects while explicitly controlling content, style, and timing through text prompts, video dubbing, or reference audio, achieving SOTA performance on multiple benchmarks.

ControlFoley · Multimodal · Open Source

0 likes · 15 min read

Bighead's Algorithm Notes

Mar 22, 2026 · Artificial Intelligence

DigMA: Controllable Generation of Financial Market Data – A Deep Dive

This article reviews DigMA, a model that uses a diffusion‑guided meta‑agent to generate high‑fidelity, controllable order‑flow data for financial markets. It covers the problem formulation, the architecture, training on Chinese stock datasets, and extensive experiments, including a reinforcement‑learning‑based high‑frequency trading evaluation, and reports superior accuracy and ultra‑low‑latency generation.

Financial Market Simulation · Meta‑Agent · controllable generation

0 likes · 16 min read

AIWalker

Aug 19, 2025 · Artificial Intelligence

DynamicFace: Controllable High‑Quality Face Swapping for Images and Video

DynamicFace introduces a diffusion‑based framework that explicitly decouples identity, pose, expression, illumination and background using composable 3D facial priors, achieving superior identity preservation, motion consistency and visual fidelity in both image and video face‑swapping tasks.

3D facial priors · controllable generation · diffusion models

0 likes · 13 min read

Xiaohongshu Tech REDtech

Aug 18, 2025 · Artificial Intelligence

DynamicFace: Composable 3D Facial Priors for High‑Quality, Consistent Face Swaps

DynamicFace introduces a controllable face‑swapping framework that leverages composable 3D facial priors, dual‑stream identity injection, and a FusionTVO module to achieve superior image and video quality, identity preservation, and temporal consistency, outperforming existing state‑of‑the‑art methods on benchmark datasets.

3D facial priors · AI · controllable generation

0 likes · 13 min read

Kuaishou Tech

May 26, 2025 · Artificial Intelligence

CineMaster: A 3D‑Aware and Controllable Framework for Cinematic Text‑to‑Video Generation

Researchers introduce CineMaster in a SIGGRAPH 2025 paper: a 3D‑aware, controllable text‑to‑video generation framework that lets users define target objects and camera motions through an interactive workflow, enabling high‑quality, user‑directed cinematic video creation.

3D-aware · AI Video · CineMaster

0 likes · 6 min read

Xiaohongshu Tech REDtech

Sep 2, 2024 · Artificial Intelligence

How AIGC Transforms Advertising Material Creation on Xiaohongshu

This article analyzes how large‑model AIGC reshapes the production, evaluation, and deployment of advertising creatives on Xiaohongshu, detailing the business motivations, technical pipeline, controllable generation, reward‑model filtering, and experimental results that balance commercial efficiency with community tone.

AIGC · Advertising · Large Language Model

0 likes · 14 min read

360 Tech Engineering

Apr 17, 2024 · Artificial Intelligence

HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation

The 360 AI Research Institute introduces HiCo, a hierarchical controllable diffusion model that enables fine‑grained layout control across up to eight image regions, integrates seamlessly with existing Stable Diffusion ecosystems, and demonstrates superior performance on the GRIT‑VAL benchmark for layout‑aware image synthesis.

AI drawing · HiCo · controllable generation

0 likes · 8 min read
