Tagged articles
11 articles
Page 1 of 1
Xiaomi Tech
Xiaomi Tech
May 26, 2026 · Artificial Intelligence

Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks

Xiaomi Auto introduces a novel world‑model framework that tightly couples 3D reconstruction and generative prediction, delivering state‑of‑the‑art performance on Waymo and nuScenes benchmarks while enabling high‑fidelity, long‑duration video synthesis for autonomous‑driving scenarios.

3D ReconstructionAutonomous DrivingBenchmark SOTA
0 likes · 10 min read
Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks
Machine Heart
Machine Heart
May 10, 2026 · Artificial Intelligence

The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm

This survey introduces vision world models as a central driver for AI to learn physical and causal dynamics directly from visual data, presents a unified "representation‑learning‑simulation" framework, categorises four major technical routes, outlines evaluation metrics and datasets, and proposes a 3R roadmap for the next generation of world models.

Future DirectionsGenerative ModelingPhysical Reasoning
0 likes · 15 min read
The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Apr 2, 2026 · Artificial Intelligence

Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization

Diffolio introduces a diffusion‑model‑based approach that directly learns a pseudo‑optimal portfolio distribution conditioned on user risk preferences, generating diverse high‑quality portfolios and outperforming classic and recent baselines on six real‑world market datasets, with annualized returns improving up to 12.1 percentage points.

Financial AIGenerative ModelingQuantitative Finance
0 likes · 22 min read
Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization
Alimama Tech
Alimama Tech
Mar 26, 2026 · Industry Insights

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

The article analyzes the evolution from traditional modular recommendation models to a generative Large User Model (LUM), detailing its three‑stage paradigm, tokenization, training objectives, scaling‑law findings, offline and online experiments, and the AI‑infra innovations that enabled a 4.5% CTR lift in production.

CTR predictionGenerative ModelingRecommendation Systems
0 likes · 18 min read
How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 15, 2026 · Artificial Intelligence

A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper

A 17‑year‑old high‑school student from Anhui Ansheng School led the first‑author CVPR 2026 paper "CraftMesh," a novel 3D mesh editing framework that combines image editing, mesh generation, and Poisson seamless fusion, achieving superior quantitative metrics and showcasing the growing impact of young researchers in top AI conferences.

3D mesh generationCVPRCraftMesh
0 likes · 7 min read
A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper
AIWalker
AIWalker
Mar 4, 2026 · Artificial Intelligence

Drifting Models Enable One‑Step Generation, Shattering Speed Records

The paper introduces Drifting Models, a new generative paradigm that moves the distribution evolution to the training phase, achieving true one‑step (1‑NFE) generation with state‑of‑the‑art ImageNet FID scores of 1.54 in latent space and 1.61 in pixel space, while eliminating the need for distillation or classifier‑free guidance.

Drifting ModelsGenerative ModelingImageNet
0 likes · 24 min read
Drifting Models Enable One‑Step Generation, Shattering Speed Records
Data Party THU
Data Party THU
Oct 31, 2025 · Artificial Intelligence

Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough

MIT’s SpectroGen model uses physics‑informed generative AI to convert a single spectral modality into high‑fidelity cross‑modal spectra, achieving up to 99% correlation with experimental data, dramatically reducing the cost and time of material spectroscopy while preserving detailed spectral features.

AIGenerative ModelingMaterials Science
0 likes · 9 min read
Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough
AI Frontier Lectures
AI Frontier Lectures
Jul 8, 2025 · Artificial Intelligence

How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer

The LaVin-DiT paper presents a large vision diffusion transformer that integrates a spatio‑temporal variational auto‑encoder, a joint diffusion transformer with full‑sequence joint attention, and 3D rotary position encoding to enable unified, efficient multi‑task generation for images and videos, and details its training via flow‑matching and experimental results.

3D RoPEGenerative ModelingJoint Diffusion Transformer
0 likes · 12 min read
How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer
Kuaishou Tech
Kuaishou Tech
Jun 20, 2025 · Artificial Intelligence

How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment

The OneRec system from Kuaishou replaces traditional cascade recommendation pipelines with an encoder‑decoder architecture, leverages reward‑based preference alignment via reinforcement learning, achieves ten‑fold FLOPs gains, cuts operational costs by 90%, and delivers significant user‑engagement improvements across short‑video and local‑service scenarios.

Generative ModelingKuaishouOneRec
0 likes · 17 min read
How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment
AntTech
AntTech
Jun 4, 2025 · Artificial Intelligence

LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions

This article presents the LLaDA series of diffusion‑based large language models, explains how their generative‑modeling principle yields language intelligence comparable to autoregressive models, and details the multimodal LLaDA‑V architecture, training methods, experimental results, and broader implications for AI research.

Generative Modelingdiffusion modelsinstruction following
0 likes · 10 min read
LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions
DaTaobao Tech
DaTaobao Tech
Apr 7, 2025 · Artificial Intelligence

Flow Matching for Generative Modeling

Flow Matching reformulates generative modeling by learning a time‑dependent vector field that deterministically transports Gaussian noise to data, using a neural network trained with an analytically derived L2 loss, yielding simpler training, faster convergence, and deterministic sampling that matches or exceeds diffusion model quality.

AIGenerative Modelingcontinuous normalizing flow
0 likes · 13 min read
Flow Matching for Generative Modeling