Tagged articles

13 articles

Page 1 of 1

May 28, 2026 · Artificial Intelligence

Can a Pre‑trained Embodied Model Work Out‑of‑the‑Box? New Chinese Open‑Source VLA Model Shows Yes

The newly open‑sourced Wall‑OSS‑0.5 VLA model demonstrates that a large‑scale pre‑trained embodied robot brain can achieve strong zero‑shot performance on 17 real‑world tasks, exhibit staircase emergence with longer pre‑training, and far surpass the industry baseline after fine‑tuning, while also revealing current precision limits.

Embodied AIVLAbenchmark

0 likes · 15 min read

Can a Pre‑trained Embodied Model Work Out‑of‑the‑Box? New Chinese Open‑Source VLA Model Shows Yes

Machine Heart

May 14, 2026 · Artificial Intelligence

Introducing TTFA: Hong Kong University’s Open‑Source FASTER Gives VLA Models Instant Reaction

The paper identifies real‑time latency as the main obstacle for deploying VLA models on robots, proposes the TTFA metric and the FASTER framework with a Horizon‑Aware Schedule, mixed scheduling and streaming inference, and demonstrates through extensive GPU and task experiments that TTFA and reaction time can be cut by up to three‑fold without sacrificing motion quality.

Embodied AIFASTERReal-time Inference

0 likes · 14 min read

Introducing TTFA: Hong Kong University’s Open‑Source FASTER Gives VLA Models Instant Reaction

PaperAgent

May 9, 2026 · Artificial Intelligence

How ActDistill Slashes Deployment Costs of VLA Large Models

ActDistill, proposed by Tongji University and collaborators, reduces the inference latency, compute consumption, and action-loop speed of Vision‑Language‑Action (VLA) models by selectively distilling action‑relevant knowledge, achieving up to 1.67× speedup while preserving control quality on real robot hardware.

ActDistillDynamic RoutingEfficiency

0 likes · 13 min read

How ActDistill Slashes Deployment Costs of VLA Large Models

Machine Heart

Apr 30, 2026 · Artificial Intelligence

How LWD Redefines Embodied AI Training with Fleet‑Scale Reinforcement Learning

LWD (Learning While Deploying) introduces a distributed multi‑robot reinforcement‑learning framework that continuously improves VLA policies during real‑world deployment, leveraging DIVL, QAM, dynamic n‑step TD and an asynchronous actor‑learner architecture to achieve over 90% success on five‑minute tasks and outperform traditional behavior‑cloning, HG‑Dagger and RECAP baselines.

Embodied AILWDVLA

0 likes · 13 min read

How LWD Redefines Embodied AI Training with Fleet‑Scale Reinforcement Learning

Alibaba Cloud Big Data AI Platform

Apr 22, 2026 · Artificial Intelligence

How to Build an End‑to‑End Hand‑Video to VLA Data Pipeline on Alibaba Cloud PAI with Data‑Juicer

This article details a step‑by‑step, distributed pipeline built on Alibaba Cloud PAI using Data‑Juicer and Ray that transforms raw egocentric hand videos into LeRobot v2.0‑compatible Vision‑Language‑Action (VLA) training data, covering video splitting, frame extraction, camera calibration, 3D hand reconstruction, pose estimation, action captioning, and export, with code snippets, performance numbers, and references.

Data-JuicerDistributed computingEmbodied AI

0 likes · 29 min read

How to Build an End‑to‑End Hand‑Video to VLA Data Pipeline on Alibaba Cloud PAI with Data‑Juicer

Machine Heart

Apr 22, 2026 · Artificial Intelligence

China’s AlphaBrain Platform Launches First Full‑Stack Open‑Source Brain‑Like VLA

The AlphaBrain Platform, an open‑source embodied‑intelligence suite from China’s AI² Robotics, combines a world‑model stack, the pioneering NeuroVLA brain‑like model with spiking‑neuron actions, low‑cost RL‑Token training, and cross‑architecture continuous learning, all validated on leading robotics benchmarks.

AlphaBrainEmbodied IntelligenceNeuroVLA

0 likes · 11 min read

China’s AlphaBrain Platform Launches First Full‑Stack Open‑Source Brain‑Like VLA

Machine Heart

Apr 17, 2026 · Artificial Intelligence

Can π0.7 Unlock Compositional Generalization and Cross‑Embodiment Transfer for VLA?

The new π0.7 model from Physical Intelligence demonstrates emergent compositional generalization and cross‑embodiment transfer in visual‑language‑action (VLA) robots by leveraging massive heterogeneous data and richly structured prompts, outperforming specialist Recap models on tasks such as air‑fryer cooking, clothing folding, and coffee making.

VLAcompositional generalizationcross-embodiment transfer

0 likes · 11 min read

Can π0.7 Unlock Compositional Generalization and Cross‑Embodiment Transfer for VLA?

PaperAgent

Apr 13, 2026 · Artificial Intelligence

How Keyframe‑Chaining VLA Gives Robots Long‑Term Memory and Faster Reasoning

The article introduces the Keyframe‑Chaining VLA (KC‑VLA) framework, which replaces dense video sampling with semantic keyframe linking to provide robots with global temporal awareness, presents a new long‑term memory benchmark, and demonstrates superior performance in both simulation and real‑world robotic experiments.

AIKeyframe ChainingLong-term Memory

0 likes · 9 min read

How Keyframe‑Chaining VLA Gives Robots Long‑Term Memory and Faster Reasoning

AI Explorer

Mar 17, 2026 · Artificial Intelligence

RISE Enables Breakthrough in Vision‑Language‑Action Learning for Embodied AI

The article examines the limitations of vision‑language‑action (VLA) models in real‑world tasks, explains how the RISE technique from Hong Kong University uses internal simulation, reflection and imagination to cut training costs by an order of magnitude, and discusses its implications for future embodied AI.

Embodied AIRISEVLA

0 likes · 6 min read

RISE Enables Breakthrough in Vision‑Language‑Action Learning for Embodied AI

Baidu Geek Talk

Feb 2, 2026 · Artificial Intelligence

How Cloud AI Infra Powers the Next Wave of Embodied Intelligence

This article outlines the rapid rise of embodied intelligence, the explosion of Vision‑Language‑Action (VLA) research, and how cloud‑based AI infrastructure—including multi‑level IaaS, data pipelines, dual‑system model designs, and reinforcement‑learning workflows—addresses emerging scaling and deployment challenges.

VLAmultimodal modelsreinforcement learning

0 likes · 13 min read

How Cloud AI Infra Powers the Next Wave of Embodied Intelligence

DataFunSummit

Jan 17, 2026 · Artificial Intelligence

How UnrealZoo Accelerates Embodied AI Research with High‑Fidelity Simulation

This article outlines the evolution from traditional AI to embodied intelligence, explains the Vision‑Language‑Action (VLA) paradigm, highlights data‑collection bottlenecks, introduces the UnrealZoo simulation platform built on Unreal Engine, and showcases real‑world case studies and future challenges for embodied AI research.

Embodied AISimulationUnreal Engine

0 likes · 16 min read

How UnrealZoo Accelerates Embodied AI Research with High‑Fidelity Simulation

PaperAgent

Jan 8, 2026 · Artificial Intelligence

How SOP Enables Scalable Online Post-Training for Real‑World Robots

The SOP (Scalable Online Post‑training) framework redesigns VLA post‑training from offline, single‑machine, sequential processing to a distributed, parallel online system, allowing robot fleets to continuously learn, share experiences, and scale intelligence while maintaining stability and generalization in complex real‑world environments.

Online LearningSOPVLA

0 likes · 11 min read

How SOP Enables Scalable Online Post-Training for Real‑World Robots

AI Frontier Lectures

May 31, 2025 · Artificial Intelligence

Why Embodied Intelligence Is Exploding and What It Means for the Future

The article analyzes the recent surge in embodied intelligence, examines why physical agents matter despite advances in large language models, outlines common failure modes, discusses key research decisions such as 2D versus 3D perception and tactile sensing, and explores the roles of imitation learning, VLA, and reinforcement learning in shaping the field.

VLAVisionimitation learning

0 likes · 24 min read

Why Embodied Intelligence Is Exploding and What It Means for the Future