Tagged articles
2 articles
Page 1 of 1
AntTech
AntTech
May 26, 2026 · Artificial Intelligence

Enabling Robots to “Think While Acting”: LingBot-VA Paper Accepted at RSS 2026

Researchers from AntLingbo and Hong Kong University present LingBot-VA, a causal world modeling framework for robot control that predicts future environment changes and generates actions, achieving up to 98.5% success on benchmarks and over 20‑point gains with only 50 real demonstrations, now open‑sourced after acceptance at RSS 2026.

LingBot-VAOpen SourceRSS 2026
0 likes · 5 min read
Enabling Robots to “Think While Acting”: LingBot-VA Paper Accepted at RSS 2026
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 9, 2026 · Artificial Intelligence

Heuristic Learning: Reinforcement Without Parameter Updates via .py File

OpenAI researcher Yong Jiayi introduces Heuristic Learning, a reinforcement paradigm that replaces gradient‑based neural network updates with code‑editing driven by GPT‑5.4, achieving the theoretical 864‑point Atari Breakout score and matching or surpassing PPO on multiple Atari and robot tasks.

Atari BenchmarkGPT-5.4continual learning
0 likes · 8 min read
Heuristic Learning: Reinforcement Without Parameter Updates via .py File