Tagged articles
5 articles
Page 1 of 1
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
May 27, 2026 · Artificial Intelligence

Balancing Information Value and Platform Survival in Underwater UUV C2 Decision Making

The article presents a comprehensive C2 decision framework for underwater UUVs, defining core variables, rule‑based and game‑theoretic models, POMDP and Monte‑Carlo solutions, risk‑aware algorithms, multi‑UUV consensus, and practical three‑layer rule implementations to balance information gain against platform survivability.

Autonomous SystemsC2Decision Theory
0 likes · 14 min read
Balancing Information Value and Platform Survival in Underwater UUV C2 Decision Making
Data Party THU
Data Party THU
Sep 15, 2025 · Artificial Intelligence

Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents

This survey formalizes the shift from preference‑based reinforcement fine‑tuning to Agentic Reinforcement Learning, defines Agentic RL via MDP/POMDP abstractions, proposes a dual taxonomy of capabilities and task domains, compiles over 500 recent works, and outlines open challenges for scalable, robust AI agents.

AI AgentsLLMPOMDP
0 likes · 12 min read
Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 14, 2025 · Artificial Intelligence

How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading

The article reviews the MM‑DREX framework, which tackles the non‑stationarity of financial markets by modeling trading as a POMDP, employing a vision‑language model‑driven dynamic router to allocate four heterogeneous experts, and demonstrates superior returns, Sharpe ratios, and drawdown control across stocks, futures, and crypto compared with 15 strong baselines.

Dynamic RoutingLLMPOMDP
0 likes · 13 min read
How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading
Code DAO
Code DAO
Apr 28, 2022 · Artificial Intelligence

Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough

The article explains how to train robots to learn tasks directly from raw video using model-based reinforcement learning, covering POMDP formulation, CNN auto‑encoders, latent‑space representations, iLQR optimization, and a step‑by‑step pipeline with concrete examples and references.

CNN autoencoderPOMDPiLQR
0 likes · 11 min read
Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough
DataFunTalk
DataFunTalk
Feb 27, 2020 · Artificial Intelligence

Technical Challenges in Planning and Control for Autonomous Heavy Trucks

The article reviews the complex system model of autonomous heavy trucks, outlines traditional and modern planning and control methods—including rule‑based FSM, POMDP, learning‑based and optimization techniques—highlights safety, efficiency, fuel‑economy, and dynamic modeling challenges specific to heavy‑truck and trailer configurations, and shares practical attempts such as lane‑changing, merging, and trailer‑aware trajectory planning.

Autonomous DrivingPOMDPPlanning
0 likes · 13 min read
Technical Challenges in Planning and Control for Autonomous Heavy Trucks