Tagged articles
9 articles
Page 1 of 1
Data Party THU
Data Party THU
May 31, 2026 · Artificial Intelligence

Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains

The article introduces the ICML 2026 Theory of Agent (ToA), analyzes four common failure modes of modern agents, explains the internal‑vs‑external tool trade‑off through a knowledge‑boundary framework, and outlines how effort‑conservation and the β parameter guide self‑evolving agent design and future research.

AI agentsICML 2026Theory of Agent
0 likes · 24 min read
Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains
Machine Heart
Machine Heart
May 31, 2026 · Artificial Intelligence

LMNet: Enabling Language Models to Self‑Organize into Networks

The paper introduces Language Model Networks (LMNet), a framework that lets pretrained large language models act as reusable compute nodes communicating via dense, trainable vectors, showing measurable performance gains on general and supervised adaptation tasks with minimal extra training cost.

ICML 2026LLM collaborationLMNet
0 likes · 10 min read
LMNet: Enabling Language Models to Self‑Organize into Networks
Machine Heart
Machine Heart
May 22, 2026 · Artificial Intelligence

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

The paper presents MP‑MoE, a new Mixture‑of‑Experts architecture that replaces top‑k routing with Mahalanobis‑based ensemble pruning, explicitly encouraging expert diversity via a co‑occurrence matrix, and uses an efficient greedy algorithm with incremental Cholesky updates, achieving higher performance with minimal training overhead and no inference cost.

Dynamic RoutingEnsemble PruningExpert Diversity
0 likes · 8 min read
Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 21, 2026 · Artificial Intelligence

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

The paper introduces PACE, a Parameter‑Change based Unsupervised Environment Design method that evaluates training levels by the magnitude of induced policy‑parameter updates, offering a low‑variance, computationally cheap signal that consistently outperforms prior UED approaches on MiniGrid and Craftax benchmarks.

CraftaxCurriculum LearningICML 2026
0 likes · 11 min read
Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development
Machine Heart
Machine Heart
May 21, 2026 · Artificial Intelligence

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

The paper introduces PACE, a Parameter Change Environment Design method that evaluates training levels by measuring induced policy parameter updates, offering a low‑variance learning‑progress signal that outperforms prior UED approaches on MiniGrid and Craftax benchmarks, achieving higher success rates and more stable generalization.

CraftaxCurriculum LearningICML 2026
0 likes · 10 min read
Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 14, 2026 · Artificial Intelligence

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

APO (Autonomous Preference Optimization) converts the drift and conflict among multiple teacher multimodal LLMs into dynamic negative constraints while treating consensus as a positive preference, enabling robust concept alignment and superior diagnostic accuracy on the CXR‑MAX benchmark, as demonstrated by extensive ICML‑2026 experiments.

APOICML 2026Knowledge Distillation
0 likes · 11 min read
Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)
Machine Heart
Machine Heart
May 13, 2026 · Artificial Intelligence

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

The paper introduces APO, a novel autonomous preference optimization framework that converts concept drift among multiple teacher multimodal LLMs into dynamic negative constraints and treats consensus as a positive preference, achieving robust concept alignment and surpassing strong teachers on a high‑risk medical X‑ray benchmark.

APOCXR-MAXICML 2026
0 likes · 11 min read
Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)
DataFunTalk
DataFunTalk
Nov 6, 2025 · Artificial Intelligence

What New AI Policies Are Shaping ICML 2026 Submissions?

ICML 2026 opens paper submissions with strict AI usage rules—LLMs cannot be listed as authors, prompt injection is banned, and AI reviewing is expanded—while outlining submission formats, important dates, reciprocal review limits, and ethical guidelines for authors.

AI policyICML 2026Machine Learning
0 likes · 11 min read
What New AI Policies Are Shaping ICML 2026 Submissions?