Tagged articles

9 articles

Page 1 of 1

May 31, 2026 · Artificial Intelligence

Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains

The article introduces the ICML 2026 Theory of Agent (ToA), analyzes four common failure modes of modern agents, explains the internal‑vs‑external tool trade‑off through a knowledge‑boundary framework, and outlines how effort‑conservation and the β parameter guide self‑evolving agent design and future research.

AI agentsICML 2026Theory of Agent

0 likes · 24 min read

Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains

Machine Heart

May 31, 2026 · Artificial Intelligence

LMNet: Enabling Language Models to Self‑Organize into Networks

The paper introduces Language Model Networks (LMNet), a framework that lets pretrained large language models act as reusable compute nodes communicating via dense, trainable vectors, showing measurable performance gains on general and supervised adaptation tasks with minimal extra training cost.

ICML 2026LLM collaborationLMNet

0 likes · 10 min read

LMNet: Enabling Language Models to Self‑Organize into Networks

Machine Heart

May 22, 2026 · Artificial Intelligence

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

The paper presents MP‑MoE, a new Mixture‑of‑Experts architecture that replaces top‑k routing with Mahalanobis‑based ensemble pruning, explicitly encouraging expert diversity via a co‑occurrence matrix, and uses an efficient greedy algorithm with incremental Cholesky updates, achieving higher performance with minimal training overhead and no inference cost.

Dynamic RoutingEnsemble PruningExpert Diversity

0 likes · 8 min read

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

Machine Learning Algorithms & Natural Language Processing

May 21, 2026 · Artificial Intelligence

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

The paper introduces PACE, a Parameter‑Change based Unsupervised Environment Design method that evaluates training levels by the magnitude of induced policy‑parameter updates, offering a low‑variance, computationally cheap signal that consistently outperforms prior UED approaches on MiniGrid and Craftax benchmarks.

CraftaxCurriculum LearningICML 2026

0 likes · 11 min read

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

Machine Heart

May 21, 2026 · Artificial Intelligence

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

The paper introduces PACE, a Parameter Change Environment Design method that evaluates training levels by measuring induced policy parameter updates, offering a low‑variance learning‑progress signal that outperforms prior UED approaches on MiniGrid and Craftax benchmarks, achieving higher success rates and more stable generalization.

CraftaxCurriculum LearningICML 2026

0 likes · 10 min read

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

Machine Learning Algorithms & Natural Language Processing

May 14, 2026 · Artificial Intelligence

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

APO (Autonomous Preference Optimization) converts the drift and conflict among multiple teacher multimodal LLMs into dynamic negative constraints while treating consensus as a positive preference, enabling robust concept alignment and superior diagnostic accuracy on the CXR‑MAX benchmark, as demonstrated by extensive ICML‑2026 experiments.

APOICML 2026Knowledge Distillation

0 likes · 11 min read

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

Machine Heart

May 13, 2026 · Artificial Intelligence

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

The paper introduces APO, a novel autonomous preference optimization framework that converts concept drift among multiple teacher multimodal LLMs into dynamic negative constraints and treats consensus as a positive preference, achieving robust concept alignment and surpassing strong teachers on a high‑risk medical X‑ray benchmark.

APOCXR-MAXICML 2026

0 likes · 11 min read

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

Machine Learning Algorithms & Natural Language Processing

Feb 16, 2026 · Artificial Intelligence

How ICML 2026 Used Prompt Injection to Trap Automated Reviewers

Reviewers discovered hidden text in ICML 2026 PDFs that injects specific phrases into large‑language‑model generated reviews, turning an attack technique into a defense mechanism and prompting new safeguards such as watermarking and OCR‑based checks.

AI securityAcademic Peer ReviewICML 2026

0 likes · 6 min read

How ICML 2026 Used Prompt Injection to Trap Automated Reviewers

DataFunTalk

Nov 6, 2025 · Artificial Intelligence

What New AI Policies Are Shaping ICML 2026 Submissions?

ICML 2026 opens paper submissions with strict AI usage rules—LLMs cannot be listed as authors, prompt injection is banned, and AI reviewing is expanded—while outlining submission formats, important dates, reciprocal review limits, and ethical guidelines for authors.

AI policyICML 2026Machine Learning

0 likes · 11 min read

What New AI Policies Are Shaping ICML 2026 Submissions?