Author

PaperAgent

Daily updates, analyzing cutting-edge AI research papers

202

Articles

Likes

170

Views

Comments

Latest from PaperAgent

100 recent articles max

PaperAgent

Apr 22, 2026 · Artificial Intelligence

Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE

Alibaba has added four new open‑source weight versions to its Qwen3.6 series, featuring the 27‑billion‑parameter dense multimodal model Qwen3.6‑27B and the 35‑billion‑parameter sparse expert model Qwen3.6‑35B‑A3B, both designed for stable, real‑world coding tasks and outperforming their Qwen3.5 predecessors.

AI AgentsAlibabaDense Model

0 likes · 4 min read

Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE

PaperAgent

Apr 21, 2026 · Artificial Intelligence

How to Understand Agents: From Resource‑Constrained Decisions to Contextual Cognition

This survey clarifies the essence of AI agents as resource‑limited sequential decision‑making and contextual‑cognition systems, introduces a formal definition, outlines a five‑stage evolution of large models, presents a four‑loop architecture, and illustrates the concepts with the OpenClaw agent case study.

AI SurveyAgent ArchitectureContextual Cognition

0 likes · 11 min read

How to Understand Agents: From Resource‑Constrained Decisions to Contextual Cognition

PaperAgent

Apr 21, 2026 · Artificial Intelligence

OpenMythos: Rebuilding Claude Mythos with Recursive Transformers and MoE

OpenMythos is an open‑source PyTorch reimplementation of Anthropic's Claude Mythos that uses a mixed‑expert routed recurrent Transformer, introduces Recursive Depth Transformers, Multi‑Latent Attention, and several stability mechanisms, and demonstrates parameter‑efficient scaling backed by empirical studies.

AI ArchitectureClaude MythosMoE

0 likes · 6 min read

OpenMythos: Rebuilding Claude Mythos with Recursive Transformers and MoE

PaperAgent

Apr 20, 2026 · Artificial Intelligence

How 9 Parallel Claude Agents Surpassed Human Researchers in Weak‑to‑Strong Supervision

Anthropic’s Automated Weak‑to‑Strong Researcher (AAR) system uses nine parallel Claude Opus agents to replace human researchers, achieving a Performance Gap Recovered (PGR) of 0.97 in five days at a cost of about $18,000, demonstrating that AI‑driven automation can outperform humans on well‑defined alignment tasks.

AARAI alignmentClaude

0 likes · 9 min read

How 9 Parallel Claude Agents Surpassed Human Researchers in Weak‑to‑Strong Supervision

PaperAgent

Apr 17, 2026 · Artificial Intelligence

How Automated Harnesses Are Revolutionizing LLM Agents: Memory and Action Constraints

This article reviews two recent papers that introduce automated harness methods—M⋆ for task‑specific memory programs and AutoHarness for code‑level action constraints—detailing their designs, reflective evolution processes, experimental evaluations across diverse benchmarks, and the broader shift toward harness‑centric LLM agent research.

AgentAutoHarnessLLM

0 likes · 10 min read

How Automated Harnesses Are Revolutionizing LLM Agents: Memory and Action Constraints

PaperAgent

Apr 16, 2026 · Artificial Intelligence

Do LLMs Learn Hidden Preferences? Inside the Subliminal Learning Phenomenon

A recent Nature paper by Anthropic reveals that large language models can covertly transmit preferences and misaligned behaviors through unrelated data, demonstrating a "subliminal learning" effect that spans numbers, code, and chain‑of‑thought tasks and is driven by shared model initialization.

AnthropicLLMNature Paper

0 likes · 10 min read

Do LLMs Learn Hidden Preferences? Inside the Subliminal Learning Phenomenon

PaperAgent

Apr 15, 2026 · Artificial Intelligence

How Open‑Source Agent Harnesses Are Redefining LLM Deployments

The article analyzes the shift from proprietary Claude Managed Agents to open‑source frameworks like LangChain Deep Agents Deploy, detailing harness engineering, deployment steps, memory management, and the benefits of an open ecosystem for building production‑ready AI agents.

DeploymentHarness EngineeringLangChain

0 likes · 8 min read

How Open‑Source Agent Harnesses Are Redefining LLM Deployments

PaperAgent

Apr 15, 2026 · Artificial Intelligence

Can AI Run an Entire Research Project End‑to‑End? Inside the AiScientist Breakthrough

The article analyzes the AiScientist system, which aims to let AI autonomously drive long‑horizon machine‑learning research projects from paper comprehension through environment setup, code generation, experiment execution, log analysis and iterative refinement, and reports strong benchmark results that demonstrate its practical feasibility.

AI AgentsAiScientistautonomous research

0 likes · 11 min read

Can AI Run an Entire Research Project End‑to‑End? Inside the AiScientist Breakthrough

PaperAgent

Apr 14, 2026 · Artificial Intelligence

Can Neural Computers Replace Traditional CPUs? Inside the Latest AI Harness Designs

This article analyzes the emerging concept of Neural Computers, explains how Harness engineering unifies compute, memory, and I/O into a single learned runtime, reviews recent multimodal models from Anthropic, Meta, and OpenAI, and presents detailed experimental results from the NCCLIGen and NCGUIWorld prototypes.

Neural computerharness designmultimodal models

0 likes · 8 min read

Can Neural Computers Replace Traditional CPUs? Inside the Latest AI Harness Designs

PaperAgent

Apr 13, 2026 · Artificial Intelligence

How Externalizing Memory, Skills, and Protocols Powers Next‑Gen LLM Agents

This article reviews recent research on externalizing the cognitive load of LLM agents into structured infrastructure, covering the evolution from weight‑based models to context‑rich prompts and finally to Harness systems, and detailing the four externalization dimensions—memory, skills, protocols, and the Harness engineering layer.

ExternalizationMemoryProtocols

0 likes · 11 min read

How Externalizing Memory, Skills, and Protocols Powers Next‑Gen LLM Agents