Author

PaperAgent

Daily updates, analyzing cutting-edge AI research papers

202

Articles

Likes

170

Views

Comments

Latest from PaperAgent

100 recent articles max

PaperAgent

Mar 28, 2026 · Artificial Intelligence

How ACCORD Breaks Concept Coupling in Custom Text‑to‑Image Generation

The ACCORD framework formalizes the concept‑coupling issue in text‑to‑image diffusion models as a statistical dependency problem and resolves it with two plug‑and‑play regularization losses, dramatically improving fidelity and text control without altering model architecture.

ACCORD · AI research · concept coupling

0 likes · 7 min read

PaperAgent

Mar 27, 2026 · Artificial Intelligence

How MASFactory Redefines Multi‑Agent System Orchestration with Graph‑Centric Vibe Graphing

MASFactory, an open‑source graph‑centric framework from Beijing University of Posts and Telecommunications, replaces hard‑coded multi‑agent orchestration with a natural‑language‑driven Vibe Graphing paradigm, offering a four‑layer architecture, human‑in‑the‑loop safety, and benchmark‑validated performance across seven public tasks.

AI orchestration · Graph-Centric Architecture · MASFactory

0 likes · 7 min read

PaperAgent

Mar 26, 2026 · Artificial Intelligence

TurboQuant: How Google’s New Vector Quantization Cuts KV Memory 6× and Boosts Speed

TurboQuant, presented at ICLR 2026, is a theoretically grounded vector quantization technique that cuts large‑language‑model key‑value cache memory by at least six times and delivers up to eight‑fold speedups with no reported accuracy loss. It combines PolarQuant’s polar‑coordinate compression with a 1‑bit QJL error‑correction step, with results demonstrated on benchmarks such as LongBench and GloVe.

AI inference · Memory compression · TurboQuant

0 likes · 10 min read

PaperAgent

Mar 22, 2026 · Artificial Intelligence

How AI Agents Like OpenClaw Turn LLMs into Autonomous Assistants

This article explains what AI agents are, how they differ from ordinary language‑model interfaces, and walks through OpenClaw’s workflow, tool usage, security challenges, memory handling, and advanced features such as sub‑agents and context compaction, offering practical insights for building safe autonomous AI systems.

AI agent · Context Engineering · Large Language Model

0 likes · 27 min read

PaperAgent

Mar 22, 2026 · Artificial Intelligence

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

The article analyzes the Memento‑Skills framework, which treats external memory as executable skills to enable deployment‑time continual learning for frozen LLM agents, detailing its read‑write reflective loop, skill‑as‑memory design, behavior‑trained skill router, experimental validation on GAIA and HLE benchmarks, and theoretical guarantees without gradient updates.

AI · Agent · LLM

0 likes · 9 min read

PaperAgent

Mar 21, 2026 · Artificial Intelligence

How Cursor’s Composer 2 Leverages Self‑Summarization and RL for Long‑Horizon Tasks

The article examines Cursor’s Composer 2 model, detailing its self‑summarization reinforcement‑learning workflow, the limitations of traditional compression methods, token‑efficient results on the CursorBench benchmark, and a challenging Terminal‑Bench case study that demonstrates dramatically reduced token usage while improving performance.

Composer 2 · Cursor · Self‑Summarization

0 likes · 9 min read

PaperAgent

Mar 21, 2026 · Artificial Intelligence

Can AI Truly Be Creative? Inside the CreativeBench Benchmark

This article examines the CreativeBench benchmark, which redefines machine creativity by measuring both the quality and novelty of generated solutions, explains its combinatorial and exploratory task designs, details the self‑evolving task construction process, and discusses key findings and the EvoRePE enhancement method.

AI Benchmark · EvoRePE · large language models

0 likes · 18 min read

PaperAgent

Mar 21, 2026 · Artificial Intelligence

Can Peer Review Boost Large Language Model Ensembles? Introducing LLM‑PeerReview

This article analyzes LLM‑PeerReview, an unsupervised framework that combines multiple large language models through a peer‑review‑inspired pipeline of scoring, reasoning, and selection, including a novel flipped‑triple scoring trick, and achieves significant performance gains over existing ensemble and collaboration baselines.

Artificial Intelligence · Flipped Triple Scoring · LLM Ensemble

0 likes · 11 min read

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How Scale‑SWE’s Real‑World Software Engineering Dataset Supercharges AI Models

The Scale‑SWE project releases a 100k‑task real software‑engineering dataset built with a sandboxed multi‑agent workflow, demonstrating that models fine‑tuned on this data achieve 64% on SWE‑bench‑Verified and surpass leading industrial baselines, highlighting the critical value of authentic SWE data.

AI Agents · Qwen3-30A3B-Instruct · Scale-SWE

0 likes · 7 min read

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity Summarization · KG QA · LLM

0 likes · 5 min read
