Tagged articles

7 articles

Page 1 of 1

May 22, 2026 · Artificial Intelligence

A Systematic Review of the Latest Auto‑Research Landscape

The article presents a four‑phase, eight‑stage systematic analysis of AI‑driven auto‑research, exposing reliability gaps, bottlenecks, and best‑practice deployment through human‑governed collaboration, while detailing benchmarks, failure modes, and architectural families.

AI research automationauto-researchevaluation benchmarks

0 likes · 11 min read

A Systematic Review of the Latest Auto‑Research Landscape

SuanNi

May 20, 2026 · Artificial Intelligence

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

The article surveys AI‑assisted research across the full lifecycle—creation, writing, validation, and dissemination—detailing the capabilities of prompt engineering, retrieval‑augmented generation, training‑free agents and hybrid methods, reporting benchmark numbers, failure modes, and governance challenges that dictate when human oversight remains essential.

AI research automationGovernanceRetrieval-Augmented Generation

0 likes · 17 min read

AI‑Powered Research Workflow: When to Trust the Tools and When to Supervise

SuanNi

Apr 13, 2026 · Artificial Intelligence

How AI Researchers Built a 400% Better Multimodal Memory System with AutoResearchClaw

A fully automated AI research pipeline called AutoResearchClaw enabled a team from top universities to redesign a multimodal memory architecture, OMNIMEM, achieving over 400% performance gains on LoCoMo and Mem‑Gallery benchmarks by iteratively fixing code bugs, restructuring the system, and optimizing retrieval strategies.

AI research automationAutoResearchClawOMNIMEM

0 likes · 12 min read

How AI Researchers Built a 400% Better Multimodal Memory System with AutoResearchClaw

Machine Learning Algorithms & Natural Language Processing

Mar 26, 2026 · Artificial Intelligence

Harvard Physicist Uses Claude 4.5 to Write a Top‑Journal QCD Paper in Two Weeks

Harvard quantum‑field‑theory professor Matthew Schwartz trained Anthropic's Claude 4.5 as a G2‑level research assistant to solve a C‑parameter Sudakov‑shoulder resummation problem, and in just two weeks the model produced a 20‑page LaTeX draft, iterated 110 versions, consumed 36 million tokens, but required intensive human verification due to occasional fabricated results.

AI research automationAnthropicClaude 4.5

0 likes · 14 min read

Harvard Physicist Uses Claude 4.5 to Write a Top‑Journal QCD Paper in Two Weeks

PaperAgent

Mar 16, 2026 · Artificial Intelligence

How GLM-5-Turbo Turns an AI Research Lab into a 24‑Hour Autonomous Writer

The article details how the newly released GLM-5-Turbo "lobster" model powers an AI research Lab that automatically generates a complete OpenClaw survey paper—from topic brainstorming and literature mining to outline drafting, manuscript writing, and AAAI‑style submission—within an hour, showcasing benchmark results, prompt templates, and practical skill installations.

AI research automationAutoClawGLM-5-Turbo

0 likes · 10 min read

How GLM-5-Turbo Turns an AI Research Lab into a 24‑Hour Autonomous Writer

Machine Learning Algorithms & Natural Language Processing

Mar 9, 2026 · Artificial Intelligence

Can Self‑Iterating AI Agents Run on a Single GPU? Karpathy’s Autoresearch Demo

Karpathy’s open‑source “autoresearch” project demonstrates how a compact LLM training environment on a single GPU can let an AI agent autonomously modify code, run five‑minute training experiments, evaluate improvements, and iteratively produce better models, illustrating a new research paradigm where AI conducts experiments while humans design the system.

AI research automationAutoResearchKarpathy

0 likes · 6 min read

Can Self‑Iterating AI Agents Run on a Single GPU? Karpathy’s Autoresearch Demo

PaperAgent

Dec 1, 2025 · Artificial Intelligence

How Deep Research Turns LLMs into Autonomous AI Scientists

This article surveys the emerging Deep Research (DR) paradigm that upgrades large language models into research agents capable of autonomous planning, multi‑source evidence gathering, memory management, and verifiable long‑form report generation, outlining its stages, core components, training pipeline, and evaluation benchmarks.

AI agentsAI research automationLLM agents