Author

Machine Heart

Professional AI media and industry service platform

432

Articles

Likes

336

Views

Comments

Latest from Machine Heart

100 recent articles max

Machine Heart

May 31, 2026 · Artificial Intelligence

Can Low-Bit Models Cut Inference Costs Better Than Small Models?

The article analyzes how low‑bit quantization differs from simply using smaller LLMs, examines hardware‑level precision reduction, compares post‑training quantization with native low‑bit designs, and explains the runtime and testing requirements needed to achieve real inference cost savings.

LLM inferencecost optimizationhardware acceleration

0 likes · 7 min read

Can Low-Bit Models Cut Inference Costs Better Than Small Models?

Machine Heart

May 31, 2026 · Artificial Intelligence

Microsoft’s SkillOpt Turns Agent Skill Docs into Trainable Parameters for Self‑Evolving AI

Microsoft’s newly open‑source SkillOpt framework treats an agent’s skill document as external weights, applying a rollout‑reflect‑edit‑gate training loop with textual learning rates and rejected‑edit buffers, enabling self‑evolving skills that achieve optimal or tied‑optimal results across 52 model‑benchmark‑environment combinations.

AI agentsMicrosoftSkillOpt

0 likes · 12 min read

Microsoft’s SkillOpt Turns Agent Skill Docs into Trainable Parameters for Self‑Evolving AI

Machine Heart

May 31, 2026 · Artificial Intelligence

Defining a Good Answer in the Agent Era: A Rubrics Survey

This survey examines how rubrics can decompose the vague notion of a "good answer" for large language models into concrete, multi‑dimensional evaluation criteria, detailing their definition, construction methods, applications in training and evaluation, and the open challenges they present.

AI alignmentagentic AIevaluation

0 likes · 13 min read

Defining a Good Answer in the Agent Era: A Rubrics Survey

Machine Heart

May 31, 2026 · Artificial Intelligence

How a Child’s Finger‑Drawn Moustache Fooled AI Age Verification (and Made Engineers Speechless)

When Discord switched to a teen‑by‑default policy, users discovered that a simple thumb sketch with two eyes and a mouth could trick the on‑device AI age estimator into granting adult access, exposing the limits of lightweight facial analysis models.

AI age verificationDiscordMeta

0 likes · 6 min read

How a Child’s Finger‑Drawn Moustache Fooled AI Age Verification (and Made Engineers Speechless)

Machine Heart

May 31, 2026 · Artificial Intelligence

LMNet: Enabling Language Models to Self‑Organize into Networks

The paper introduces Language Model Networks (LMNet), a framework that lets pretrained large language models act as reusable compute nodes communicating via dense, trainable vectors, showing measurable performance gains on general and supervised adaptation tasks with minimal extra training cost.

ICML 2026LLM collaborationLMNet

0 likes · 10 min read

LMNet: Enabling Language Models to Self‑Organize into Networks

Machine Heart

May 31, 2026 · Artificial Intelligence

How a Near‑Invisible Image Can Make GPT‑5.4 and Claude Opus 4.6 Spread False Claims

Researchers from ETH Zurich show that tiny, human‑imperceptible perturbations to a single image can fool leading visual language models—including GPT‑5.4, Claude Opus 4.6, and Grok—into confidently delivering fabricated answers, enabling misinformation amplification, defamation, content‑filter evasion, and large‑scale AI authority laundering.

AI safetyClaude OpusGPT-5.4

0 likes · 7 min read

How a Near‑Invisible Image Can Make GPT‑5.4 and Claude Opus 4.6 Spread False Claims

Machine Heart

May 30, 2026 · Artificial Intelligence

Autogenesis: A Self‑Evolving Agent OS That Drives Near‑Perfect C++ LeetCode Scores

The paper introduces the Autogenesis Protocol (AGP), a two‑layer resource‑governed framework that lets agents safely modify their own prompts, tools, memory and environment, and demonstrates its effectiveness with the AGS system achieving 93.33% GAIA validation accuracy and near‑full scores on C++ LeetCode problems.

AGPAutogenesisGAIA benchmark

0 likes · 11 min read

Autogenesis: A Self‑Evolving Agent OS That Drives Near‑Perfect C++ LeetCode Scores

Machine Heart

May 30, 2026 · Artificial Intelligence

From 6 to 8: DeliAutoResearch SKILL’s Leap in Continual Learning and Self‑Iteration

The paper presents a unified three‑axis framework for continual learning and self‑iteration, classifies over a hundred prior works into five method categories, formalizes convergence conditions, highlights a jump from a 6‑point to an 8‑point peer‑review score, and outlines six open research challenges for autonomous LLMs.

AI autonomycontinual learninglarge language models

0 likes · 11 min read

From 6 to 8: DeliAutoResearch SKILL’s Leap in Continual Learning and Self‑Iteration

Machine Heart

May 30, 2026 · Artificial Intelligence

How Abstract Symbols Cut AI Inference Cost by 11×

The article examines IBM Research's Abstract‑CoT approach, which replaces verbose natural‑language chain‑of‑thought reasoning with a compact abstract token vocabulary, achieving up to an 11‑fold reduction in inference tokens while maintaining comparable accuracy across math, instruction‑following, and multi‑hop QA benchmarks.

AI inferenceAbstract-CoTchain-of-thought

0 likes · 11 min read

How Abstract Symbols Cut AI Inference Cost by 11×

Machine Heart

May 30, 2026 · Artificial Intelligence

Beyond Single-Agent: Survey of Collaboration, Attribution, and Self‑Evolution in LLM Multi‑Agents

This survey introduces the LIFE framework for LLM‑based multi‑agent systems, outlining four stages—from individual agent capabilities through collaborative structures, failure attribution, to systemic self‑evolution—while analyzing how role design, communication, and scheduling affect performance, error propagation, and adaptive improvement.

AI SurveyCollaborationFailure Attribution

0 likes · 10 min read

Beyond Single-Agent: Survey of Collaboration, Attribution, and Self‑Evolution in LLM Multi‑Agents