Author

Machine Learning Algorithms & Natural Language Processing

Focused on frontier AI technologies, empowering AI researchers' progress.

322

Articles

Likes

237

Views

Comments

Latest from Machine Learning Algorithms & Natural Language Processing

100 recent articles max

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

How PilotDeck’s Open‑Source Agent Cuts Token Costs by 70% with Parallel Workspaces

PilotDeck, an open‑source agent operating system from Tsinghua and partners, introduces isolated workspaces, transparent memory and smart routing that together reduce token expenses by up to 70% while keeping performance, and it demonstrates these gains through a milk‑tea game, a data‑visualisation dashboard, and a programmer‑personality test.

AgentMemoryOpenSource

0 likes · 12 min read

How PilotDeck’s Open‑Source Agent Cuts Token Costs by 70% with Parallel Workspaces

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

A New Paradigm for GUI Agent Trajectory Generation: FSM‑Synthesized Data at $0.04 per Trajectory

AutoWebWorld introduces a finite‑state‑machine‑driven pipeline that synthesizes verified web‑GUI trajectories at an average cost of only $0.04 each, producing longer interaction sequences, scaling efficiently, and demonstrably improving large‑language‑model agents on WebVoyager and grounding benchmarks.

AutoWebWorldData GenerationFinite State Machine

0 likes · 13 min read

A New Paradigm for GUI Agent Trajectory Generation: FSM‑Synthesized Data at $0.04 per Trajectory

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · R&D Management

Why Company Structure Is the New Moat in the AI Era

The article argues that in fast‑moving AI markets, a firm’s organizational shape—not its product or technology—has become the most durable competitive advantage, illustrating this with examples from OpenAI, Anthropic and Palantir.

AIAnthropicOpenAI

0 likes · 14 min read

Why Company Structure Is the New Moat in the AI Era

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

Synthesizing Agentic Factual SFT/Mid‑train Data: Query Filtering, Trajectory Generation, and Tool Usage

The article outlines a practical pipeline for creating agentic factual SFT and mid‑train datasets, covering how to define training goals, filter and classify queries, label processing tags, format trajectory samples, differentiate SFT from mid‑train data, and avoid common pitfalls when generating evidence‑driven AI training data.

SFTagentic AIdata synthesis

0 likes · 10 min read

Synthesizing Agentic Factual SFT/Mid‑train Data: Query Filtering, Trajectory Generation, and Tool Usage

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

Open‑Source 35B Intern‑S2‑Preview Rivals Trillion‑Parameter Models on Scientific Benchmarks

The open‑source 35‑billion‑parameter Intern‑S2‑Preview model achieves scientific‑task performance comparable to trillion‑parameter models, thanks to full‑link “general‑specialized” training, reinforced‑learning scaling, and hardware‑aware optimizations, and it outperforms leading closed‑source models on benchmarks such as MolecularIQ and crystal‑structure generation.

InternLMLarge Language ModelOpen Source

0 likes · 11 min read

Open‑Source 35B Intern‑S2‑Preview Rivals Trillion‑Parameter Models on Scientific Benchmarks

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Industry Insights

Can’t Publish in Nature? Your Rubbish Paper Might Still Get Noticed

The article examines the satirical “Rubbish” journal—an impact‑factor‑zero outlet that accepts failed experiments and quirky research, its rapid rise on social media, the wave of similar “bottom‑journal” imitators, and what this phenomenon reveals about pressure in modern academic publishing.

Rubbish journalacademic publishingimpact factor

0 likes · 6 min read

Can’t Publish in Nature? Your Rubbish Paper Might Still Get Noticed

Machine Learning Algorithms & Natural Language Processing

May 28, 2026 · Artificial Intelligence

6 Practical Tips for Using Codex in Research Projects

The article shares a six‑step workflow for leveraging Codex in research tasks—read the codebase first, define long‑term AGENTS.md rules, always plan complex changes, verify before coding, start a new session after each task, and never trust the final output without a detailed audit.

AGENTS.mdAI Code GenerationCodex

0 likes · 7 min read

6 Practical Tips for Using Codex in Research Projects

Machine Learning Algorithms & Natural Language Processing

May 26, 2026 · Artificial Intelligence

AI Trends in Medical Imaging: From Recognition to Workflow Automation (CVPR'26)

The article reviews CVPR 2026 medical imaging papers, highlighting a shift from pure image recognition toward efficient model adaptation, clinical semantic understanding, and cross‑modal reasoning, with examples ranging from simple AI agents optimizing workflows to multimodal foundation models for CT, ultrasound, spatial transcriptomics, IMU‑video alignment, and dual‑view X‑ray analysis.

AICVPR 2026Multimodal

0 likes · 24 min read

AI Trends in Medical Imaging: From Recognition to Workflow Automation (CVPR'26)

Machine Learning Algorithms & Natural Language Processing

May 26, 2026 · Artificial Intelligence

Teaching 7,000 Languages: How LASA’s Semantic Bottleneck Enables Multilingual LLM Safety

The paper reveals a language‑agnostic "semantic bottleneck" layer inside large language models and introduces LASA, a three‑step framework that locates this layer, extracts safety signals with a lightweight interpreter, and injects them via KTO loss, dramatically improving multilingual safety without per‑language data collection.

AI safetyLASALLM safety

0 likes · 8 min read

Teaching 7,000 Languages: How LASA’s Semantic Bottleneck Enables Multilingual LLM Safety

Machine Learning Algorithms & Natural Language Processing

May 26, 2026 · Artificial Intelligence

Inside the GPT-5.6 Leak: 1.5M Token Context, Super‑Intelligent Agents, and a UI Revolution

A leaked OpenAI GPT‑5.6 model (iris‑alpha) promises a 1.5 million‑token context window, a breakthrough "de‑slop" UI generation that produces pixel‑perfect designs, dual standard/Pro variants for advanced reasoning and agent workflows, and a rapid June release that fuels an AI arms race with Anthropic, Google and others.

AI UI generationAI competitionGPT-5.6

0 likes · 10 min read

Inside the GPT-5.6 Leak: 1.5M Token Context, Super‑Intelligent Agents, and a UI Revolution