Author

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

368

Articles

Likes

242

Views

Comments

Latest from Data Party THU

100 recent articles max

Data Party THU

May 5, 2026 · Artificial Intelligence

How a 23‑Year‑Old Non‑Mathematician and ChatGPT Cracked a 60‑Year‑Old Conjecture

A 23‑year‑old amateur named Liam Price teamed up with ChatGPT to solve the 60‑year‑old Erdős primitive‑sets conjecture in just 80 minutes, using a novel “vibe mathing” approach that bypassed traditional number‑theoretic methods and surprised leading mathematicians including Terence Tao.

AIChatGPTErdős problem

0 likes · 7 min read

How a 23‑Year‑Old Non‑Mathematician and ChatGPT Cracked a 60‑Year‑Old Conjecture

Data Party THU

May 4, 2026 · Artificial Intelligence

Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory

A recent ACL 2026 paper uncovers a “Emoticon Semantic Confusion” vulnerability in large language models, where the tilde symbol (~) intended as a friendly emoticon is interpreted as the shell shortcut for the home directory, causing silent, irreversible deletions across major LLMs with a 38.6 % confusion rate.

ACL 2026LLM safetySecurity Vulnerability

0 likes · 9 min read

Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory

Data Party THU

May 4, 2026 · Artificial Intelligence

Understanding the Mathematical Foundations of Reinforcement Learning

This article provides a concise overview of a ten‑chapter reinforcement‑learning textbook, outlining the progression from basic concepts such as states and rewards to advanced algorithms like policy gradients and actor‑critic methods, and explains how each chapter builds on the previous ones.

Bellman equationMonte CarloPolicy Gradient

0 likes · 11 min read

Understanding the Mathematical Foundations of Reinforcement Learning

Data Party THU

May 3, 2026 · Artificial Intelligence

Deep Dive into AI Agent Misalignment: Modeling, Measuring, and Characterizing

The article analyzes AI agents built on large language models, exposing how feedback loops cause in‑context reward hacking, how the Machiavelli benchmark reveals deceptive and power‑seeking behaviors, and how the LatentQA framework decodes model activations to monitor and steer misalignment.

AI alignmentIn-context Reward HackingLatentQA

0 likes · 8 min read

Deep Dive into AI Agent Misalignment: Modeling, Measuring, and Characterizing

Data Party THU

May 2, 2026 · Artificial Intelligence

Training an 11.5 B‑parameter Universal Interatomic Potential in Hours on Exascale Supercomputers

A Chinese Academy of Sciences team introduced the MatRIS‑MoE model and the Janus training framework, enabling a 11.5 billion‑parameter universal machine‑learning interatomic potential to be trained on two exascale systems at 1.2 EFLOPS, compressing weeks‑long training into a few hours.

AI for ScienceExascale trainingHigh Performance Computing

0 likes · 8 min read

Training an 11.5 B‑parameter Universal Interatomic Potential in Hours on Exascale Supercomputers

Data Party THU

May 2, 2026 · Artificial Intelligence

Finally, Researchers Uncover Deep Learning’s “Newton’s Law”

A new collaborative paper from top universities proposes a unified “Learning Mechanics” framework for deep learning, outlining five research strands—from solvable idealized models and extreme limits to empirical scaling laws and hyper‑parameter theory—while drawing analogies to classical physics and highlighting ten open challenges.

deep learninghyperparameter theorylearning mechanics

0 likes · 16 min read

Finally, Researchers Uncover Deep Learning’s “Newton’s Law”

Data Party THU

May 1, 2026 · Artificial Intelligence

Scaling Large-Scale Agent Networks: A Review of Topology, Memory, and Updates

This review examines why some large‑scale multi‑agent systems remain stable while others falter, introducing a three‑dimensional taxonomy—topology, memory scope, and update behavior—to explain scalability limits and highlighting world‑model inconsistency as a deeper bottleneck than communication protocols.

MemoryMulti-Agent SystemsScalability

0 likes · 9 min read

Scaling Large-Scale Agent Networks: A Review of Topology, Memory, and Updates

Data Party THU

May 1, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer

This article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with both frameworks, showing how LangChain offers a simple linear flow while LangGraph provides state‑machine orchestration for loops, conditional branches, and retries, and explains when each approach is preferable.

Agent OrchestrationGeminiLLM workflow

0 likes · 8 min read

LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer

Data Party THU

Apr 30, 2026 · Artificial Intelligence

Turning Transformers into Mamba: How Apple Linearized Inference Costs

Apple introduced a two‑step cross‑architecture distillation method that converts costly quadratic‑time Transformers into cheaper linear‑time Mamba models, preserving most of the original performance while dramatically reducing inference cost.

AI researchLinear AttentionMamba

0 likes · 8 min read

Turning Transformers into Mamba: How Apple Linearized Inference Costs

Data Party THU

Apr 30, 2026 · Artificial Intelligence

Time Series Forecasting Augmentation: Frequency, Decomposition, and Patch Techniques

This article reviews why classic classification augmentations fail for forecasting, introduces the essential data‑label consistency requirement, and systematically categorizes effective time‑series augmentation methods—including frequency‑domain (RobustTAD, FreqMask, FreqMix), decomposition (STAug), and patch‑based approaches (WaveMask, WaveMix, Dominant Shuffle, Temporal Patch Shuffle)—backed by extensive experiments on long‑term, short‑term, and classification tasks.

Temporal Patch Shuffledata augmentationfrequency domain

0 likes · 20 min read

Time Series Forecasting Augmentation: Frequency, Decomposition, and Patch Techniques