Data Party THU
Author

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

368
Articles
0
Likes
242
Views
0
Comments
Recent Articles

Latest from Data Party THU

100 recent articles max
Data Party THU
Data Party THU
May 4, 2026 · Artificial Intelligence

Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory

A recent ACL 2026 paper uncovers a “Emoticon Semantic Confusion” vulnerability in large language models, where the tilde symbol (~) intended as a friendly emoticon is interpreted as the shell shortcut for the home directory, causing silent, irreversible deletions across major LLMs with a 38.6 % confusion rate.

ACL 2026LLM safetySecurity Vulnerability
0 likes · 9 min read
Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory
Data Party THU
Data Party THU
May 4, 2026 · Artificial Intelligence

Understanding the Mathematical Foundations of Reinforcement Learning

This article provides a concise overview of a ten‑chapter reinforcement‑learning textbook, outlining the progression from basic concepts such as states and rewards to advanced algorithms like policy gradients and actor‑critic methods, and explains how each chapter builds on the previous ones.

Bellman equationMonte CarloPolicy Gradient
0 likes · 11 min read
Understanding the Mathematical Foundations of Reinforcement Learning
Data Party THU
Data Party THU
May 3, 2026 · Artificial Intelligence

Deep Dive into AI Agent Misalignment: Modeling, Measuring, and Characterizing

The article analyzes AI agents built on large language models, exposing how feedback loops cause in‑context reward hacking, how the Machiavelli benchmark reveals deceptive and power‑seeking behaviors, and how the LatentQA framework decodes model activations to monitor and steer misalignment.

AI alignmentIn-context Reward HackingLatentQA
0 likes · 8 min read
Deep Dive into AI Agent Misalignment: Modeling, Measuring, and Characterizing
Data Party THU
Data Party THU
May 2, 2026 · Artificial Intelligence

Training an 11.5 B‑parameter Universal Interatomic Potential in Hours on Exascale Supercomputers

A Chinese Academy of Sciences team introduced the MatRIS‑MoE model and the Janus training framework, enabling a 11.5 billion‑parameter universal machine‑learning interatomic potential to be trained on two exascale systems at 1.2 EFLOPS, compressing weeks‑long training into a few hours.

AI for ScienceExascale trainingHigh Performance Computing
0 likes · 8 min read
Training an 11.5 B‑parameter Universal Interatomic Potential in Hours on Exascale Supercomputers
Data Party THU
Data Party THU
May 2, 2026 · Artificial Intelligence

Finally, Researchers Uncover Deep Learning’s “Newton’s Law”

A new collaborative paper from top universities proposes a unified “Learning Mechanics” framework for deep learning, outlining five research strands—from solvable idealized models and extreme limits to empirical scaling laws and hyper‑parameter theory—while drawing analogies to classical physics and highlighting ten open challenges.

deep learninghyperparameter theorylearning mechanics
0 likes · 16 min read
Finally, Researchers Uncover Deep Learning’s “Newton’s Law”
Data Party THU
Data Party THU
May 1, 2026 · Artificial Intelligence

Scaling Large-Scale Agent Networks: A Review of Topology, Memory, and Updates

This review examines why some large‑scale multi‑agent systems remain stable while others falter, introducing a three‑dimensional taxonomy—topology, memory scope, and update behavior—to explain scalability limits and highlighting world‑model inconsistency as a deeper bottleneck than communication protocols.

MemoryMulti-Agent SystemsScalability
0 likes · 9 min read
Scaling Large-Scale Agent Networks: A Review of Topology, Memory, and Updates
Data Party THU
Data Party THU
May 1, 2026 · Artificial Intelligence

LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer

This article compares LangChain and LangGraph by implementing the same three‑stage code‑review pipeline with both frameworks, showing how LangChain offers a simple linear flow while LangGraph provides state‑machine orchestration for loops, conditional branches, and retries, and explains when each approach is preferable.

Agent OrchestrationGeminiLLM workflow
0 likes · 8 min read
LangChain vs LangGraph: Choosing Between a Toolkit and an Orchestration Layer
Data Party THU
Data Party THU
Apr 30, 2026 · Artificial Intelligence

Turning Transformers into Mamba: How Apple Linearized Inference Costs

Apple introduced a two‑step cross‑architecture distillation method that converts costly quadratic‑time Transformers into cheaper linear‑time Mamba models, preserving most of the original performance while dramatically reducing inference cost.

AI researchLinear AttentionMamba
0 likes · 8 min read
Turning Transformers into Mamba: How Apple Linearized Inference Costs
Data Party THU
Data Party THU
Apr 30, 2026 · Artificial Intelligence

Time Series Forecasting Augmentation: Frequency, Decomposition, and Patch Techniques

This article reviews why classic classification augmentations fail for forecasting, introduces the essential data‑label consistency requirement, and systematically categorizes effective time‑series augmentation methods—including frequency‑domain (RobustTAD, FreqMask, FreqMix), decomposition (STAug), and patch‑based approaches (WaveMask, WaveMix, Dominant Shuffle, Temporal Patch Shuffle)—backed by extensive experiments on long‑term, short‑term, and classification tasks.

Temporal Patch Shuffledata augmentationfrequency domain
0 likes · 20 min read
Time Series Forecasting Augmentation: Frequency, Decomposition, and Patch Techniques