Author

DeepHub IMBA

A must‑follow public account sharing practical AI insights. Follow now. internet + machine learning + big data + architecture = IMBA

Articles

Likes

Views

Comments

Latest from DeepHub IMBA

60 recent articles

DeepHub IMBA

Mar 8, 2026 · Artificial Intelligence

MIT Study: How Self‑Generated History Pollutes LLM Context and Degrades Multi‑Turn Chats

An MIT paper reveals that storing a language model’s own prior replies—known as context pollution—significantly lengthens the dialogue context while offering little quality benefit, with up to a ten‑fold reduction in tokens and comparable responses for about 70% of turns, especially in open‑source models.

AI agentsLLMMIT study

0 likes · 11 min read

MIT Study: How Self‑Generated History Pollutes LLM Context and Degrades Multi‑Turn Chats

DeepHub IMBA

Mar 7, 2026 · Artificial Intelligence

From AutoGen v0.4 to Microsoft Agent Framework: A Complete Architectural Evolution

This article traces the rise of Microsoft AutoGen, explains its core design and v0.4 architecture, showcases code examples and benchmark results, examines its limitations, and details the transition to the Microsoft Agent Framework and its current state in 2026.

AutoGenGroupChatLLM multi-agent

0 likes · 16 min read

From AutoGen v0.4 to Microsoft Agent Framework: A Complete Architectural Evolution

DeepHub IMBA

Mar 6, 2026 · Artificial Intelligence

New March 2026 Paper Exposes Fraudulent Third‑Party APIs for Large Language Models

A recent arXiv study audited 17 popular shadow APIs used in 187 papers, finding up to a 47.21% performance gap versus official models—e.g., Gemini‑2.5‑flash’s accuracy drops from 83.82% to about 37% on MedQA—highlighting serious reliability and safety risks of unofficial LLM services.

AI safetyPerformance Evaluationlarge language models

0 likes · 3 min read

New March 2026 Paper Exposes Fraudulent Third‑Party APIs for Large Language Models

DeepHub IMBA

Mar 6, 2026 · Fundamentals

Measuring Multivariate Distribution Differences with Energy Distance

Energy Distance is a statistical metric that quantifies how far two multivariate probability distributions diverge by comparing cross‑distribution and within‑distribution Euclidean distances, and it can be combined with permutation testing to assess the significance of observed shifts.

Energy Distancedata driftdistribution comparison

0 likes · 6 min read

Measuring Multivariate Distribution Differences with Energy Distance

DeepHub IMBA

Mar 6, 2026 · Artificial Intelligence

Shadow APIs vs Official LLMs: Up to 47% Performance Gap Revealed in New Study

A recent arXiv paper audits 17 widely used shadow APIs, showing that their outputs can deviate from official large language model APIs by as much as 47.21%, with accuracy on the MedQA benchmark dropping from 83.82% to around 37%, raising serious reliability concerns.

AI safetyPerformance Evaluationlarge language models

0 likes · 3 min read

Shadow APIs vs Official LLMs: Up to 47% Performance Gap Revealed in New Study

DeepHub IMBA

Mar 4, 2026 · Fundamentals

Deriving Bayes’ Theorem: How Joint Probability Symmetry Reveals Conditional Reversal

The article walks through a simple two‑box, four‑ball example to illustrate basic probability, conditional probability, joint probability, and then reverses the conditioning to derive Bayes’ theorem, showing each step with concrete calculations and visual partitions of the sample space.

Bayes theoremconditional probabilityjoint probability

0 likes · 9 min read

Deriving Bayes’ Theorem: How Joint Probability Symmetry Reveals Conditional Reversal

DeepHub IMBA

Mar 3, 2026 · Artificial Intelligence

The Evolution of KV Cache Management: From Continuous Allocation to Unified Hybrid Memory Architecture

The article traces five eras of KV cache management for LLM inference—from its absence before Transformers to the emerging unified hybrid memory architecture—comparing vLLM, SGLang, and TensorRT‑LLM and offering a decision framework for selecting the right solution in various deployment scenarios.

KV CacheLLM inferencePagedAttention

0 likes · 16 min read

The Evolution of KV Cache Management: From Continuous Allocation to Unified Hybrid Memory Architecture

DeepHub IMBA

Mar 2, 2026 · Fundamentals

PhysioDSP: A Python Library for Wearable Physiological Signal Processing

PhysioDSP is an open‑source Python library that unifies fragmented wearable signal‑processing pipelines by providing type‑safe, reproducible algorithms for activity analysis, ECG peak detection, and HRV scoring, with a modular architecture and ready‑to‑use data models.

ECGHRVOpen Source

0 likes · 6 min read

PhysioDSP: A Python Library for Wearable Physiological Signal Processing

DeepHub IMBA

Mar 1, 2026 · Artificial Intelligence

Demystifying VAE: From Probabilistic Encoding to Latent Space Regularization

This article walks through the fundamentals of variational autoencoders, explaining why they are needed, detailing their three core components, loss formulation, PyTorch implementation, training loop, and multiple inference modes such as anomaly detection, data generation, conditional generation, latent space manipulation, and data imputation.

Anomaly DetectionConditional VAEGenerative Models

0 likes · 15 min read

Demystifying VAE: From Probabilistic Encoding to Latent Space Regularization

DeepHub IMBA

Feb 28, 2026 · Artificial Intelligence

Why Energy‑Based Models Could Outperform Probabilistic LLMs, According to Yann LeCun

Yann LeCun argues that the probability‑driven, token‑by‑token design of current large language models may never reach human‑level intelligence, and explains how Energy‑Based Models replace probability distributions with an energy function, offering more flexible training, inference, and multi‑modal capabilities.

Contrastive DivergenceDensity EstimationEBM

0 likes · 23 min read

Why Energy‑Based Models Could Outperform Probabilistic LLMs, According to Yann LeCun