Why Huawei’s Ascend 950 PR and DT Have Different Names – The Technical Rationale

Huawei’s Ascend 950 series splits a single die into two variants—PR (Prefill & Recommendation) optimized for compute‑intensive inference with low cost, and DT (Decode & Training) tuned for memory‑bandwidth‑heavy generation and training—illustrating a scenario‑driven, P/D‑separated architecture that maximizes efficiency.

AI ChipAscend 950Decode

0 likes · 5 min read

Why Huawei’s Ascend 950 PR and DT Have Different Names – The Technical Rationale

Architect

Jul 2, 2024 · Artificial Intelligence

Mooncake: A Separated Architecture for Large‑Language‑Model Inference

The article presents Mooncake, a split‑architecture inference platform for the Kimi LLM assistant, detailing its three elastic resource pools, the rationale for using Time‑Between‑Tokens over TPOT, and design choices for Prefill, KVCache, and Decode stages to improve latency and throughput.

AI SystemsDecodeKVCache

0 likes · 9 min read

Mooncake: A Separated Architecture for Large‑Language‑Model Inference

Python Crawling & Data Mining

Oct 5, 2022 · Backend Development

How to Decode URL‑Encoded Strings in Python Web Scraping

An in‑depth guide shows how to decode URL‑encoded strings encountered during Python web scraping, explains the difference between two encoding formats, and provides ready‑to‑run urllib code that prints the original Chinese characters, helping developers troubleshoot similar crawling issues.

DecodeURL encodingWeb Scraping

0 likes · 4 min read

How to Decode URL‑Encoded Strings in Python Web Scraping