Tag

stability

1 views collected around this technical thread.

JD Tech
JD Tech
Mar 13, 2025 · Operations

Ensuring Stability of the Double 11 Supply‑Chain Dashboard: Full‑Link Process, Risk Points, and Technical Safeguards

This article details how JD Logistics guarantees the stability of its Double 11 supply‑chain dashboard by mapping the entire data‑flow, identifying risk points across ingestion, processing, storage, service, and monitoring layers, and applying targeted technical and organizational safeguards.

Dashboardbig datamonitoring
0 likes · 10 min read
Ensuring Stability of the Double 11 Supply‑Chain Dashboard: Full‑Link Process, Risk Points, and Technical Safeguards
JD Tech Talk
JD Tech Talk
Mar 12, 2025 · Big Data

Ensuring Stability of the Double‑11 Supply Chain Dashboard: Full‑Chain Process, Risk Points, and Technical Safeguard Strategies

This article details how the supply‑chain big‑screen dashboard for Double‑11 maintains high stability by mapping the full data‑flow, identifying risk points across ingestion, processing, storage and service layers, and applying comprehensive technical safeguards such as high‑availability design, fault‑tolerance, monitoring, and coordinated operational procedures.

Dashboardbig datadata pipeline
0 likes · 11 min read
Ensuring Stability of the Double‑11 Supply Chain Dashboard: Full‑Chain Process, Risk Points, and Technical Safeguard Strategies
Baidu Geek Talk
Baidu Geek Talk
Feb 19, 2025 · Frontend Development

Technical Practice of Baidu Live‑Streaming Interactive Framework: Performance and Stability Optimization

Baidu live streaming interactive framework optimized performance and stability for music+red‑packet activities, using component reuse, page pre‑static generation, SSR, ISR, prefetching, view prerender, fallback mechanisms, and animation downgrade, achieving first‑screen load reductions to 0.5 s and delivering a reusable solution for large‑scale live events.

Front-end ArchitectureLive StreamingPerformance Optimization
0 likes · 16 min read
Technical Practice of Baidu Live‑Streaming Interactive Framework: Performance and Stability Optimization
JD Tech Talk
JD Tech Talk
Feb 6, 2025 · Operations

Stability Assurance Mechanisms and Practices for Site Reliability Engineering (SRE)

This article outlines comprehensive stability assurance mechanisms—including standards, process workflows, the distinction between developers and SREs, personal responsibilities, and practical construction directions—to guide teams in building resilient, high‑availability systems through proactive, daily, and incident‑response practices.

Reliability EngineeringSREincident management
0 likes · 10 min read
Stability Assurance Mechanisms and Practices for Site Reliability Engineering (SRE)
JD Tech
JD Tech
Dec 24, 2024 · Backend Development

Stability Challenges and Engineering Solutions for an Inventory Platform

The article analyzes the stability problems faced by an e‑commerce inventory platform—including complex workflows, data accuracy, database hotspots, and high‑frequency calculations—and details a series of backend engineering solutions such as traffic splitting, gray‑release links, Redis caching, consistency checks, async rate limiting, and comprehensive monitoring to improve reliability and performance.

InventoryRedisbackend
0 likes · 13 min read
Stability Challenges and Engineering Solutions for an Inventory Platform
JD Tech Talk
JD Tech Talk
Dec 10, 2024 · Operations

Stability Challenges and Solutions for an Inventory Platform

This article analyzes the stability challenges faced by an e‑commerce inventory platform—including complex business flows, database hotspots, and high‑frequency calculations—and details a series of engineering solutions such as traffic splitting, gray‑release pipelines, Redis caching, consistency checks, throttling, and comprehensive monitoring to improve reliability and performance.

InventoryRedisbackend
0 likes · 14 min read
Stability Challenges and Solutions for an Inventory Platform
Architect
Architect
Nov 28, 2024 · Backend Development

Designing a High‑Performance Message Notification System

This article explains how to design and implement a high‑performance, scalable message notification system, covering service partitioning, system architecture, first‑time and retry message handling, idempotency, dynamic routing, thread‑pool management, stability measures such as traffic surge handling, resource isolation, monitoring, and elastic scaling.

Message Notificationbackend developmenthigh performance
0 likes · 17 min read
Designing a High‑Performance Message Notification System
Efficient Ops
Efficient Ops
Nov 12, 2024 · Operations

How to Build Robust Online Stability: Practices, Metrics, and Team Strategies

This article outlines a comprehensive approach to online stability, covering preventive measures, service governance, capacity planning, incident detection, multi‑dimensional monitoring, alerting, R&D efficiency improvements, team building, and practical guidelines for simplifying, standardizing, automating, and scaling stability initiatives across an organization.

Incident Responseautomationmonitoring
0 likes · 15 min read
How to Build Robust Online Stability: Practices, Metrics, and Team Strategies
Baidu Geek Talk
Baidu Geek Talk
Oct 14, 2024 · Backend Development

Evolution of Baidu Visual Search Architecture: Stack Upgrade, Full‑Link Refactoring, and Stability Enhancements

Baidu Visual Search upgraded its PHP/HHVM stack to Golang, introduced a Backend‑For‑Frontend layer, refactored presentation and system modules with the GDP framework and ExGraph, and built comprehensive monitoring and self‑healing tools, delivering a modular, scalable, and stable AI‑driven search platform.

architecturebackendgolang
0 likes · 13 min read
Evolution of Baidu Visual Search Architecture: Stack Upgrade, Full‑Link Refactoring, and Stability Enhancements
Model Perspective
Model Perspective
Sep 5, 2024 · Fundamentals

Why Balance Matters: Stability Across Physics, Chemistry, Biology, Economics & Psychology

This article examines the interdisciplinary concept of balance, defining it as a system’s stable state, and explores how equilibrium arises and functions in physics, chemistry, biology, economics, and psychology, while questioning whether equilibrium always signifies a desirable or optimal condition.

Interdisciplinarybalanceequilibrium
0 likes · 8 min read
Why Balance Matters: Stability Across Physics, Chemistry, Biology, Economics & Psychology
Architecture and Beyond
Architecture and Beyond
Jul 28, 2024 · Frontend Development

Comprehensive Guide to Front‑End Stability: Observability, Full‑Chain Monitoring, High‑Availability Architecture, Performance Management, Risk Governance, Process Mechanisms, and Engineering Practices

This extensive article presents a systematic approach to front‑end stability, covering observability systems, full‑chain monitoring, high‑availability design, performance management, risk governance, process mechanisms, and engineering practices to ensure reliable user experiences and business continuity.

High AvailabilityObservabilityfrontend
0 likes · 44 min read
Comprehensive Guide to Front‑End Stability: Observability, Full‑Chain Monitoring, High‑Availability Architecture, Performance Management, Risk Governance, Process Mechanisms, and Engineering Practices
Soul Technical Team
Soul Technical Team
Jul 23, 2024 · Big Data

Kafka Stability Challenges and Governance Framework at Soul

This article analyzes the role, application scenarios, stability challenges, and comprehensive governance framework of Apache Kafka at Soul, covering deployment, configuration, monitoring, standard controls, common misuse, and future directions toward cloud‑native solutions.

KafkaStreamingmonitoring
0 likes · 30 min read
Kafka Stability Challenges and Governance Framework at Soul
JD Tech Talk
JD Tech Talk
Jul 3, 2024 · Big Data

Real-time Monitoring Dashboard for Logistics Supply Chain: Architecture, Data Processing, and Stability Practices

This article describes the design and implementation of a high‑availability, real‑time logistics supply‑chain dashboard using Flink and ClickHouse, covering data processing pipelines, metric consistency, stability mechanisms, extensible configurations, and monitoring techniques to guide similar large‑screen projects.

ClickHouseReal-time Dashboardbig data
0 likes · 9 min read
Real-time Monitoring Dashboard for Logistics Supply Chain: Architecture, Data Processing, and Stability Practices
360 Smart Cloud
360 Smart Cloud
Jul 3, 2024 · Operations

Practical Practices for Enhancing Kafka Cluster Stability at 360

This article details 360's comprehensive approach to improving Apache Kafka cluster stability through proactive operations, capacity assessment, parameter tuning, monitoring, version upgrades, and traffic control, offering concrete guidelines and best‑practice recommendations for large‑scale message‑queue deployments.

ClusterKafkaUpgrade
0 likes · 33 min read
Practical Practices for Enhancing Kafka Cluster Stability at 360
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Jun 7, 2024 · Game Development

Game Compatibility Testing: Concepts, Common Issues, and Process

This article explains the concept of game compatibility, outlines typical hardware and UI compatibility problems on mobile devices, and details a comprehensive testing workflow—including preparation, test case design, environment setup, execution, reporting, and issue tracking—to help developers ensure stable, consistent gameplay across diverse platforms.

UI adaptationcompatibility testinggame development
0 likes · 21 min read
Game Compatibility Testing: Concepts, Common Issues, and Process
Cognitive Technology Team
Cognitive Technology Team
May 16, 2024 · Operations

Guide to Building Stability in Distributed Systems

This guide presents comprehensive principles, best practices, and techniques for designing, deploying, and maintaining stable distributed systems, covering fault tolerance, monitoring, capacity planning, incident response, and operational reliability to help engineers achieve high availability.

Reliability Engineeringdistributed systemsfault tolerance
0 likes · 1 min read
Guide to Building Stability in Distributed Systems
IT Services Circle
IT Services Circle
Apr 30, 2024 · Fundamentals

Intel i9-13900K and i9-14900K Stability Issues Linked to Motherboard BIOS Settings

Since 2023 users have reported that Intel's flagship desktop CPUs i9-13900K and i9-14900K suffer stability problems, especially during gaming, which Intel attributes to non‑standard BIOS configurations on 600/700 series motherboards that disable critical thermal and power protections.

BIOSCPUIntel
0 likes · 4 min read
Intel i9-13900K and i9-14900K Stability Issues Linked to Motherboard BIOS Settings
Efficient Ops
Efficient Ops
Apr 14, 2024 · Operations

How to Ensure System Stability and High Availability: An SRE Playbook

This article explains the definitions of stability and high availability, distinguishes their relationship, outlines key performance indicators, and provides a comprehensive framework—including fault prevention, detection, and recovery, as well as design, coding, testing, monitoring, and emergency response practices—to help teams build reliable, highly available systems.

High AvailabilitySREcapacity planning
0 likes · 10 min read
How to Ensure System Stability and High Availability: An SRE Playbook
Wukong Talks Architecture
Wukong Talks Architecture
Apr 4, 2024 · Operations

Cloud Stability Governance: Frontend and Backend Strategies, Deployment, and Monitoring Practices

This article presents a comprehensive view of cloud stability governance from both front‑end and back‑end perspectives, detailing system architecture, micro‑frontend integration, CI/CD deployment pipelines, SLB forwarding and health‑check configurations, monitoring dashboards, UI automation testing, and the resulting operational improvements.

CI/CDDeploymentSLB
0 likes · 13 min read
Cloud Stability Governance: Frontend and Backend Strategies, Deployment, and Monitoring Practices
DataFunTalk
DataFunTalk
Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI EngineeringI/O optimizationcommunication optimization
0 likes · 13 min read
Challenges and Optimization Techniques for Large Language Model Training