Tagged articles
4063 articles
Page 20 of 41
Tencent Cloud Developer
Tencent Cloud Developer
Dec 7, 2022 · Cloud Native

Kubernetes Architecture Analysis: Design Patterns, Principles and Implementation

The article examines Kubernetes architecture from a software‑design viewpoint, showing how its declarative API and extensible ecosystem outpace Swarm and Mesos, and detailing core concepts, control‑plane components, identified design patterns such as microkernel, event‑driven and CQRS, key architectural decisions, and the resulting strengths and trade‑offs.

Control PlaneEvent-drivenK8s Architecture
0 likes · 13 min read
Kubernetes Architecture Analysis: Design Patterns, Principles and Implementation
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Dec 7, 2022 · Cloud Native

How to Scale Kubernetes to 5,000 Nodes: Master, API Server, and Component Tuning

This guide explains how to push a Kubernetes cluster toward its theoretical limit of 5,000 nodes by detailing official limits, master node sizing for GCE and AWS, kube‑apiserver high‑availability and connection‑count tuning, scheduler and controller‑manager leader election settings, kubelet optimizations, and DNS anti‑affinity configuration.

OperationsPerformance tuningcloud-native
0 likes · 6 min read
How to Scale Kubernetes to 5,000 Nodes: Master, API Server, and Component Tuning
Architect's Guide
Architect's Guide
Dec 5, 2022 · Cloud Native

Step-by-Step Guide to Deploying a High‑Availability Kubernetes Cluster with NFS, Ingress, Dashboard, and Harbor

This comprehensive tutorial walks through preparing the operating system, installing Docker and containerd, configuring yum repositories, initializing a multi‑master HA Kubernetes cluster with IPVS, deploying the Kubernetes dashboard, setting up NFS storage, installing an Ingress controller, and finally installing Harbor with Helm and a custom NFS provisioner, providing all necessary commands and configuration files.

DockerHAHarbor
0 likes · 38 min read
Step-by-Step Guide to Deploying a High‑Availability Kubernetes Cluster with NFS, Ingress, Dashboard, and Harbor
ITPUB
ITPUB
Dec 4, 2022 · Cloud Native

How Qunar Scaled Container Monitoring with VictoriaMetrics: A Cloud‑Native Case Study

This article details Qunar's migration from a Prometheus‑based monitoring stack to VictoriaMetrics, describing the limitations they faced, the architectural redesign using vmagent, vmcluster, and vmalert, and the resulting performance improvements and operational benefits for large‑scale Kubernetes environments.

PrometheusTime Series DatabaseVictoriaMetrics
0 likes · 14 min read
How Qunar Scaled Container Monitoring with VictoriaMetrics: A Cloud‑Native Case Study
Efficient Ops
Efficient Ops
Dec 1, 2022 · Operations

Why Choose Loki Over ELK? A Hands‑On Guide to Deploying and Using Grafana Loki

This article explains the motivations for selecting Grafana Loki instead of ELK/EFK, introduces its core concepts and features, provides step‑by‑step deployment instructions for Promtail and Loki, and demonstrates how to configure Grafana, query logs, and handle label indexing, dynamic tags, and high‑cardinality challenges.

GrafanaLokiObservability
0 likes · 15 min read
Why Choose Loki Over ELK? A Hands‑On Guide to Deploying and Using Grafana Loki
Cloud Native Technology Community
Cloud Native Technology Community
Dec 1, 2022 · Cloud Native

Integrating OpenStack and Kubernetes Networks with Kube-OVN: Cluster Interconnect and Shared OVN Modes

This guide explains how to use Kube-OVN to connect OpenStack virtual machines and Kubernetes containers by configuring cluster interconnect or shared OVN modes, covering prerequisites, OVN‑IC database deployment, Kubernetes and OpenStack side settings, and example manifests for creating Pods in OpenStack subnets.

Kube-OVNOVNOpenStack
0 likes · 11 min read
Integrating OpenStack and Kubernetes Networks with Kube-OVN: Cluster Interconnect and Shared OVN Modes
Huolala Tech
Huolala Tech
Dec 1, 2022 · Cloud Computing

How to Master Spot Instances for Cost‑Effective Cloud Scaling

This article explains what Spot (preemptible) instances are, compares them with on‑demand and reserved instances, details AWS Spot pricing and signals, and provides practical strategies—including node‑group design, Kubernetes scheduling, health checks, and rollback plans—to reliably reduce cloud costs while maintaining application availability.

Cloud Computingawscost optimization
0 likes · 22 min read
How to Master Spot Instances for Cost‑Effective Cloud Scaling
Efficient Ops
Efficient Ops
Nov 30, 2022 · Cloud Native

How kubectl top Retrieves Real‑Time Metrics in Kubernetes: A Deep Dive

This article explains how the kubectl top command gathers real‑time CPU and memory usage for nodes and pods, details the underlying data flow and metric API implementation in Kubernetes, compares heapster and metrics‑server, and addresses common troubleshooting scenarios.

HeapstercAdvisorkubectl top
0 likes · 15 min read
How kubectl top Retrieves Real‑Time Metrics in Kubernetes: A Deep Dive
Qunar Tech Salon
Qunar Tech Salon
Nov 29, 2022 · Cloud Native

Qunar’s Experience Replacing Prometheus with VictoriaMetrics for Cloud‑Native Container Monitoring

This article details Qunar’s migration from a traditional Prometheus‑based monitoring stack to VictoriaMetrics, describing the challenges of large‑scale container metrics collection, the architectural redesign using VM‑Cluster, vmagent, and vmalert, and the performance improvements achieved after full replacement.

PrometheusTime Series DatabaseVictoriaMetrics
0 likes · 14 min read
Qunar’s Experience Replacing Prometheus with VictoriaMetrics for Cloud‑Native Container Monitoring
Alibaba Cloud Native
Alibaba Cloud Native
Nov 27, 2022 · Cloud Native

How HeLian Health Scaled Its Low‑Fault Medical Platform with Cloud‑Native Microservices

HeLian Health’s journey from a fast‑iteration monolith to a stable, cloud‑native microservice architecture illustrates how modular design, Dubbo‑Nacos integration, Kubernetes, and Alibaba Cloud MSE can overcome low‑fault tolerance challenges in the medical check‑up domain while boosting release efficiency and reducing operational costs.

MSENacoscloud-native
0 likes · 10 min read
How HeLian Health Scaled Its Low‑Fault Medical Platform with Cloud‑Native Microservices
ITPUB
ITPUB
Nov 25, 2022 · Big Data

How Berserker’s Big Data Platform Solved Scheduling, State and Scaling Challenges

This article details the architecture, evolution, and technical solutions of the Berserker big‑data platform—including component design, state‑management problems, release strategies, two‑phase commit, RPC handling, routing, message queuing, containerized execution, dependency model redesign, and future roadmap—demonstrating how the system achieved high availability, low latency, and scalable operations.

Data PlatformDockerRaft
0 likes · 19 min read
How Berserker’s Big Data Platform Solved Scheduling, State and Scaling Challenges
Tencent Cloud Developer
Tencent Cloud Developer
Nov 24, 2022 · Cloud Native

Large‑Scale Cost Optimization for Kubernetes/TKE: Data Collection, Measures, and Implementation

The article details a Tencent‑led, end‑to‑end cost‑optimization project for large‑scale Kubernetes/TKE clusters that collected extensive workload metrics, applied VPA/HPA enhancements, custom scheduling and node‑downscaling via the open‑source Crane platform, ultimately delivering up to 70% CPU and 50% memory savings with zero‑fault deployments.

AutoscalingVPAhpa
0 likes · 29 min read
Large‑Scale Cost Optimization for Kubernetes/TKE: Data Collection, Measures, and Implementation
dbaplus Community
dbaplus Community
Nov 23, 2022 · Operations

Choosing the Right Kubernetes Monitoring Stack: Tools & Best Practices

Monitoring Kubernetes clusters is essential for visibility and scalability, but selecting the right tools can be complex; this article outlines best‑practice approaches and compares popular open‑source solutions such as Prometheus, Grafana, Thanos, Elasticsearch, Logstash, and Kibana, helping you build an effective monitoring stack.

GrafanaPrometheuskubernetes
0 likes · 8 min read
Choosing the Right Kubernetes Monitoring Stack: Tools & Best Practices
MaGe Linux Operations
MaGe Linux Operations
Nov 22, 2022 · Operations

Why Kubernetes Pods Fail with “Resource temporarily unavailable” – Understanding PID Limits

This article analyzes a Kubernetes‑Docker environment where Java pods encounter “fork: Resource temporarily unavailable” errors, tracing the issue through kernel event logs, ulimit settings, PID limits, and related sysctl parameters, and provides detailed recommendations for kernel and user‑level configuration to prevent such failures.

DockerLinux kernelPID limits
0 likes · 23 min read
Why Kubernetes Pods Fail with “Resource temporarily unavailable” – Understanding PID Limits
Alibaba Cloud Native
Alibaba Cloud Native
Nov 21, 2022 · Cloud Computing

What’s New in Open Cluster Management v0.9.0 and Future Roadmap?

The Open Cluster Management (OCM) project releases v0.9.0 with enhanced security, service exposure, plugin configurability, API updates, and outlines upcoming features like independent control‑plane deployment, ArgoCD pull‑mode integration, placement strategies, and edge‑scenario support.

ArgoCDCluster‑ProxyOpen Cluster Management
0 likes · 10 min read
What’s New in Open Cluster Management v0.9.0 and Future Roadmap?
Tencent Cloud Developer
Tencent Cloud Developer
Nov 21, 2022 · Operations

DSP Algorithm Principles and Application in Crane Predictive Autoscaling

The article details how Tencent Cloud’s Crane predictive autoscaling leverages a digital signal processing pipeline—transforming Prometheus time‑series data via Fourier analysis, using DFT/FFT to identify periodic patterns, predict load with IFFT, and configure margins, thresholds, and spectrum parameters to overcome HPA’s reactive limits.

Cloud Native AutoscalingCraneFast Fourier Transform
0 likes · 16 min read
DSP Algorithm Principles and Application in Crane Predictive Autoscaling
Top Architect
Top Architect
Nov 21, 2022 · Cloud Native

Understanding Underlay and Overlay Network Models in Kubernetes

This article provides a comprehensive overview of Kubernetes networking by explaining the concepts of underlay and overlay network models, describing their implementations such as flannel host‑gw, Calico BGP, IPVLAN/MACVLAN, and tunneling technologies like VxLAN and IPIP, and offering practical references for each approach.

CNICalicoFlannel
0 likes · 14 min read
Understanding Underlay and Overlay Network Models in Kubernetes
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Nov 21, 2022 · Cloud Native

Zero‑Touch Kubernetes Deployment: A Minimalist, Automated Solution

This article analyzes the challenges of traditional Kubernetes cluster installation, proposes a fully automated, zero‑touch deployment framework that abstracts and packages all required resources, and explores its architecture, implementation using Go, cloud‑edge collaboration with KubeEdge, and future extensions toward CRD‑based operators.

CRDGocloud-native
0 likes · 22 min read
Zero‑Touch Kubernetes Deployment: A Minimalist, Automated Solution
Cloud Native Technology Community
Cloud Native Technology Community
Nov 21, 2022 · Cloud Native

How to Replace Istio Ingress Gateway with Kubernetes Gateway API – Step‑by‑Step Guide

This tutorial explains the background of the Kubernetes Gateway API, compares it with the traditional Ingress API, and provides a hands‑on walkthrough for installing the Gateway API CRDs, configuring Istio to use a Gateway resource, exposing a sample httpbin service, updating routes, and cleaning up the environment.

Gateway APIIstioService Mesh
0 likes · 12 min read
How to Replace Istio Ingress Gateway with Kubernetes Gateway API – Step‑by‑Step Guide
Dada Group Technology
Dada Group Technology
Nov 18, 2022 · Artificial Intelligence

JD Daojia Machine Learning Platform: Architecture and Implementation

This article introduces JD Daojia's machine learning platform, detailing its architecture, implementation principles, and practical applications in various business scenarios, achieving significant improvements in recommendation and search systems.

Deep LearningFeature EngineeringMachine Learning Platform
0 likes · 28 min read
JD Daojia Machine Learning Platform: Architecture and Implementation
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Nov 18, 2022 · Cloud Native

Top 10 Common Kubernetes Issues and How to Fix Them

This guide walks through ten frequent Kubernetes problems—from service access failures and port‑mapping errors to image pull issues and CrashLoopBackOff—explaining root causes, showing exact error messages, and providing concise command‑line solutions to restore cluster health.

DevOpsTroubleshootingcloud-native
0 likes · 6 min read
Top 10 Common Kubernetes Issues and How to Fix Them
Open Source Linux
Open Source Linux
Nov 18, 2022 · Cloud Native

Understanding Kubernetes Underlay and Overlay Network Models: A Deep Dive

This article provides a comprehensive overview of Kubernetes networking, explaining the concepts of underlay and overlay network models, their implementations with Flannel, Calico, IPVLAN, MACVLAN, Multus, DANM, and common tunneling protocols such as VxLAN, IPIP, and GRE, while highlighting practical configuration notes and architectural differences.

CNICalicoFlannel
0 likes · 14 min read
Understanding Kubernetes Underlay and Overlay Network Models: A Deep Dive
360 Smart Cloud
360 Smart Cloud
Nov 17, 2022 · Databases

Exploring StarRocks Applications, Performance Tests, and Cloud‑Native Integration at 360

This article reviews the practical applications and experimental explorations of StarRocks at 360, describing the cloud‑native lake‑warehouse product Yunzhou, its three‑tier architecture, performance comparisons with Trino using TPCH 100 GB, challenges of Kubernetes integration, and future directions for storage‑compute separation.

Big DataData WarehouseOLAP
0 likes · 7 min read
Exploring StarRocks Applications, Performance Tests, and Cloud‑Native Integration at 360
Tencent Cloud Developer
Tencent Cloud Developer
Nov 16, 2022 · Cloud Native

Prometheus Monitoring Practices for Tencent Happy Dou Dizhu Game

Tencent transformed its popular Happy Dou Dizhu game’s monitoring by migrating to Tencent Cloud Managed Prometheus and Grafana, unifying metric naming, consolidating ServiceMonitors, defining dashboards as code, and avoiding high‑cardinality labels, which cut labor costs by over 30% and greatly improved operational efficiency.

GrafanaPrometheusTencent Cloud
0 likes · 11 min read
Prometheus Monitoring Practices for Tencent Happy Dou Dizhu Game
Cloud Native Technology Community
Cloud Native Technology Community
Nov 16, 2022 · Cloud Native

How Do Packets Travel Inside and Outside Kubernetes? A Deep Dive into Pods, Network Namespaces, and CNI

This article explains how Kubernetes forwards packets from the initial web request through container networking, covering the network model, pod creation steps, the role of the pause container, same‑node and cross‑node pod‑to‑pod traffic, service IP translation, and the underlying CNI, iptables, and conntrack mechanisms.

CNINetwork NamespacePod Communication
0 likes · 28 min read
How Do Packets Travel Inside and Outside Kubernetes? A Deep Dive into Pods, Network Namespaces, and CNI
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 16, 2022 · Cloud Native

SREWorks v1.3 Unveiled: Plugin Architecture, UI Enhancements & Cloud‑Native Ops

Version 1.3 of SREWorks introduces a modular plugin system with independent plugin packages, visual configuration UI, standardized component definition workflow, upgraded application mechanisms separating enterprise and ops apps, offline app packages, remote UMD component loading, new front‑end widgets, and a Grafana‑based streaming job monitoring dashboard.

GrafanaSREWorkscloud-native operations
0 likes · 12 min read
SREWorks v1.3 Unveiled: Plugin Architecture, UI Enhancements & Cloud‑Native Ops
DevOps Cloud Academy
DevOps Cloud Academy
Nov 13, 2022 · Cloud Native

Grafana Phlare: Open‑Source Continuous Profiling Database – Architecture, Features, and Kubernetes Deployment Guide

Grafana Phlare is an open‑source, horizontally scalable continuous profiling database that integrates with Grafana, offering easy installation, multi‑tenant support, and object‑storage‑backed long‑term storage, with detailed deployment instructions for both monolithic and micro‑service modes on Kubernetes using Helm.

Continuous ProfilingGrafanaObservability
0 likes · 11 min read
Grafana Phlare: Open‑Source Continuous Profiling Database – Architecture, Features, and Kubernetes Deployment Guide
MaGe Linux Operations
MaGe Linux Operations
Nov 13, 2022 · Cloud Native

Mastering Kubernetes CNI: Choose the Right Network Plugin for Your Cloud‑Native Apps

This article introduces common container network scenarios and explains how various Kubernetes CNI plugins—Kube‑OVN, Antrea, Cilium, Calico, Flannel, Submariner, and others—implement these functions, guiding users on selecting and deploying the appropriate plugin with tools like Kubekey for multi‑cluster and policy needs.

CNICloudNativeKubekey
0 likes · 20 min read
Mastering Kubernetes CNI: Choose the Right Network Plugin for Your Cloud‑Native Apps
Tencent Cloud Middleware
Tencent Cloud Middleware
Nov 10, 2022 · Cloud Native

How We Scaled Apache Pulsar on Kubernetes for WeChat’s Billion‑User Real‑Time Recommendations

This article details the WeChat engineering team’s practical experience deploying and optimizing Apache Pulsar on Kubernetes for massive real‑time recommendation workloads, covering cloud‑native advantages, non‑persistent topics, load‑balancing tweaks, broker cache improvements, COS offloader development, and future roadmap.

Apache Pulsarcloud-nativekubernetes
0 likes · 13 min read
How We Scaled Apache Pulsar on Kubernetes for WeChat’s Billion‑User Real‑Time Recommendations
Alibaba Cloud Native
Alibaba Cloud Native
Nov 9, 2022 · Cloud Native

13 Common Kubernetes Pod Failures and How to Diagnose Them

This article outlines the Kubernetes pod lifecycle, describes the five pod phases, enumerates 13 typical failure scenarios—including scheduling, image pull, dependency, init container, probe, and OOM issues—provides error states, root causes, and step‑by‑step kubectl commands for diagnosis and remediation.

OperationsPodTroubleshooting
0 likes · 22 min read
13 Common Kubernetes Pod Failures and How to Diagnose Them
Cloud Native Technology Community
Cloud Native Technology Community
Nov 9, 2022 · Cloud Native

How Sidecars Are Revolutionizing Load Testing in Kubernetes

The article explains why traditional load testing struggles with modern, scalable infrastructures and shows how integrating sidecar containers into Kubernetes can simplify traffic capture, enrich replay data, reduce operational overhead, and enable more realistic, automated load‑testing pipelines.

DevOpsTraffic Capturecloud-native
0 likes · 8 min read
How Sidecars Are Revolutionizing Load Testing in Kubernetes
Ops Development Stories
Ops Development Stories
Nov 9, 2022 · Operations

How Grafana Phlare Enables Scalable Continuous Profiling for Cloud‑Native Environments

Grafana Phlare is an open‑source, horizontally scalable continuous profiling database that integrates with Grafana, offering easy installation, multi‑tenant support, cheap object‑storage persistence, and both monolithic and microservice deployment modes, with detailed Helm‑based Kubernetes setup and usage instructions.

Continuous Profilingcloud-nativehelm
0 likes · 14 min read
How Grafana Phlare Enables Scalable Continuous Profiling for Cloud‑Native Environments
Open Source Linux
Open Source Linux
Nov 7, 2022 · Cloud Native

Unlock Scalable Cloud‑Native Alerting with Grafana Mimir: Architecture & Setup

This article explains the current state of cloud‑native alerting, introduces Grafana Mimir as a horizontally scalable, multi‑tenant storage for Prometheus, details its architecture and components, and provides step‑by‑step guidance for installing, configuring, and operating Mimir in Kubernetes environments.

MimirObservabilityPrometheus
0 likes · 24 min read
Unlock Scalable Cloud‑Native Alerting with Grafana Mimir: Architecture & Setup
Cloud Native Technology Community
Cloud Native Technology Community
Nov 7, 2022 · Cloud Computing

How Edge Computing Is Transforming Automotive Manufacturing

This article explores how edge computing, combined with cloud-native technologies, 5G, and GPU acceleration, enables real‑time data processing, intelligent inspection, digital twins, and autonomous driving in the automotive industry, outlining practical architectures, hardware choices, and deployment patterns.

5GAutomotiveGPU Acceleration
0 likes · 19 min read
How Edge Computing Is Transforming Automotive Manufacturing
Programmer DD
Programmer DD
Nov 7, 2022 · Cloud Native

Kubernetes Multi‑Tenant Isolation: Strategies, Tools, and How to Choose

This article explains how Kubernetes can be configured for multi‑tenant isolation using control‑plane mechanisms like namespaces, RBAC, and quotas, data‑plane techniques such as container runtimes, storage policies, and network policies, and compares namespace‑based tenancy with virtual control‑plane and multi‑cluster approaches.

Multi‑TenantVirtualizationcloud-native
0 likes · 11 min read
Kubernetes Multi‑Tenant Isolation: Strategies, Tools, and How to Choose
Top Architect
Top Architect
Nov 7, 2022 · Cloud Native

Step‑by‑Step Deployment of a Highly Available Kubernetes Cluster with Nginx/Keepalived Load Balancer, Flannel CNI, IPVS, Dashboard, and Harbor Registry

This comprehensive guide walks you through installing Docker and containerd, configuring yum repositories, setting up kubeadm/kubelet/kubectl, initializing a multi‑master Kubernetes cluster, enabling Flannel CNI and IPVS, building a Nginx‑Keepalived HA load balancer, deploying the Kubernetes dashboard, configuring NFS storage with a dynamic provisioner, and installing a secure Harbor image registry for private images.

FlannelHarborNFS
0 likes · 44 min read
Step‑by‑Step Deployment of a Highly Available Kubernetes Cluster with Nginx/Keepalived Load Balancer, Flannel CNI, IPVS, Dashboard, and Harbor Registry
MaGe Linux Operations
MaGe Linux Operations
Nov 6, 2022 · Cloud Native

How to Safely Shut Down and Restart a Kubernetes Cluster

This guide walks you through the essential steps, commands, and precautions for safely draining nodes, backing up applications, CRDs, and etcd, then shutting down and later restarting a Kubernetes cluster while avoiding common pitfalls.

Cluster MaintenanceOperationsbackup
0 likes · 6 min read
How to Safely Shut Down and Restart a Kubernetes Cluster
Alibaba Cloud Native
Alibaba Cloud Native
Nov 4, 2022 · Cloud Native

How Koordinator Revolutionizes Cloud‑Native Mixed‑Workload Scheduling

Koordinator, an open‑source cloud‑native scheduler launched in April 2022, unifies heterogeneous workloads on Kubernetes through zero‑intrusion plugins, fine‑grained resource oversubscription, QoS‑aware scheduling, and a flexible descheduler framework, dramatically improving resource utilization and latency‑sensitive service performance.

Mixed WorkloadSchedulercloud-native
0 likes · 22 min read
How Koordinator Revolutionizes Cloud‑Native Mixed‑Workload Scheduling
Top Architect
Top Architect
Nov 4, 2022 · Cloud Native

Step-by-Step Guide to Deploy a High‑Availability Kubernetes Cluster with Dashboard, Nginx/Keepalived, NFS, Harbor, and Ingress

This comprehensive tutorial walks through preparing hosts, installing Docker and containerd, setting up Kubernetes components, initializing a HA master cluster, configuring networking, deploying the Kubernetes dashboard, NFS storage, Harbor registry, and an Nginx/Keepalived load balancer, all with detailed commands and configuration files.

DockerHarborNFS
0 likes · 41 min read
Step-by-Step Guide to Deploy a High‑Availability Kubernetes Cluster with Dashboard, Nginx/Keepalived, NFS, Harbor, and Ingress
Architecture Digest
Architecture Digest
Nov 4, 2022 · Cloud Native

Containerization Journey at Ximalaya: Practices, Tools, and Lessons Learned

This article recounts Ximalaya’s evolution from early Docker adoption to a mature cloud‑native deployment platform, detailing principles, custom tools such as barge and k8s‑sync, health‑check strategies, multi‑process management, and integration with existing middleware to achieve reliable, zero‑downtime service releases.

ContainerizationDevOpscloud-native
0 likes · 18 min read
Containerization Journey at Ximalaya: Practices, Tools, and Lessons Learned
Open Source Linux
Open Source Linux
Nov 3, 2022 · Cloud Native

Why kt-connect Beats Telepresence for Kubernetes Local Debugging

This article traces the evolution of local debugging from simple Tomcat setups to Kubernetes, compares proxy tools like port‑forwarding, VPN, Telepresence and kt‑connect, and explains kt‑connect's four operation modes with practical command examples for developers.

DockerTelepresencekt-connect
0 likes · 13 min read
Why kt-connect Beats Telepresence for Kubernetes Local Debugging
Architect
Architect
Nov 3, 2022 · Cloud Native

Step-by-Step Guide to Deploying a High‑Availability Kubernetes Cluster with Dashboard and Harbor

This comprehensive tutorial walks through preparing multiple Linux nodes, installing Docker and containerd, setting up kubeadm, kubelet and kubectl, initializing a HA Kubernetes control plane, configuring Flannel networking, deploying the Kubernetes dashboard, installing Nginx + Keepalived for load balancing, setting up NFS with rsync, provisioning storage via Helm, and finally installing a secure Harbor image registry, all with detailed commands and configuration snippets.

HAHarborNFS
0 likes · 39 min read
Step-by-Step Guide to Deploying a High‑Availability Kubernetes Cluster with Dashboard and Harbor
DataFunSummit
DataFunSummit
Nov 3, 2022 · Artificial Intelligence

Applying NVIDIA MPS to Boost GPU Utilization for Recommendation Inference

This article explains why traditional CPU inference and naïve GPU usage are inefficient for recommendation workloads, introduces NVIDIA Multi‑Process Service (MPS) technology, describes VIVO's custom Rust‑based inference engine and deployment strategies, and presents performance and cost benefits along with practical deployment considerations.

GPU inferenceMPSRecommendation Systems
0 likes · 13 min read
Applying NVIDIA MPS to Boost GPU Utilization for Recommendation Inference
Efficient Ops
Efficient Ops
Nov 2, 2022 · Cloud Native

Why Your HPA Isn’t Scaling: 3 Common Misconceptions and How to Fix Them

This article explains three frequent misunderstandings about Kubernetes Horizontal Pod Autoscaler—dead zones, misuse of utilization calculations, and perceived lag in scaling—while detailing HPA’s inner workings, metric sources, calculation methods, and behavior configuration to help you avoid scaling pitfalls.

Autoscalinghpakubernetes
0 likes · 12 min read
Why Your HPA Isn’t Scaling: 3 Common Misconceptions and How to Fix Them
DevOps Cloud Academy
DevOps Cloud Academy
Nov 2, 2022 · Cloud Native

An Overview of CI/CD Pipelines with Kubernetes

This article examines CI/CD approaches within a Kubernetes ecosystem, detailing pull- and push-based pipeline architectures, their advantages and disadvantages, key considerations such as automated testing, validation and rollbacks, and reviews popular open-source tools like Spinnaker, Jenkins X and Argo CD.

DevOpsautomationci/cd
0 likes · 19 min read
An Overview of CI/CD Pipelines with Kubernetes
AI Cyberspace
AI Cyberspace
Nov 1, 2022 · Cloud Native

10 Essential Cloud‑Native Tools Every Agile Team Should Use

This article outlines ten indispensable cloud‑native tools—from Docker and Kubernetes to Serverless, Helm, Ansible, and Wireshark—explaining how each supports modern agile development, improves infrastructure stability, and accelerates digital transformation in post‑pandemic enterprises.

DevOpsDockerServerless
0 likes · 16 min read
10 Essential Cloud‑Native Tools Every Agile Team Should Use
Cloud Native Technology Community
Cloud Native Technology Community
Nov 1, 2022 · Cloud Native

Essential kubectl Commands and Concepts for Managing Kubernetes Clusters

This article introduces the role of the Kubernetes API server, explains kubectl syntax and concepts, and provides detailed explanations of eight essential kubectl commands—including listing resources, describing objects, creating and modifying resources, deleting objects, handling PersistentVolumes, configuring security contexts, managing deployments, and executing commands inside pods—helping readers master everyday Kubernetes operations.

CLIcloud-nativecontainer orchestration
0 likes · 11 min read
Essential kubectl Commands and Concepts for Managing Kubernetes Clusters
Architect
Architect
Oct 31, 2022 · Cloud Native

Building a Cloud‑Native Container Platform at Ximalaya: Practices, Principles, and Evolution

This article recounts Ximalaya's journey from a simple Docker‑based project template to a full‑featured Kubernetes‑driven container platform, describing the principles, tools, health‑check strategies, deployment patterns, middleware integration, and lessons learned for building reliable cloud‑native services.

ContainerizationDevOpscloud-native
0 likes · 13 min read
Building a Cloud‑Native Container Platform at Ximalaya: Practices, Principles, and Evolution
DevOps Cloud Academy
DevOps Cloud Academy
Oct 31, 2022 · Operations

Rolling Deployment Strategy: Advantages, Disadvantages, and Considerations

The rolling deployment strategy incrementally replaces old application instances with new ones, allowing users to encounter both versions during rollout, and is praised for ease of implementation, low risk, and default support in platforms like Kubernetes, though it can be slow, costly for large infrastructures, and may affect user experience.

Deployment StrategyOperationskubernetes
0 likes · 2 min read
Rolling Deployment Strategy: Advantages, Disadvantages, and Considerations
Open Source Linux
Open Source Linux
Oct 30, 2022 · Operations

Unlock Kubernetes Insights: Master Event Types, Monitoring, and Alerting

This guide explains what Kubernetes events are, how to list and filter them, categorizes common event types, and shows practical ways to collect, store, and alert on events using native commands and open‑source tools, helping teams reduce alert fatigue and improve cluster observability.

EventsOperationsalerting
0 likes · 11 min read
Unlock Kubernetes Insights: Master Event Types, Monitoring, and Alerting
DataFunSummit
DataFunSummit
Oct 30, 2022 · Big Data

Integrating Apache Spark with Cloud‑Native Technologies: Principles, Kubernetes Deployments, EMR on ACK, and Serverless Spark on DLF

This article examines the challenges of traditional Spark clusters and explains how integrating Spark with cloud‑native platforms—through Kubernetes deployment modes, EMR on ACK practices, Remote Shuffle Service, and serverless Spark on DLF—provides elastic scaling, lower operational costs, and advanced features such as executor rolling and custom scheduler support.

Big DataDLFServerless
0 likes · 18 min read
Integrating Apache Spark with Cloud‑Native Technologies: Principles, Kubernetes Deployments, EMR on ACK, and Serverless Spark on DLF
Java Architect Essentials
Java Architect Essentials
Oct 25, 2022 · Cloud Native

Comprehensive Guide to Deploying a Highly Available Kubernetes Cluster with Nginx+Keepalived, Ingress, NFS, and Harbor

This step‑by‑step tutorial walks through setting up a multi‑node Kubernetes cluster, configuring high availability with Nginx and Keepalived, installing Docker and containerd, deploying an NFS provisioner, configuring Ingress, and installing a secure Harbor registry using Helm and custom TLS certificates.

DockerHarborNFS
0 likes · 39 min read
Comprehensive Guide to Deploying a Highly Available Kubernetes Cluster with Nginx+Keepalived, Ingress, NFS, and Harbor
Efficient Ops
Efficient Ops
Oct 25, 2022 · Cloud Native

How Guangdong Mobile Built a Resilient Container Cloud from Scratch

This article details Guangdong Mobile's end‑to‑end journey of designing, constructing, and operating a production‑grade container cloud platform, covering architecture decisions, monitoring, logging, high‑availability, scaling, network optimization, upgrade challenges, and lessons learned for cloud‑native practitioners.

DevOpscloud-nativecontainer cloud
0 likes · 26 min read
How Guangdong Mobile Built a Resilient Container Cloud from Scratch
Cloud Native Technology Community
Cloud Native Technology Community
Oct 25, 2022 · Cloud Native

Harbor: Cloud‑Native Docker Registry Authentication, Installation, and Image Push Guide

This article introduces Harbor, a CNCF‑hosted cloud‑native Docker registry, explains its token‑based authentication flow, details the required Kubernetes and Helm prerequisites, provides step‑by‑step installation and configuration instructions, and demonstrates how to push and pull images using containerd and Kubernetes.

AuthenticationDocker RegistryHarbor
0 likes · 17 min read
Harbor: Cloud‑Native Docker Registry Authentication, Installation, and Image Push Guide
ByteDance Cloud Native
ByteDance Cloud Native
Oct 24, 2022 · Cloud Native

How KubeGateway Solves kube‑apiserver Load‑Balancing and Traffic Governance

KubeGateway, a custom seven‑layer gateway built by ByteDance, eliminates kube‑apiserver load‑imbalance and adds comprehensive request governance—including routing, rate‑limiting, and degradation—by parsing HTTP2 traffic, supporting flexible policies, and transparently proxying requests without client changes.

Authenticationgatewaykubernetes
0 likes · 14 min read
How KubeGateway Solves kube‑apiserver Load‑Balancing and Traffic Governance
Top Architect
Top Architect
Oct 24, 2022 · Backend Development

Evolution of the Tianyi Account Gateway System: From Zuul‑Based 1.0 to Kong‑Based 3.0

The article chronicles the architectural evolution of China Telecom's Tianyi Account gateway—from its 2017 Zuul‑based 1.0 implementation, through a 2.0 redesign using Kong with custom plugins, to the current 3.0 version featuring CP/DP separation, multi‑language plugins, and cloud‑native deployment, highlighting performance gains, scalability, and operational improvements.

KongZuulapi-gateway
0 likes · 14 min read
Evolution of the Tianyi Account Gateway System: From Zuul‑Based 1.0 to Kong‑Based 3.0
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Oct 24, 2022 · Cloud Native

GitOps on Kubernetes: Secure, Scalable Continuous Deployment Best Practices

GitOps, a cloud‑native continuous deployment approach that uses Git as the single source of truth, enables secure, auditable, and multi‑tenant Kubernetes management by automating state reconciliation, supporting multi‑cluster and multi‑environment deployments, and integrating tools like Argo CD, Flux, Helm, and Kustomize.

Argo CDContinuous DeploymentDevOps
0 likes · 12 min read
GitOps on Kubernetes: Secure, Scalable Continuous Deployment Best Practices
MaGe Linux Operations
MaGe Linux Operations
Oct 22, 2022 · Cloud Native

How Do Packets Flow Inside and Outside Kubernetes? A Deep Dive into Pod Networking

This article explains how Kubernetes forwards packets from the initial web request through container networking, covering the Kubernetes network model, Linux network namespaces, the role of the pause container, pod‑to‑pod communication on the same and different nodes, CNI plugins, and how services use Netfilter and iptables to rewrite traffic.

CNIPod Networkingiptables
0 likes · 25 min read
How Do Packets Flow Inside and Outside Kubernetes? A Deep Dive into Pod Networking
21CTO
21CTO
Oct 21, 2022 · Cloud Native

How Ford is Driving Cloud‑Native Transformation: Tools, Culture, and Lessons Learned

Ford’s tech leaders Beckie Riss and Satish Puranam discuss how the automaker’s six‑year cloud‑native journey—spanning Kubernetes, Tekton, Knative, and a culture of curiosity—has reshaped development, accelerated delivery, and tackled the challenges of upskilling engineers and simplifying platform adoption.

DevOpsDigital Transformationcloud-native
0 likes · 9 min read
How Ford is Driving Cloud‑Native Transformation: Tools, Culture, and Lessons Learned
Programmer DD
Programmer DD
Oct 21, 2022 · Cloud Native

How Grafana Mimir Transforms Cloud‑Native Monitoring and Alerting

This article explains how Grafana Mimir provides a scalable, highly‑available, multi‑tenant long‑term storage for Prometheus, details its architecture and core components such as compactor, distributor, ingester, querier, query‑frontend and store‑gateway, and shows step‑by‑step installation, status checking, and Alertmanager configuration for cloud‑native environments.

AlertmanagerCloud Native MonitoringGrafana Mimir
0 likes · 22 min read
How Grafana Mimir Transforms Cloud‑Native Monitoring and Alerting
Bilibili Tech
Bilibili Tech
Oct 21, 2022 · Big Data

Kyuubi at Bilibili: Architecture, Enhancements, and Production Practices for Large‑Scale Data Processing

Bilibili adopted the open‑source Kyuubi proxy to replace its unstable STS layer, enabling multi‑tenant, multi‑engine (Spark, Presto, Flink) SQL/Scala processing with Hive Thrift compatibility, fine‑grained queue isolation, UI monitoring, stability safeguards, and Kubernetes/YARN deployment, while planning further cloud‑native extensions.

Big DataKyuubiSpark
0 likes · 20 min read
Kyuubi at Bilibili: Architecture, Enhancements, and Production Practices for Large‑Scale Data Processing
Hulu Beijing
Hulu Beijing
Oct 21, 2022 · Big Data

How Hulu Scales Spark on Kubernetes: Cloud‑Native Big Data at Disney‑Scale

Hulu’s data platform team describes how they migrated large‑scale Spark workloads from Yarn to native Spark on Kubernetes, leveraging AWS services such as EKS, S3, and custom operators to achieve dynamic scaling, unified monitoring, cost‑effective resource management, and improved stability for search, recommendation, and advertising pipelines.

Big DataData engineeringSpark
0 likes · 18 min read
How Hulu Scales Spark on Kubernetes: Cloud‑Native Big Data at Disney‑Scale
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 20, 2022 · Cloud Native

Why Kubernetes Remains Complex and How Serverless Designs Aim to Simplify It

The article examines the inherent and accidental complexities of Kubernetes as a distributed cluster manager, discusses challenges in resource scheduling, infrastructure diversity, and operational overhead, and explores how cloud‑native solutions such as managed services, nodeless and serverless Kubernetes architectures attempt to reduce these complexities while introducing new trade‑offs.

OperationsServerlesscloud-native
0 likes · 18 min read
Why Kubernetes Remains Complex and How Serverless Designs Aim to Simplify It
DevOps Cloud Academy
DevOps Cloud Academy
Oct 20, 2022 · Cloud Native

Comparative Analysis of KubeSphere and Rainbond Cloud‑Native Application Platforms

This article provides a detailed comparative analysis of the cloud‑native application platforms KubeSphere and Rainbond, evaluating their product positioning, community activity, installation experience, application deployment, micro‑service architecture, marketplace features, multi‑cluster management, and operational capabilities to help readers choose the most suitable solution for their needs.

KubeSphereRainbondcloud-native
0 likes · 17 min read
Comparative Analysis of KubeSphere and Rainbond Cloud‑Native Application Platforms
Huolala Tech
Huolala Tech
Oct 20, 2022 · Cloud Native

How Huolala Cuts Cloud Costs with Kubernetes: Spot Instances, Smart Autoscaling, and Predictive Scaling

This presentation details Huolala's end‑to‑end cloud‑native cost‑optimization strategy, covering the company's infrastructure basics, Kubernetes‑based server cost‑saving techniques, a tailored optimization roadmap, practical Spot Instance usage, and a custom CronHPA‑driven scheduled scaling solution to boost resource utilization.

Autoscalingcloud-nativecost optimization
0 likes · 23 min read
How Huolala Cuts Cloud Costs with Kubernetes: Spot Instances, Smart Autoscaling, and Predictive Scaling
Efficient Ops
Efficient Ops
Oct 19, 2022 · Big Data

Master Prometheus Monitoring for Big Data on Kubernetes: Design & Alerting

This article explains how to design and implement a Prometheus‑based monitoring system for big‑data components running on Kubernetes, covering metric exposure methods, scrape configurations, exporter deployment, and dynamic alert rule management with Alertmanager.

Alert RulesAlertmanagerBig Data Monitoring
0 likes · 17 min read
Master Prometheus Monitoring for Big Data on Kubernetes: Design & Alerting
Xiaolei Talks DB
Xiaolei Talks DB
Oct 19, 2022 · Databases

How to Deploy TiDB on Private or Public Cloud with TiDB Operator

This article explains how TiDB, a cloud‑native distributed database, can be deployed on private or public clouds using the TiDB Operator, covering Kubernetes basics, operator concepts, TiDB Operator features, architecture, and upcoming deployment guides.

Database DeploymentOperatorTiDB
0 likes · 10 min read
How to Deploy TiDB on Private or Public Cloud with TiDB Operator
Software Development Quality
Software Development Quality
Oct 19, 2022 · Cloud Native

Unlocking Developer Speed: Why Internal Development Platforms Matter

Internal Development Platforms (IDPs) unify existing tools and workflows to provide developers with self‑service capabilities, reduce cognitive load, and standardize deployments, while enabling ops teams to automate repetitive tasks, improve productivity, and deliver measurable gains in MTTR, change‑failure rate, deployment frequency, and lead time.

Developer ExperienceInternal Development Platformkubernetes
0 likes · 11 min read
Unlocking Developer Speed: Why Internal Development Platforms Matter
Cloud Native Technology Community
Cloud Native Technology Community
Oct 18, 2022 · Cloud Native

The Benefits of Containerization and Its Role in Modern DevOps

This article explains what containers are, outlines their advantages such as scalability, portability and DevOps integration, describes popular container images and tools like Docker and Kubernetes, and concludes with a summary of why containerization is essential for agile cloud‑native development.

ContainerizationDevOpsDocker
0 likes · 10 min read
The Benefits of Containerization and Its Role in Modern DevOps
Cloud Native Technology Community
Cloud Native Technology Community
Oct 17, 2022 · Cloud Native

A Three‑Step Approach to Understanding, Managing, and Preventing Kubernetes Failures

This article presents a practical three‑step methodology—understanding, managing, and preventing—to troubleshoot Kubernetes deployments, explains how to leverage monitoring, observability, and incident‑response tools, and offers guidance on fostering team collaboration and building resilient, self‑healing cloud‑native systems.

ObservabilityOperationsTroubleshooting
0 likes · 7 min read
A Three‑Step Approach to Understanding, Managing, and Preventing Kubernetes Failures
Open Source Linux
Open Source Linux
Oct 16, 2022 · Cloud Native

Mastering nsenter: Debug Container Networks and Linux Namespaces

This article explains how the nsenter command from util‑linux lets you enter a container's network or other namespaces to run host tools for debugging, details its syntax, provides practical examples with Docker and Kubernetes, and covers the underlying concepts of Linux namespaces, clone, and setns.

Linux Namespacesclonecontainer debugging
0 likes · 9 min read
Mastering nsenter: Debug Container Networks and Linux Namespaces