Tagged articles
4063 articles
Page 23 of 41
Alibaba Cloud Native
Alibaba Cloud Native
Jun 24, 2022 · Cloud Native

How Service Mesh Discovers Services: ASM Config Push & Selective Discovery

This article explains the service discovery and configuration push mechanisms of Alibaba Cloud's managed Istio‑compatible service mesh ASM, demonstrates practical experiments with Sidecar clusters, shows how VirtualService rewrites work across mesh‑in and mesh‑out services, and provides step‑by‑step guidance to enable selective service discovery for performance optimization.

ASMIstioconfiguration push
0 likes · 16 min read
How Service Mesh Discovers Services: ASM Config Push & Selective Discovery
360 Quality & Efficiency
360 Quality & Efficiency
Jun 24, 2022 · Cloud Native

Using Sonobuoy for Kubernetes Conformance Testing: Installation, Execution, and Result Analysis

This article explains how to use the Sonobuoy diagnostic tool to perform Kubernetes conformance testing, covering the definition of conformance, Sonobuoy’s features and architecture, environment setup, test execution with custom plugins, result retrieval, and analysis of failed test cases.

Cluster DiagnosticsConformance TestingSonobuoy
0 likes · 8 min read
Using Sonobuoy for Kubernetes Conformance Testing: Installation, Execution, and Result Analysis
IT Architects Alliance
IT Architects Alliance
Jun 23, 2022 · Cloud Native

Building a Spring Boot Microservice Demo with gRPC and Istio on Kubernetes

This article provides a step‑by‑step guide to creating a simple Spring Boot microservice that uses gRPC for communication, integrates with Istio as a service mesh, containerizes the applications with Docker, and deploys both server and client to a Kubernetes cluster with full configuration files and code examples.

DockerIstioSpring Boot
0 likes · 16 min read
Building a Spring Boot Microservice Demo with gRPC and Istio on Kubernetes
Efficient Ops
Efficient Ops
Jun 23, 2022 · Cloud Native

How Vivo Scales Kubernetes: Automated Multi‑Cluster Management with a Custom Operator

Vivo’s rapid migration to Kubernetes across multiple data centers required a secure, efficient, and reliable way to manage thousands of nodes, leading them to develop a custom k8s‑operator that streamlines cluster deployment, CI testing, declarative APIs, and automated repair for large‑scale cloud‑native environments.

Cluster AutomationDevOpsOperator
0 likes · 3 min read
How Vivo Scales Kubernetes: Automated Multi‑Cluster Management with a Custom Operator
Architect
Architect
Jun 22, 2022 · Cloud Native

Building a Spring Boot gRPC Microservice Demo Deployed on Istio with Kubernetes

This article walks through creating a simple Spring Boot microservice using gRPC, packaging it with Maven, containerizing it with Docker, and deploying both the server and client to a Kubernetes cluster with Istio sidecar injection, demonstrating how Istio removes service‑mesh logic from business code and enables seamless multi‑language microservice communication.

IstioService MeshSpring Boot
0 likes · 21 min read
Building a Spring Boot gRPC Microservice Demo Deployed on Istio with Kubernetes
Cloud Native Technology Community
Cloud Native Technology Community
Jun 22, 2022 · Industry Insights

How to Slash Cloud‑Native Costs: Practical Steps for Better Resource Utilization

This article analyzes the low server utilization problem in modern cloud‑native environments, presents industry survey data, and outlines a four‑step framework—including observability, optimal public‑cloud usage, elasticity sharing, and remote deployment—to help enterprises dramatically reduce cloud costs while maintaining performance.

cloud-nativecost optimizationelastic scaling
0 likes · 23 min read
How to Slash Cloud‑Native Costs: Practical Steps for Better Resource Utilization
Qunar Tech Salon
Qunar Tech Salon
Jun 22, 2022 · Operations

Design and Implementation of Multi‑Cluster HPA Metrics Collection, Analysis, and Reporting in Kubernetes

This article explains the background, benefits, and measurement criteria of Kubernetes Horizontal‑Pod‑Autoscaler (HPA), describes the creation of metric tables and SQL queries for collecting scaling events and CPU usage, and presents a Python‑based workflow that aggregates the data, stores daily reports, validates results, and sends automated email summaries.

OperationsPythonhpa
0 likes · 19 min read
Design and Implementation of Multi‑Cluster HPA Metrics Collection, Analysis, and Reporting in Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Jun 21, 2022 · Cloud Native

How Multus CNI Enables Multi‑Network Pods in Kubernetes

Multus CNI extends Kubernetes by allowing Pods to attach multiple network interfaces through custom resource definitions, detailing its architecture, installation via DaemonSet, CNI configuration, and step‑by‑step examples for creating additional interfaces and verifying network status.

CNICRDMultus
0 likes · 13 min read
How Multus CNI Enables Multi‑Network Pods in Kubernetes
Open Source Linux
Open Source Linux
Jun 21, 2022 · Cloud Native

How Kubernetes Connects Pods, Services, and the Internet – A Deep Dive

This article explains Kubernetes' networking model, covering container‑to‑container, pod‑to‑pod (same‑node and cross‑node), pod‑to‑service, and external traffic handling via load balancers and Ingress, while detailing the underlying Linux namespaces, veth pairs, bridges, iptables, and IPVS mechanisms.

PodServiceingress
0 likes · 23 min read
How Kubernetes Connects Pods, Services, and the Internet – A Deep Dive
Alibaba Cloud Native
Alibaba Cloud Native
Jun 16, 2022 · Cloud Native

How Koordinator Improves Efficiency and Stability for Cloud‑Native Mixed Workloads

This article explains how Alibaba Cloud's open‑source Koordinator system tackles mixed‑workload challenges by introducing priority and QoS models, resource overcommit, load‑aware scheduling, fine‑grained CPU orchestration, and upcoming features such as GPU scheduling and resource recommendation, all illustrated with architecture diagrams and code examples.

KoordinatorQoSScheduling
0 likes · 24 min read
How Koordinator Improves Efficiency and Stability for Cloud‑Native Mixed Workloads
Open Source Linux
Open Source Linux
Jun 16, 2022 · Cloud Native

Mastering Kubernetes Control Plane: etcd, API Server, Scheduler, and Nodes

This article explains the key Kubernetes control‑plane components—including etcd, the API Server, Controller Manager, Scheduler, as well as worker‑node components like Kubelet, kube‑proxy, and the container runtime—detailing their roles, interactions, and the underlying mechanisms such as Raft consensus and admission control.

API ServerControl PlaneScheduler
0 likes · 10 min read
Mastering Kubernetes Control Plane: etcd, API Server, Scheduler, and Nodes
vivo Internet Technology
vivo Internet Technology
Jun 15, 2022 · Cloud Native

Vivo Container Cluster Monitoring Architecture and Cloud‑Native Observability Practices

Vivo’s cloud‑native monitoring solution combines high‑availability Prometheus clusters, VictoriaMetrics storage, Grafana visualization, and a custom leader‑election adapter to deduplicate data while forwarding metrics to Kafka and OLAP systems, addressing large‑scale performance, scalability, and integration challenges and paving the way for AI‑driven AIOps.

Cloud Native MonitoringObservabilityPrometheus
0 likes · 18 min read
Vivo Container Cluster Monitoring Architecture and Cloud‑Native Observability Practices
IT Architects Alliance
IT Architects Alliance
Jun 14, 2022 · Cloud Native

Design and Challenges of Multi‑Active Architecture in Hybrid Cloud Environments

This article examines the design principles, challenges, and implementation details of a multi‑active architecture for hybrid cloud environments, covering stability, cost, efficiency, network topology, container orchestration, service discovery, traffic scheduling, and data storage, and outlines practical solutions used by the Zuoyebang platform.

OperationsReliabilitycloud-native
0 likes · 13 min read
Design and Challenges of Multi‑Active Architecture in Hybrid Cloud Environments
Ops Development Stories
Ops Development Stories
Jun 14, 2022 · Cloud Native

Demystifying Kubernetes Networking: From Pods to the Internet

This comprehensive guide explains Kubernetes' networking model, covering core concepts such as network namespaces, virtual Ethernet pairs, pod-to-pod and pod-to-service communication, CIDR routing, NAT, iptables, IPVS, DNS, and how traffic flows between pods, services, and the external Internet.

CNIService Meshcloud-native
0 likes · 33 min read
Demystifying Kubernetes Networking: From Pods to the Internet
Open Source Linux
Open Source Linux
Jun 14, 2022 · Cloud Native

What Is Cloud Native? A Deep Dive into CNCF, Kubernetes, Docker, and Modern Architecture

This article explores the evolution of cloud computing into its second half, defining cloud native through CNCF's perspective, tracing its history from early Linux containers and Cgroups to Docker and Kubernetes, and detailing the core technologies—containers, service mesh, microservices, immutable infrastructure, and declarative APIs—while highlighting their practical benefits for rapid iteration, automated deployment, and efficient, scalable applications.

DockerService Meshimmutable infrastructure
0 likes · 23 min read
What Is Cloud Native? A Deep Dive into CNCF, Kubernetes, Docker, and Modern Architecture
Cloud Native Technology Community
Cloud Native Technology Community
Jun 13, 2022 · Information Security

2022 Cloud‑Native Threat Report: Key Findings on Kubernetes Attacks, Supply‑Chain Risks, and Log4j Exploits

The 2022 Cloud‑Native Threat Report reveals a rise in Kubernetes‑targeted attacks, persistent supply‑chain threats in container images, and active exploitation of the Log4j zero‑day, underscoring the urgent need for specialized security measures in modern cloud‑native environments.

Supply Chaincloud-nativekubernetes
0 likes · 4 min read
2022 Cloud‑Native Threat Report: Key Findings on Kubernetes Attacks, Supply‑Chain Risks, and Log4j Exploits
Ops Development Stories
Ops Development Stories
Jun 12, 2022 · Cloud Native

How to Deploy and Manage Applications with Zadig: A Step‑by‑Step Guide

This comprehensive guide walks you through installing, configuring, and upgrading the cloud‑native continuous delivery platform Zadig—including MySQL setup, Helm chart deployment, GitLab integration, environment and pipeline management, and optional SonarQube code scanning—providing practical commands and screenshots for end‑to‑end implementation.

DevOpsGitLabZadig
0 likes · 21 min read
How to Deploy and Manage Applications with Zadig: A Step‑by‑Step Guide
Architects' Tech Alliance
Architects' Tech Alliance
Jun 12, 2022 · Cloud Computing

Design, Challenges, and Best Practices of Multi‑Active Hybrid Cloud Architecture

This article examines the motivations, stability and cost considerations, technical challenges, and design principles of a multi‑active hybrid cloud architecture, illustrating how container orchestration, service governance, traffic scheduling, and data storage are coordinated to achieve high availability and cost efficiency across multiple cloud providers.

Stabilitycost optimizationhybrid cloud
0 likes · 14 min read
Design, Challenges, and Best Practices of Multi‑Active Hybrid Cloud Architecture
ITPUB
ITPUB
Jun 12, 2022 · Cloud Native

How Vivo Scales Multi‑Data‑Center Kubernetes with a Custom Operator

Vivo describes how it built a Kubernetes‑Operator and CI pipeline to automate large‑scale, multi‑data‑center cluster deployment, modular management, and lifecycle operations using Ansible, kubeadm, and kubevirt, improving reliability, maintainability, and scalability of its Kubernetes fleets.

AnsibleMulti-ClusterOperator
0 likes · 14 min read
How Vivo Scales Multi‑Data‑Center Kubernetes with a Custom Operator
IT Architects Alliance
IT Architects Alliance
Jun 12, 2022 · Cloud Native

Why Kubernetes’ Real Power Lies in Its API, Not Just Containers

The article explains that Kubernetes’ core value is its universal, extensible declarative API framework rather than containers, walks through API types, CRD extensions, and uses a fruit‑CRD example to show how CRUD operations map to database concepts, concluding that the API is the true engine of Kubernetes.

APICRDDeclarative
0 likes · 16 min read
Why Kubernetes’ Real Power Lies in Its API, Not Just Containers
Cloud Native Technology Community
Cloud Native Technology Community
Jun 9, 2022 · Cloud Native

Ericsson’s Cloud‑Native Journey for 5G Transformation

Ericsson leverages cloud‑native technologies—including Kubernetes, micro‑services, CI/CD pipelines, and CNCF tools—to accelerate its 5G network rollout, reduce service feedback times from weeks to hours, lower TCO, and enable rapid, reliable deployment of telecom applications across carrier networks.

5Gci/cdcloud-native
0 likes · 10 min read
Ericsson’s Cloud‑Native Journey for 5G Transformation
vivo Internet Technology
vivo Internet Technology
Jun 8, 2022 · Cloud Native

Vivo’s Large‑Scale Kubernetes Operator Practice for Multi‑Data‑Center Cluster Management

Vivo replaced error‑prone manual Ansible playbooks with a custom Kubernetes Operator that uses declarative CRDs and modular Ansible scripts to automate the full lifecycle—deployment, scaling, upgrades, and recovery—of thousands of nodes across multiple data‑centers, supported by extensive CI testing and future kubeadm integration.

AnsibleCI/CDCluster Management
0 likes · 14 min read
Vivo’s Large‑Scale Kubernetes Operator Practice for Multi‑Data‑Center Cluster Management
Efficient Ops
Efficient Ops
Jun 6, 2022 · Cloud Native

How to Migrate Kubernetes from Docker to cri‑o: Step‑by‑Step Guide

This guide explains why Kubernetes dropped dockershim, clarifies OCI and CRI standards, compares Docker, containerd and cri‑o runtimes, and provides a step‑by‑step procedure—including node draining, Docker removal, kernel tuning, cri‑o installation, kubelet reconfiguration, and verification—to successfully migrate a cluster from Docker to cri‑o before upgrading to version 1.24.

CRI-ODevOpsDocker migration
0 likes · 14 min read
How to Migrate Kubernetes from Docker to cri‑o: Step‑by‑Step Guide
IT Architects Alliance
IT Architects Alliance
Jun 5, 2022 · Cloud Native

Comparing Spring Cloud, Kubernetes, and Istio: From Monolithic Applications to Service Mesh Architecture

This article examines the evolution from large monolithic applications to distributed micro‑service systems, comparing Spring Cloud with Kubernetes and Istio, discussing the role of Service Mesh, and evaluating the benefits and trade‑offs of using Spring Boot on K8s versus a full Service Mesh solution.

IstioService Meshcloud-native
0 likes · 10 min read
Comparing Spring Cloud, Kubernetes, and Istio: From Monolithic Applications to Service Mesh Architecture
Top Architect
Top Architect
Jun 2, 2022 · Cloud Native

A Beginner's Guide to Designing, Implementing, and Deploying Microservices on Kubernetes

This article walks readers through the complete lifecycle of a microservice system—from architectural design and Java Spring Boot implementation to Kubernetes deployment, high‑availability setup, monitoring with Prometheus/Grafana, tracing with Zipkin, and flow‑control with Sentinel—providing practical code snippets and step‑by‑step instructions.

Tracingcloud-nativehigh-availability
0 likes · 21 min read
A Beginner's Guide to Designing, Implementing, and Deploying Microservices on Kubernetes
Open Source Linux
Open Source Linux
Jun 1, 2022 · Cloud Native

How Does Kubernetes Really Handle Container Networking? A Deep Dive

This article explains Kubernetes' network model, covering container‑to‑container, pod‑to‑pod, pod‑to‑service, and external traffic flows, while detailing network namespaces, veth pairs, Linux bridges, iptables/IPVS load‑balancing, and cloud‑native solutions like LoadBalancer and Ingress controllers.

IPVSNetwork ModelPod Networking
0 likes · 22 min read
How Does Kubernetes Really Handle Container Networking? A Deep Dive
DevOps Cloud Academy
DevOps Cloud Academy
May 31, 2022 · Information Security

Kubernetes Security Best Practices and Assessment Tools

This article outlines essential Kubernetes security principles based on the 4C model, discusses best practices for clusters, containers, and code, and reviews several open‑source tools such as Kubescape, kube‑bench, Terrascan, kube‑hunter, and Anchore for assessing and improving cluster security.

ComplianceDevOpscloud-native
0 likes · 7 min read
Kubernetes Security Best Practices and Assessment Tools
Efficient Ops
Efficient Ops
May 30, 2022 · Cloud Native

Understanding Low‑Level vs High‑Level Container Runtimes and CRI in Kubernetes

This article explains the evolution of container runtimes, distinguishing low‑level and high‑level runtimes, introduces key projects such as runC, containerd, CRI‑O, and demonstrates practical demos and CRI integration with Kubernetes, providing code examples and architectural diagrams for cloud‑native practitioners.

CRIcontainer-runtimehigh-level
0 likes · 16 min read
Understanding Low‑Level vs High‑Level Container Runtimes and CRI in Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
May 30, 2022 · Cloud Native

How Raven Enables Non-Intrusive Cross-Edge Container Networking in OpenYurt v0.7.0

OpenYurt v0.7.0 introduces the Raven solution, a non‑intrusive, encrypted‑tunnel based approach that seamlessly connects pods across public‑cloud, edge‑to‑edge, and edge‑to‑cloud environments without modifying existing CNI plugins, while also adding EdgeX Foundry LTS support, Kubernetes 1.22 compatibility, and IPv6 networking.

OpenYurtRAVENcloud-native
0 likes · 8 min read
How Raven Enables Non-Intrusive Cross-Edge Container Networking in OpenYurt v0.7.0
Zuoyebang Tech Team
Zuoyebang Tech Team
May 27, 2022 · Cloud Native

How We Scaled 10,000+ K8s CronJobs with Serverless and Solved Node Instability

This article describes the challenges encountered when migrating tens of thousands of Kubernetes cronjobs from VMs to a cluster—node instability, low resource utilization, and scheduling delays—and explains how introducing a serverless architecture with virtual nodes, a custom job scheduler, unified logging and monitoring, and sandbox reuse restored stability, improved performance, and reduced resource costs by about 70%.

CronJobServerlesskubernetes
0 likes · 9 min read
How We Scaled 10,000+ K8s CronJobs with Serverless and Solved Node Instability
Shopee Tech Team
Shopee Tech Team
May 26, 2022 · Cloud Computing

Shopee's Green Computing Practices: Optimizing Resource Utilization in Data Centers

Shopee reduces data‑center carbon emissions by over 40,000 tons annually through three 2021 green‑computing technologies—Overcommit resource oversubscription, mixed‑model Colocation of latency‑sensitive and batch workloads, and enhanced Auto Scaling that leverages global metrics to cut machine usage and improve resource efficiency.

Auto ScalingCloud Computingcarbon emissions
0 likes · 15 min read
Shopee's Green Computing Practices: Optimizing Resource Utilization in Data Centers
MaGe Linux Operations
MaGe Linux Operations
May 25, 2022 · Operations

Why Kubernetes LIST Requests Can Cripple Your Cluster and How to Fix Them

This article examines how heavy LIST operations in unstructured storage systems like Ceph and etcd consume massive I/O, network and CPU, threaten cluster stability, and offers detailed code analysis, performance testing, and practical tuning recommendations to keep large‑scale Kubernetes clusters reliable.

CachingListScalability
0 likes · 29 min read
Why Kubernetes LIST Requests Can Cripple Your Cluster and How to Fix Them
dbaplus Community
dbaplus Community
May 23, 2022 · Cloud Native

Why Kubernetes Is the Core of Cloud‑Native Architecture: A Deep Dive

This article provides a comprehensive overview of Kubernetes, covering its revolutionary role in cloud computing, master‑worker architecture, core concepts such as Pods and Controllers, storage and networking models, workload types, and extensibility through CRI, CNI, and CSI standards.

Podsarchitecturecloud-native
0 likes · 18 min read
Why Kubernetes Is the Core of Cloud‑Native Architecture: A Deep Dive
ITPUB
ITPUB
May 23, 2022 · Databases

How to Choose the Right Cloud‑Native Database: A Maturity Model Guide

This article explains the evolution of cloud service models, defines cloud‑native databases, and presents a detailed maturity model—from basic cloud‑ready deployments to fully serverless data services—helping enterprises evaluate and adopt the most suitable architecture and technology stack for their workloads.

CaaSDBaaSDatabases
0 likes · 11 min read
How to Choose the Right Cloud‑Native Database: A Maturity Model Guide
Cloud Native Technology Community
Cloud Native Technology Community
May 23, 2022 · Cloud Native

How Kube-OVN Redefines Cloud‑Native Networking for Modern Enterprises

This article reviews the evolution of cloud‑native networking in China, analyzes platformization and underlay trends, explains the technical challenges of traditional CNI solutions, and details Kube‑OVN's architecture, features, and roadmap as a comprehensive enterprise‑grade Kubernetes networking platform.

CNICloud Native NetworkingKube-OVN
0 likes · 21 min read
How Kube-OVN Redefines Cloud‑Native Networking for Modern Enterprises
Efficient Ops
Efficient Ops
May 22, 2022 · Cloud Native

How to Run Multiple Containers Sequentially in a Single Kubernetes Pod

This article explains how to execute several containers one after another within a single Kubernetes pod by leveraging initContainers and native Job mechanisms, compares alternative solutions such as Volcano and Argo, provides complete YAML examples, and discusses practical considerations like volume sharing, security contexts, and timeout settings.

ArgoJobVolcano
0 likes · 9 min read
How to Run Multiple Containers Sequentially in a Single Kubernetes Pod
MaGe Linux Operations
MaGe Linux Operations
May 22, 2022 · Cloud Native

Master K3s: Complete Guide to Install, Configure, and Scale Lightweight Kubernetes

This comprehensive tutorial walks you through the fundamentals of K3s, a lightweight CNCF‑certified Kubernetes distribution, covering its architecture, installation on server and agent nodes, configuration options, networking, storage, Helm integration, high‑availability setups, offline installation, upgrades, backups, and advanced customization for edge and cloud‑native environments.

K3scloud-nativehelm
0 likes · 44 min read
Master K3s: Complete Guide to Install, Configure, and Scale Lightweight Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
May 22, 2022 · Cloud Native

How Alibaba Evolved Envoy Gateway from 1.0 to 3.0: A Cloud‑Native Journey

This article chronicles Alibaba's three‑stage evolution of Envoy Gateway—from the 1.0 incubation that powered Double‑11 traffic, through the 2.0 growth phase adding north‑south routing and hybrid‑cloud support, to the mature 3.0 release that unifies traffic and micro‑service gateways with high performance and community‑driven extensions.

AlibabaEnvoyService Mesh
0 likes · 8 min read
How Alibaba Evolved Envoy Gateway from 1.0 to 3.0: A Cloud‑Native Journey
IT Architects Alliance
IT Architects Alliance
May 21, 2022 · Backend Development

Why Choose Microservice Architecture? A Comprehensive Roadmap and Tool Guide

This article explains why microservice architecture improves flexibility and scalability, outlines a clear learning roadmap, and details essential components such as Docker, orchestration, API gateways, load balancing, service discovery, event buses, logging, monitoring, tracing, data persistence, caching, and cloud providers, helping readers start or migrate to microservices.

Dockerarchitecturekubernetes
0 likes · 16 min read
Why Choose Microservice Architecture? A Comprehensive Roadmap and Tool Guide
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
May 20, 2022 · Backend Development

Securing Public‑Facing Kafka: Authentication, Configuration, and Scaling Strategies

This article shares G7 Tech’s practical experience of exposing Kafka to the public internet, covering encryption, AAA, three authentication schemes, listener configuration, scaling for massive topics with Kubernetes, storage optimization, and integration with the gmq management platform and Kafka‑REST.

AuthenticationKafkaPublic Network
0 likes · 10 min read
Securing Public‑Facing Kafka: Authentication, Configuration, and Scaling Strategies
Ops Development Stories
Ops Development Stories
May 20, 2022 · Cloud Native

How to Use Zadig’s Self‑Test Mode for Isolated Microservice Development with Istio and SkyWalking

This guide explains how to leverage Zadig v1.11.0’s environment self‑test mode to create isolated developer sub‑environments for large‑scale microservice projects, using Istio’s traffic routing and SkyWalking tracing, detailing prerequisite components, configuration of a PiggyMetrics Spring Boot example, and step‑by‑step verification via Kubernetes commands and curl requests.

IstioSelf-testService Mesh
0 likes · 11 min read
How to Use Zadig’s Self‑Test Mode for Isolated Microservice Development with Istio and SkyWalking
Ops Development Stories
Ops Development Stories
May 18, 2022 · Cloud Native

Mastering K8s Application Lifecycle: Health Checks, Graceful Shutdown, Metrics & Tracing

This article explains how developers and operators should prepare a Go‑based service for Kubernetes by implementing health‑check endpoints, graceful shutdown handling, metrics exposure, tracing integration, standardized logging, and operational best practices such as stateless design, high availability, self‑healing, and HTTPS configuration.

DevOpsGraceful ShutdownTracing
0 likes · 20 min read
Mastering K8s Application Lifecycle: Health Checks, Graceful Shutdown, Metrics & Tracing
Cloud Native Technology Community
Cloud Native Technology Community
May 17, 2022 · Cloud Native

Deep Dive into Kube-OVN Controller Architecture and Initialization

This article provides an in‑depth technical overview of the Kube‑OVN controller, covering its deployment as a Kubernetes deployment, main initialization flow, worker processes for VPC handling, default VPC creation, and related code snippets, illustrating how OpenStack‑style networking is integrated into Kubernetes.

CNIKube-OVNNetwork Controller
0 likes · 7 min read
Deep Dive into Kube-OVN Controller Architecture and Initialization
Cloud Native Technology Community
Cloud Native Technology Community
May 17, 2022 · Cloud Native

Technical Overview of Kube-OVN Based Network Solution for Mixed VM and Container Environments

This article presents a detailed technical overview of how ByteDance selected Kube-OVN for a mixed virtual‑machine and container networking scenario, describes the initial network design, identifies performance issues, and outlines three improvement plans including OVS‑DPDK, source‑route optimization, and a switch from Geneve to VXLAN.

CNIDPDKKube-OVN
0 likes · 7 min read
Technical Overview of Kube-OVN Based Network Solution for Mixed VM and Container Environments
Efficient Ops
Efficient Ops
May 16, 2022 · Cloud Native

How Kubernetes Scheduler Finds the Best Node for Your Pods

This article explains the evolution and core principles of the Kubernetes scheduler, outlines its fairness, efficiency, performance, and flexibility goals, describes the scheduling cycle and filtering‑scoring process, and introduces configuration methods for custom scheduling policies.

Pod SchedulingSchedulerScheduling Algorithms
0 likes · 8 min read
How Kubernetes Scheduler Finds the Best Node for Your Pods
Programmer DD
Programmer DD
May 16, 2022 · Cloud Native

Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus

This guide introduces Loki, the open‑source, horizontally scalable log aggregation system optimized for Prometheus and Kubernetes, covering its core concepts, architecture, components, deployment steps, Grafana integration, label‑based indexing, and best practices for handling dynamic and high‑cardinality tags.

GrafanaLokiObservability
0 likes · 19 min read
Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus
Alibaba Cloud Native
Alibaba Cloud Native
May 16, 2022 · Cloud Native

OpenClusterManagement 0.7 Highlights: DefaultClusterSet, v1beta1 Placement API, Hosted Mode

Version 0.7 of OpenClusterManagement introduces the DefaultClusterSet for automatic cluster grouping, upgrades the Placement API to v1beta1 with Taint/Toleration‑based scheduling, lowers the Hub cluster version requirement to 1.12, adds a Hosted deployment mode, and integrates enhanced multi‑cluster features with KubeVela 1.3 and Alibaba Cloud ACK Agile.

Hosted DeploymentKubeVelaMulti-Cluster
0 likes · 6 min read
OpenClusterManagement 0.7 Highlights: DefaultClusterSet, v1beta1 Placement API, Hosted Mode
Baidu Geek Talk
Baidu Geek Talk
May 16, 2022 · Cloud Native

Cloud-Native Environment Management Platform for Document Library: Architecture, Implementation and Benefits

To overcome unsynchronized development, slow deployments, and fragmented testing after moving its document library to the cloud, the team built a Kubernetes‑driven environment‑management platform that automatically provisions containerized sub‑environments on demand, scaling to thousands, cutting costs, boosting security, and enabling rapid, parallel testing for hundreds of projects.

DevOpsPlatform Engineeringci/cd
0 likes · 13 min read
Cloud-Native Environment Management Platform for Document Library: Architecture, Implementation and Benefits
Ctrip Technology
Ctrip Technology
May 12, 2022 · Cloud Native

Ctrip's Service Mesh Evolution: Architecture, Implementation, and Lessons Learned

This article details Ctrip's transition from traditional SOA to a cloud‑native Service Mesh using Istio, covering background challenges, technical solutions for control and data planes, SDK compatibility, configuration management, performance optimizations, and future directions such as WebAssembly and sidecar considerations.

IstioSOAkubernetes
0 likes · 28 min read
Ctrip's Service Mesh Evolution: Architecture, Implementation, and Lessons Learned
Open Source Linux
Open Source Linux
May 12, 2022 · Cloud Native

Mastering Kubernetes Control Plane: etcd, API Server, Scheduler & More

This article explains the core components of the Kubernetes control plane—including etcd, the API Server, Controller Manager, Scheduler—as well as key worker‑node components like Kubelet, kube‑proxy, and the container runtime, detailing their roles, interactions, and essential functions.

API ServerControl PlaneScheduler
0 likes · 11 min read
Mastering Kubernetes Control Plane: etcd, API Server, Scheduler & More
Alibaba Cloud Native
Alibaba Cloud Native
May 11, 2022 · Cloud Native

How Zuoyebang Cut 22% Costs with Kubernetes Serverless Virtual Nodes

Zuoyebang’s shift to cloud‑native architecture leveraged Alibaba Cloud’s Kubernetes Serverless virtual nodes, achieving a 22.5% cost reduction during peak traffic by dynamically scaling workloads, while addressing scheduling, observability, and performance challenges through custom schedulers, enhanced monitoring, and careful testing.

ObservabilityServerlesscloud-native
0 likes · 11 min read
How Zuoyebang Cut 22% Costs with Kubernetes Serverless Virtual Nodes
Cloud Native Technology Community
Cloud Native Technology Community
May 10, 2022 · Cloud Native

How PayPal Scaled Kubernetes to 4,100 Nodes and 200k Pods

PayPal’s engineering team detailed their journey of scaling Kubernetes from a few hundred nodes to over 4,100 nodes and 200,000 Pods, describing cluster topology, workload generation, API server bottlenecks, controller manager and scheduler tuning, extensive etcd optimizations, and the resulting performance gains that met Kubernetes SLOs.

PayPalScalingcloud-native
0 likes · 13 min read
How PayPal Scaled Kubernetes to 4,100 Nodes and 200k Pods
IT Architects Alliance
IT Architects Alliance
May 9, 2022 · Cloud Native

Mastering Kubernetes: Core Concepts from Container Ops to Network Isolation

This article provides a comprehensive overview of Kubernetes, covering container operation fundamentals, multi‑site deployment models, service discovery mechanisms, shared pod resources, common CNI plugins, layered load‑balancing approaches, isolation dimensions, and essential network model principles for large‑scale clusters.

CNIcloud-nativecontainer orchestration
0 likes · 13 min read
Mastering Kubernetes: Core Concepts from Container Ops to Network Isolation
Alibaba Cloud Native
Alibaba Cloud Native
May 9, 2022 · Cloud Native

Build a Two‑Region, Three‑Center Disaster‑Recovery System with ACK One Multi‑Cluster Deployment

This guide walks through creating a two‑region, three‑center disaster‑recovery architecture using ACK One’s multi‑cluster application distribution and GTM traffic management, covering cluster setup, differential configuration, workflow control, traffic policies, and verification steps with concrete kubectl commands and YAML manifests.

ACK OneGTMMulti-Cluster
0 likes · 18 min read
Build a Two‑Region, Three‑Center Disaster‑Recovery System with ACK One Multi‑Cluster Deployment
Efficient Ops
Efficient Ops
May 8, 2022 · Cloud Native

How Do Kubernetes Pods and Services Achieve Load Balancing?

This article reviews the core structure of Kubernetes pods, explains how pods expose services externally, explores load‑balancing strategies using Services and virtual IPs, and details how kube‑proxy and selectors keep services in sync with changing pod lifecycles.

PodServicecloud-native
0 likes · 7 min read
How Do Kubernetes Pods and Services Achieve Load Balancing?
Tencent Cloud Developer
Tencent Cloud Developer
May 7, 2022 · Cloud Native

Fourth Techo TVP Developer Conference: Cloud Native Trends and Best Practices

The Fourth Techo TVP Developer Conference highlighted current cloud‑native adoption, FinOps cost‑optimization, distributed‑cloud strategies, and maturity models on Day 1, then showcased practical best‑practice case studies—from automotive edge computing to service‑mesh migration, hybrid‑cloud PaaS evolution, observability standards, and high‑performance API‑gateway deployments—on Day 2.

APISIXDevOpsFinOps
0 likes · 33 min read
Fourth Techo TVP Developer Conference: Cloud Native Trends and Best Practices
Java High-Performance Architecture
Java High-Performance Architecture
May 7, 2022 · Cloud Native

Unlocking Kubernetes Architecture: From Pods to High‑Availability Design

This article provides a comprehensive overview of Kubernetes architecture, covering its core components, service discovery methods, pod shared resources, common CNI plugins, multi‑layer load balancing, isolation dimensions, and network model principles that enable scalable, high‑availability container operations.

CNIPod Architecturecloud-native
0 likes · 10 min read
Unlocking Kubernetes Architecture: From Pods to High‑Availability Design
Open Source Linux
Open Source Linux
May 6, 2022 · Cloud Native

What’s New in Kubernetes 1.24? A Deep Dive into Major Features and Changes

Kubernetes 1.24 introduces 46 enhancements—including dockershim removal, default‑disabled beta APIs, signed release artifacts, OpenAPI v3 support, storage capacity tracking, CSI volume expansion, and new priority options—while also detailing CNI version considerations, upgraded runtimes, and how to access the release notes and tutorials.

1.24CNIRelease Notes
0 likes · 10 min read
What’s New in Kubernetes 1.24? A Deep Dive into Major Features and Changes
Alibaba Cloud Native
Alibaba Cloud Native
May 5, 2022 · Cloud Native

Achieving Low‑Cost, High‑Elastic Kubernetes Deployments with ACK, ECI, and OpenKruise

This article explains how to use Kubernetes native autoscaling components—HPA, VPA, Cluster Autoscaler—and cloud‑native extensions such as Alibaba Cloud's Virtual Node, Elastic Container Instance, Elastic Workload, and the open‑source OpenKruise to build a cost‑effective, highly elastic architecture on ACK clusters.

AutoscalingCluster AutoscalerElastic Workload
0 likes · 28 min read
Achieving Low‑Cost, High‑Elastic Kubernetes Deployments with ACK, ECI, and OpenKruise
Cloud Native Technology Community
Cloud Native Technology Community
May 5, 2022 · Cloud Native

What’s New in Kubernetes 1.24? Key Features and Changes Explained

Kubernetes 1.24, released in early 2022, introduces 46 enhancements—including the removal of dockershim, default‑off test APIs, signed release artifacts, OpenAPI v3 support, GA storage capacity tracking and volume expansion, new CNI requirements, and numerous API graduations—while also deprecating or deleting several legacy components.

CNIFeature HighlightsRelease 1.24
0 likes · 18 min read
What’s New in Kubernetes 1.24? Key Features and Changes Explained
MaGe Linux Operations
MaGe Linux Operations
May 1, 2022 · Cloud Native

Mastering Kubernetes Networking: From Containers to Services

This article explains Kubernetes' networking model, covering container‑to‑container, pod‑to‑pod, pod‑to‑service, and external traffic flows, detailing network namespaces, veth pairs, bridges, iptables/IPVS load balancing, and cloud‑provider integrations such as AWS VPC and Ingress controllers.

Servicecloud-nativekubernetes
0 likes · 25 min read
Mastering Kubernetes Networking: From Containers to Services
MaGe Linux Operations
MaGe Linux Operations
Apr 30, 2022 · Cloud Native

Mastering Kubernetes DaemonSets: Ensure One Pod per Node Efficiently

DaemonSets in Kubernetes automatically run a dedicated pod on each node, handling storage, logging, and monitoring tasks, while using node selectors, affinities, and tolerations to manage placement, resources, and lifecycle, with examples and YAML definitions for fluentd‑elasticsearch deployment.

DaemonSetNodeAffinityPod Scheduling
0 likes · 9 min read
Mastering Kubernetes DaemonSets: Ensure One Pod per Node Efficiently
Architect
Architect
Apr 29, 2022 · Cloud Native

Understanding Kubernetes Architecture and Its Role in Cloud‑Native Environments

This article explains the fundamentals of Kubernetes, its three core characteristics, practical deployment scenarios for enterprises of different sizes, resource‑utilization calculations, service migration challenges, the evolution from physical servers to containers, and a detailed overview of cloud‑native concepts and Kubernetes components.

Containerizationcloud-nativekubernetes
0 likes · 10 min read
Understanding Kubernetes Architecture and Its Role in Cloud‑Native Environments
Efficient Ops
Efficient Ops
Apr 29, 2022 · Operations

How Ctrip Scaled Its Cloud Platform to 10k Nodes: Real‑World Kubernetes Ops Lessons

This article shares Ctrip's practical experiences in scaling a hybrid private‑cloud platform to over ten thousand nodes, covering Kubernetes control‑plane stability, host monitoring, network observability, image management, and capacity planning to ensure high availability for massive online services.

Network ObservabilityPerformance Optimizationcloud operations
0 likes · 18 min read
How Ctrip Scaled Its Cloud Platform to 10k Nodes: Real‑World Kubernetes Ops Lessons
MaGe Linux Operations
MaGe Linux Operations
Apr 28, 2022 · Cloud Native

When to Use kubectl run vs kubectl create/apply in Kubernetes?

This article explains the two primary ways to create resources in Kubernetes—using the direct kubectl run command and using kubectl create/apply with YAML files—providing examples, code snippets, and guidance on when each method is appropriate for ad‑hoc testing or declarative, large‑scale deployments.

applycreatekubectl
0 likes · 7 min read
When to Use kubectl run vs kubectl create/apply in Kubernetes?
Efficient Ops
Efficient Ops
Apr 27, 2022 · Operations

Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation

This article explains the motivations for selecting Grafana Loki instead of traditional ELK/EFK stacks, introduces Loki's core concepts and architecture, details component roles, provides step‑by‑step deployment of Promtail and Loki, and demonstrates how to configure and query logs in Grafana while addressing label indexing, dynamic tags, high‑cardinality challenges, and query performance.

GrafanaLokiObservability
0 likes · 18 min read
Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation
ITPUB
ITPUB
Apr 27, 2022 · Artificial Intelligence

How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%

This article details the design and optimization of 58.com’s WPAI machine learning platform, covering background, training‑task scheduling, elastic inference scaling, offline‑online resource mixing, and model‑inference acceleration, and shows how these techniques collectively raised GPU usage by 51% and CPU usage by 38% while cutting costs.

AI platformGPU utilizationInference Acceleration
0 likes · 26 min read
How 58’s WPAI Platform Boosted AI Resource Utilization by Over 50%
Alibaba Cloud Native
Alibaba Cloud Native
Apr 27, 2022 · Cloud Native

How ACK’s Resource Profiling Optimizes Kubernetes CPU & Memory Requests

This article explains how Alibaba Cloud Container Service for Kubernetes (ACK) uses container‑level resource profiling with half‑life sliding windows and quantile algorithms to automatically recommend accurate CPU and memory requests, improving cluster utilization while maintaining application stability.

ACKQuantile AlgorithmResource Profiling
0 likes · 9 min read
How ACK’s Resource Profiling Optimizes Kubernetes CPU & Memory Requests
DataFunSummit
DataFunSummit
Apr 26, 2022 · Artificial Intelligence

Elastic Distributed Training at Huya: Design, Implementation, and Results

This talk describes Huya’s elastic distributed training system, covering the motivation behind elasticity, its design using Kubernetes and ETCD for dynamic node registration and scaling, implementation details of the EFDL framework, performance evaluations on ResNet‑50, and the resulting benefits and future directions.

AI platformGPU schedulingHuya
0 likes · 11 min read
Elastic Distributed Training at Huya: Design, Implementation, and Results
Zuoyebang Tech Team
Zuoyebang Tech Team
Apr 26, 2022 · Cloud Native

How Serverless Kubernetes Virtual Nodes Cut Costs and Boost Scalability

Zhang's team at Zuoyebang details their journey to serverless Kubernetes virtual nodes, explaining how elastic scaling, fine-grained scheduling, and cost‑effective resource utilization transformed high‑peak online services, while addressing challenges in scheduling, observability, performance, and multi‑cloud resilience.

Serverlesscost optimizationkubernetes
0 likes · 11 min read
How Serverless Kubernetes Virtual Nodes Cut Costs and Boost Scalability
Volcano Engine Developer Services
Volcano Engine Developer Services
Apr 26, 2022 · Operations

How Volcano Engine’s TLS Transforms Log Management for Kubernetes at Scale

This article explains the challenges of traditional open‑source log collection in cloud‑native environments, describes Volcano Engine’s unified TLS architecture, its centralized configuration, CRD‑based deployment, and showcases real‑world case studies that demonstrate improved availability, efficiency, and scalability.

Distributed SystemsLog ManagementObservability
0 likes · 15 min read
How Volcano Engine’s TLS Transforms Log Management for Kubernetes at Scale
Code DAO
Code DAO
Apr 26, 2022 · Artificial Intelligence

Building an Open-Source ML Pipeline – Part 1: Data Ingestion & Storage

This article walks through building the first stage of an open‑source MLOps pipeline—data ingestion and storage—by outlining requirements, selecting tools such as Argo Workflows, Minio and Great Expectations, showing how to set up a minikube cluster, and providing Python scripts and an Argo CronWorkflow to extract, transform, and load OpenAQ air‑quality data into Minio.

Argo WorkflowsMLOpsMinio
0 likes · 10 min read
Building an Open-Source ML Pipeline – Part 1: Data Ingestion & Storage