Tagged articles
4063 articles
Alibaba Cloud Native
Feb 28, 2022 · Cloud Native

How to Observe and Diagnose DNS Failures in Kubernetes Clusters

This article explains how DNS operates inside Kubernetes, enumerates common failure causes, describes CoreDNS's built‑in observability plugins, introduces BPF‑based client‑side diagnostics, and provides a step‑by‑step troubleshooting workflow to identify and resolve DNS issues in cloud‑native environments.

BPF, CoreDNS, DNS
0 likes · 18 min read
Open Source Linux
Feb 28, 2022 · Cloud Native

Master Kubernetes Essentials: Pods, Services, Deployments, and More

This article introduces Kubernetes, the open‑source container‑orchestration platform, and explains its core concepts—including Pods, Namespaces, Nodes, Services, Volumes, PersistentVolumes, Deployments, StatefulSets, DaemonSets, Ingress, Jobs, Horizontal Pod Autoscaling, Service Accounts, Secrets, ConfigMaps, and Resource Quotas—along with common kubectl commands for managing each component.

DevOps, cloud-native, container orchestration
0 likes · 17 min read
MaGe Linux Operations
Feb 26, 2022 · Cloud Computing

How Docker’s Rise and Fall Reshaped the Cloud‑Container Landscape

This article chronicles Docker’s meteoric ascent as a pioneering container platform, its leadership turmoil, fierce competition with Kubernetes, the 2019 split and sale to Mirantis, and the lasting lessons it offers about the challenges open‑source startups face in sustaining commercial success.

Cloud Computing, Containers, Docker
0 likes · 14 min read
MaGe Linux Operations
Feb 25, 2022 · Cloud Native

Mastering Kubernetes Services: Types, Usage, and Iptables Implementation

This article explains the fundamental concepts, configuration methods, and underlying mechanisms of Kubernetes Services, covering typical Service definitions, the four Service types, their behavior under kube-proxy's iptables mode, and practical troubleshooting insights drawn from a real production outage.

ClusterIP, Service, iptables
0 likes · 12 min read
ITPUB
Feb 24, 2022 · Cloud Native

Mastering Kubernetes: Core Concepts, Architecture, and Networking Essentials

This article provides a comprehensive overview of Kubernetes, covering its purpose for container operations, key functions, core components, multi‑center deployment, service discovery methods, pod resource sharing, common CNI plugins, load‑balancing layers, isolation dimensions, network model principles, and IP address classifications.

CNI, IP addressing, container orchestration
0 likes · 14 min read
Alibaba Cloud Developer
Feb 24, 2022 · Cloud Native

Achieving Consistent, Low‑Cost Environments with Declarative IaC and OAM

This article explains how to ensure software delivery stability by maintaining product and runtime environment consistency, explores the three stages of environment management—from documentation to declarative definitions—illustrates Kubernetes sidecar patterns, and discusses the benefits and challenges of Infrastructure‑as‑Code and OAM models.

Environment Consistency, OAM, iac
0 likes · 10 min read
Liangxu Linux
Feb 24, 2022 · Cloud Native

Why Docker exec Fails: Diagnosing runc Errors and Resource Limits

This guide walks through a real‑world Docker exec failure, explains the relationship between kubelet, dockershim, containerd, and runc, shows step‑by‑step commands to isolate the faulty component, and reveals that exhausting the container's pids resource limit caused the runc exec error.

Docker, cgroups, containerd
0 likes · 12 min read
Alibaba Cloud Native
Feb 23, 2022 · Cloud Native

Achieving Zero‑Loss Service Deployment with Alibaba Cloud MSE

This guide explains how Alibaba Cloud's Microservice Engine (MSE) achieves zero‑loss service deployment by using adaptive offline waiting, active notifications, readiness‑aligned startup checks, and traffic warm‑up, providing step‑by‑step Kubernetes manifests, configuration details, and validation results to prevent traffic loss during releases.

MSE, Service Warm‑up, Zero‑Loss Deployment
0 likes · 20 min read
Ops Development Stories
Feb 23, 2022 · Cloud Native

Why Your Kubernetes Pod Can't Reach the Server: DNS Search Domain Pitfalls and Fixes

An agent service running in a Kubernetes pod appeared healthy but failed to receive heartbeats due to DNS resolution errors caused by an unintended 'HOST' search domain, leading to incorrect IP resolution; the article details the investigation, explains Kubernetes DNS mechanics, and shows how adjusting ndots or using fully qualified names resolves the issue.

DNS, Pod, cloud-native
0 likes · 18 min read
IT Architects Alliance
Feb 22, 2022 · Cloud Native

Evolution of Microservice Architecture and Its Essential Technology Stack

This article traces the evolution of software architecture from monolithic to microservice models, explains key concepts such as service mesh, distributed tracing, and container orchestration, and lists the essential tools and frameworks—including Docker, Kubernetes, Ocelot, Consul, and Jenkins—required to build and operate modern cloud‑native microservice systems.

Containerization, Distributed Systems, architecture
0 likes · 17 min read
Efficient Ops
Feb 22, 2022 · Operations

Tackling Cloud‑Native Ops Challenges: Real‑World Practices from NetEase

NetEase’s cloud‑native operations team shares how they confront new challenges of Kubernetes adoption—ranging from technical stack shifts and knowledge‑base gaps to capacity planning, automated diagnostics, monitoring, alerting, and cost‑saving strategies—offering practical insights for building efficient, stable, and scalable ops systems.

automation, cloud-native, cost optimization
0 likes · 22 min read
MaGe Linux Operations
Feb 22, 2022 · Cloud Native

Explore Kubernetes CI/CD Through an Engaging Comic

This article presents a comic series that visually explains Kubernetes fundamentals and demonstrates how to implement continuous integration and continuous delivery pipelines on the platform, offering a concise, illustrated guide for developers and DevOps engineers.

Comic, DevOps, ci/cd
0 likes · 4 min read
Cloud Native Technology Community
Feb 21, 2022 · Industry Insights

What the Sysdig 2022 Cloud‑Native Security Report Reveals About Container Risks

The Sysdig 2022 Cloud‑Native Security and Usage Report shows that a majority of production containers and images contain high‑severity vulnerabilities, many cloud accounts expose S3 buckets, and mis‑configured resource limits lead to significant cost overruns, highlighting urgent security and operational challenges for enterprises adopting cloud‑native technologies.

Industry Insights, Sysdig report, cloud usage
0 likes · 8 min read
Alibaba Cloud Native
Feb 20, 2022 · Cloud Native

What’s Driving the Cloud Native Revolution? Trends, Tech, and Future Directions

This article summarizes a cloud‑native expert’s talk, covering the definition and community origins of cloud native, CNCF’s role, current industry trends such as distributed cloud, AI workloads, and FinOps, core technologies like containers, microservices, and serverless, and Alibaba Cloud’s product ecosystem and open‑source contributions.

AI, FinOps, Serverless
0 likes · 13 min read
MaGe Linux Operations
Feb 18, 2022 · Operations

Master Continuous Profiling on Kubernetes with Pyroscope: A Step‑by‑Step Guide

Learn how to use the open‑source Pyroscope continuous profiler to identify performance bottlenecks in Kubernetes‑deployed microservices, compare it with other profilers, and follow detailed installation, configuration, and integration steps for Python, Go, and .NET services, including Helm deployment and resource monitoring.

Continuous Profiling, Profiling, Pyroscope
0 likes · 11 min read
iQIYI Technical Product Team
Feb 18, 2022 · Cloud Native

CPU Share Syncer: Enabling High‑Priority Task CPU Preemption in iQIYI Video Production Kubernetes Clusters

iQIYI’s cpu‑share‑syncer daemon runs on every node, reads a pod’s iqiyi.com/cpu‑share annotation, updates the pod’s cpu.shares after disabling the Kubernetes CPU CFS quota, and lets high‑priority video‑production pods pre‑empt CPU from lower‑priority pods, significantly speeding task execution.

CPU scheduling, DaemonSet, High priority tasks
0 likes · 13 min read
Architect
Feb 18, 2022 · Cloud Native

Large‑Scale etcd Cluster Performance Optimization and Pod Data Splitting in Ant Group’s Sigma

This article describes how Ant Group tackled the performance ceiling of its massive Sigma Kubernetes clusters by horizontally splitting etcd storage for Pods, Leases and Events, redesigning watch handling to avoid component restarts, and using snapshot‑based migration to preserve data integrity while reducing latency.

Cluster Performance, Data Migration, etcd
0 likes · 27 min read
Meituan Technology Team
Feb 17, 2022 · Cloud Native

Meituan's Cloud‑Native Cluster Scheduling System: Design, Challenges, and Future Directions

Meituan’s cloud‑native cluster scheduling system, built on a customized Kubernetes engine, unifies multi‑cluster management, improves CPU utilization, reduces costs, and enhances stability by balancing throughput, complexity, and reliability while addressing large‑scale deployment, fault‑tolerance, and dynamic resource allocation challenges.

Cluster Scheduling, Meituan, cloud-native
0 likes · 21 min read
TAL Education Technology
Feb 17, 2022 · Fundamentals

Comprehensive Overview of the Web Access Process: From DNS Resolution to Kubernetes Deployment

This article explains the complete web request lifecycle—including DNS lookup, caching mechanisms, HTTP/HTTPS communication, TCP/UDP transport, gateway routing, backend service models, virtualization, Docker containerization, and Kubernetes orchestration—providing engineers with a solid foundation for diagnosing and optimizing modern web applications.

Caching, DNS, Docker
0 likes · 19 min read
IT Architects Alliance
Feb 16, 2022 · Cloud Native

Understanding Microservices: Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes

This article explains what microservices are, how they differ from traditional monolithic applications, provides an Amazon product‑listing example, discusses containers, compares the two, outlines when to adopt microservices, lists their pros and cons, and describes how Kubernetes supports cloud‑native microservice architectures.

Containers, Software Architecture, cloud-native
0 likes · 12 min read
Qingyun Technology Community
Feb 15, 2022 · Cloud Native

Deploy GitLab CE on KubeSphere and Build CI/CD Pipelines – A Complete Guide

This tutorial walks you through installing KubeSphere, deploying GitLab Community Edition via Helm, configuring a minimal GitLab setup, creating a sample NodeJS project, disabling Auto DevOps, writing a Jenkinsfile, and linking GitLab with KubeSphere DevOps to run both scripted and graphical CI/CD pipelines, plus optional SSH access.

GitLab, Jenkins, KubeSphere
0 likes · 18 min read
Ops Development Stories
Feb 15, 2022 · Cloud Native

Mastering Canary Deployments with ingress-nginx: A Step-by-Step Guide

This article explains how to perform rolling, blue‑green, and canary (gray) releases in Kubernetes, focusing on implementing canary deployments with the ingress-nginx controller using specific annotations, demonstrating weight‑based and header‑based scenarios, providing full YAML manifests, CI/CD pipeline designs, and recommendations for automated rollouts.

CI/CD, DevOps, canary deployment
0 likes · 18 min read
Architects Research Society
Feb 14, 2022 · Cloud Native

Microservices After a Decade: Evolution, Anti‑Patterns, Distributed Transactions, and the Role of Kubernetes

The panel discusses how microservices have changed over the past ten years, covering evolving definitions, common anti‑patterns, the challenges of distributed transactions, Kubernetes' influence on architecture, organizational impacts, and future trends such as application‑centric models and GitOps.

DevOps, architecture, cloud-native
0 likes · 25 min read
Alibaba Cloud Native
Feb 11, 2022 · Cloud Native

Step‑by‑Step Guide to Building Full‑Link Gray Release with MSE on Kubernetes

This article explains how to enable full‑link gray release for microservices using Alibaba Cloud MSE, covering the underlying technologies, product features, environment preparation, Kubernetes deployment manifests, lane configuration, Ingress routing, traffic verification, and monitoring, all without modifying application code.

Ingress-nginx, MSE, full-link gray release
0 likes · 17 min read
High Availability Architecture
Feb 11, 2022 · Cloud Native

Cloud‑Native Migration of Tencent Happy Game Studio Backend Using Istio Service Mesh

The article details how Tencent's Happy Game Studio transformed its large‑scale, low‑utilization backend from a legacy distributed architecture to a cloud‑native, Istio‑enabled service‑mesh platform, achieving significant resource savings, smoother deployments, and improved observability across game, CGI, and storage services.

Istio, cloud-native, kubernetes
0 likes · 16 min read
Cloud Native Technology Community
Feb 8, 2022 · Cloud Native

Tracing the Path of Network Traffic in Kubernetes

This article provides a comprehensive guide to Kubernetes networking, covering pod network requirements, Linux network namespaces, the role of the pause container, IP allocation, veth pairs, bridge connections, inter‑pod traffic on same and different nodes, CNI plugins, and how services use iptables and conntrack for traffic routing.

CNI, Linux Namespaces, Pod
0 likes · 26 min read
Alibaba Cloud Native
Feb 8, 2022 · Cloud Native

How Does OCM Placement Dynamically Schedule Multi‑Cluster Workloads?

This article explains how Open Cluster Management's Placement API selects target clusters for workloads in multi‑cluster Kubernetes environments, covering ManagedClusterSet concepts, Placement spec fields, predicate filtering, prioritizer policies, dynamic scoring, and practical YAML examples that demonstrate weight adjustments and best‑practice scenarios.

Dynamic Scheduling, Multi-Cluster, Open Cluster Management
0 likes · 22 min read
IT Architects Alliance
Feb 8, 2022 · Cloud Native

What Are Microservices? Concepts, Benefits, Drawbacks, and Their Relationship with Containers and Kubernetes

This article explains the fundamentals of microservices, illustrates them with an Amazon product‑listing example, compares microservices to containers, outlines when to adopt microservices, discusses their advantages and disadvantages, and describes how Kubernetes supports microservice architectures.

Containers, DevOps, cloud-native
0 likes · 12 min read
MaGe Linux Operations
Feb 7, 2022 · Cloud Native

Why K8ssandra Is Switching from Helm to Its Own Operator

The article explains how K8ssandra, an Apache Cassandra distribution for Kubernetes, evolved from using Helm charts to developing a dedicated Operator to overcome Helm's limitations, improve multi‑cluster support, and align more closely with Kubernetes best practices.

Go, K8ssandra, Operator
0 likes · 13 min read
MaGe Linux Operations
Jan 30, 2022 · Cloud Native

Upgrade a Kubernetes Cluster from v1.22 to v1.23 the Hard Way

This step‑by‑step tutorial explains how to upgrade a Kubernetes cluster from version 1.22 to 1.23 using the hard‑way approach, covering prerequisites, master and worker node procedures, package handling, and verification commands to ensure a successful upgrade.

Cluster Upgrade, DevOps, Hard Way
0 likes · 8 min read
MaGe Linux Operations
Jan 28, 2022 · Cloud Native

Top 7 Kubernetes Management Tools to Simplify Cluster Operations

Discover the most popular Kubernetes management solutions—including K9s, Rancher, Dashboard, Helm, Kubespray, Lens, and WKSctl—detailing their features, deployment options, and how they streamline cluster monitoring, scaling, and security for cloud-native environments and improve operational efficiency.

Cluster Management, DevOps, cloud-native
0 likes · 9 min read
Baidu Tech Salon
Jan 27, 2022 · Cloud Native

How China Unicom’s Service Mesh Evolved: From SDKs to Sidecars and Beyond

This article details China Unicom Software Research Institute's multi‑year journey of adopting Kubernetes‑based service mesh, outlining the evolution from SDK‑driven microservices to sidecar‑based architectures, migration strategies with Baidu, performance optimizations, observability enhancements, and future product roadmaps.

Istio, Observability, Performance Optimization
0 likes · 13 min read
MaGe Linux Operations
Jan 26, 2022 · Cloud Native

Unlock Kubernetes Essentials: Pods, Services, Deployments, and Beyond

This article introduces Kubernetes—Google's open‑source container orchestration platform—detailing its core concepts such as Pods, Namespaces, Nodes, Services, Volumes, PersistentVolumes, Deployments, StatefulSets, DaemonSets, Ingress, Jobs, HPA, ServiceAccounts, Secrets, ConfigMaps, and ResourceQuotas, providing practical commands and usage notes for each component.

DevOps, cloud-native, container orchestration
0 likes · 18 min read
Rare Earth Juejin Tech Community
Jan 26, 2022 · Cloud Native

2021 Technology Year-End Review: Serverless, Kubernetes, Frontend, Service Mesh, Rust, Databases, Software 2.0, and Audio/Video Trends

The 2021 year‑end review highlights the rapid rise of Serverless, the accelerated release cadence and security focus of Kubernetes, the deepening of frontend low‑code development, the practical expansion of service mesh, the growing Rust ecosystem, cloud‑native database evolution, the maturation of Software 2.0, and emerging audio‑video innovations driven by the metaverse.

Databases, Rust, Software 2.0
0 likes · 14 min read
Cloud Native Technology Community
Jan 25, 2022 · Cloud Native

Technical Overview of Kube-OVN Deployment for Hybrid VM and Container Environments at ByteDance

This article details ByteDance's technical evaluation and implementation of Kube-OVN as a CNI solution for hybrid virtual‑machine and container workloads, covering selection criteria, the initial network design, identified issues, subsequent optimizations, and future roadmap considerations within a cloud‑native Kubernetes environment.

CNI, Kube-OVN, OVS
0 likes · 7 min read
DataFunTalk
Jan 25, 2022 · Cloud Native

Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution

This article analyzes the complexities of deploying machine‑learning models in production, outlines the limitations of the existing ABox architecture, and details a comprehensive cloud‑native redesign using Seldon on Kubernetes—including custom HDFS initializers, GPU management, logging, and resource monitoring—to streamline operations and enable unified CPU/GPU model serving.

GPU, MLOps, Seldon
0 likes · 12 min read
Java High-Performance Architecture
Jan 25, 2022 · Cloud Native

Why Is Debugging Microservices on Kubernetes So Hard? Proven Strategies to Overcome It

Debugging microservices in a Kubernetes environment is challenging due to the abstraction of pods, network complexities, infrastructure issues, and application-level faults, but by monitoring at the service layer, aggregating data, and applying machine‑learning‑based anomaly detection, teams can effectively identify and resolve problems.

Troubleshooting, kubernetes, machine learning
0 likes · 6 min read
Cloud Native Technology Community
Jan 24, 2022 · Cloud Native

Building a Unified Cross‑Cluster Network Plane with Kube‑OVN for TiDB Deployment

The article explains how Kube‑OVN can create a flexible, efficient, and secure cross‑Kubernetes‑cluster network plane to meet user demands such as high availability, multi‑cloud separation, disaster‑recovery, and hybrid‑cloud compliance, and demonstrates its use with TiDB, MySQL, and WordPress workloads.

Cross-Cluster Networking, Kube-OVN, Overlay Network
0 likes · 8 min read
Ops Development Stories
Jan 24, 2022 · Cloud Native

Deploy and Configure vmagent on Kubernetes for Efficient Metrics

This guide explains what vmagent is, its key features, and provides step‑by‑step instructions to install, configure, and verify vmagent on a Kubernetes cluster, including namespace and RBAC setup, custom scrape configs, monitoring endpoints, and troubleshooting tips.

Metrics, VictoriaMetrics, kubernetes
0 likes · 15 min read
Open Source Linux
Jan 24, 2022 · Cloud Native

From LXC to Kubernetes: The Evolution and Core Mechanics of Containers

This article traces the history of container technology from early Linux Containers (LXC) through Docker’s rise, the emergence of Kubernetes, and explains core concepts such as cgroups, namespaces, OCI standards, and how containers appear as isolated processes from both host and container perspectives.

Containers, Docker, Linux Namespaces
0 likes · 12 min read
MaGe Linux Operations
Jan 22, 2022 · Cloud Native

Boost Kubernetes Monitoring: Migrate from Prometheus to Thanos for Scalable Low‑Cost Metrics

This article examines the limitations of a standard Prometheus‑based monitoring stack on Kubernetes, explains how adopting Thanos improves metric retention and reduces infrastructure costs, and provides a detailed multi‑cluster deployment guide with Terraform, TLS configuration, and Grafana visualization.

Observability, Prometheus, Thanos
0 likes · 16 min read
Alibaba Cloud Native
Jan 22, 2022 · Cloud Native

Why OpenKruise v1.0 Is a Game-Changer for Cloud‑Native Workloads

OpenKruise v1.0, the CNCF‑hosted Kubernetes extension, expands beyond basic workloads with enhanced Workload types, in‑place upgrades, high‑availability safeguards, and advanced operational tools, drawing on Alibaba’s massive container experience; it is now used by more than 35 adopters worldwide.

In‑Place Upgrade, OpenKruise, Workload Management
0 likes · 12 min read
Qingyun Technology Community
Jan 21, 2022 · Cloud Native

How to Build a Small‑Scale KubeSphere Kubernetes Cluster: A Step‑by‑Step Guide

This guide walks you through planning, deploying, and configuring a production‑grade KubeSphere‑based Kubernetes cluster for small environments, covering node layout, storage choices, middleware setup, OS and Docker preparation, HAProxy/Keepalived high‑availability, and KubeKey installation with verification steps.

Docker, HAProxy, KubeSphere
0 likes · 23 min read
DevOps Cloud Academy
Jan 20, 2022 · Cloud Native

Comparative Overview of Terraform and Helm for Kubernetes Management

This article examines the core functionalities of Terraform and Helm, highlighting how each tool manages Kubernetes clusters, automates infrastructure provisioning, and supports DevOps workflows, helping teams decide which solution best fits their cloud‑native development needs.

DevOps, cloud-native, helm
0 likes · 8 min read
IT Architects Alliance
Jan 20, 2022 · Cloud Native

How to Build a High‑Availability Microservices System on Kubernetes – A Complete Guide

This guide walks you through designing a simple microservice architecture with separate front end and back end, implementing it with Java Spring Boot, deploying multiple instances with Eureka, adding Prometheus‑Grafana monitoring, logging, tracing, flow control, and finally installing Kubernetes using K8seasy and verifying high‑availability across the cluster.

Java, Spring Boot, cloud-native
0 likes · 19 min read
DataFunTalk
Jan 19, 2022 · Artificial Intelligence

Alluxio for AI and Machine Learning: Architecture, Optimizations, and Performance Evaluation

This article presents a comprehensive technical overview of Alluxio, covering its role as a distributed data orchestration layer for AI workloads, core features such as caching and unified namespace, performance challenges in large‑scale machine‑learning pipelines, and the extensive optimizations and testing performed at Tencent to achieve high throughput and scalability.

AI, Alluxio, CephFS
0 likes · 23 min read
Alibaba Cloud Native
Jan 18, 2022 · Cloud Native

How Alibaba Cloud’s Differential SLO Boosts Kubernetes Resource Utilization

This article explains Alibaba Cloud Container Service for Kubernetes's differential SLO approach, detailing the reclaimed‑resource model, CPU burst and topology‑aware scheduling, kernel group identity, memory watermark tiering, and real‑world case studies that demonstrate significant improvements in cluster efficiency and latency‑sensitive workload performance.

ACK, Alibaba Cloud, CPU Burst
0 likes · 16 min read
Liulishuo Tech Team
Jan 18, 2022 · Cloud Native

Design and Implementation of an ArgoCD‑Based Continuous Delivery System at Liulishuo

This article describes how Liulishuo replaced its custom CD platform with the open‑source ArgoCD solution, detailing the motivations, architecture components, design concepts, deployment workflows, code snippets, advantages, and future plans for automated gray‑release in a cloud‑native Kubernetes environment.

ArgoCD, Continuous Delivery, GitOps
0 likes · 11 min read
Youzan Coder
Jan 17, 2022 · Artificial Intelligence

Model Deployment Challenges and a Seldon‑Based Cloud‑Native Solution

The team replaced the cumbersome ABox deployment stack with Seldon‑based cloud‑native serving on Kubernetes, unifying TensorFlow and other framework models, adding GPU sharing, automated CRUD, per‑model ingress, monitoring, and log collection, achieving scalable, fault‑tolerant, zero‑downtime model deployment.

AI serving, GPU, Model Deployment
0 likes · 11 min read
Architect
Jan 17, 2022 · Cloud Native

Introducing NGINX Service Mesh: Features, Architecture, and Getting Started

This article introduces NGINX Service Mesh (NSM), a lightweight, highly integrated service‑mesh solution for Kubernetes that leverages NGINX Plus as a data plane, explains its security, traffic‑management, visualization, and hybrid‑deployment capabilities, outlines its architecture and components, and provides step‑by‑step commands for installation, verification, and sidecar injection.

NGINX Service Mesh, Service Mesh, kubernetes
0 likes · 10 min read
TAL Education Technology
Jan 13, 2022 · Cloud Native

Offline Mixed Deployment with Kubernetes: Architecture, Implementation, and Performance Evaluation for Big Data Workloads

This article describes a cloud‑native offline mixed‑deployment solution that leverages Kubernetes to share resources between big‑data clusters and business services, outlines its implementation steps, presents detailed performance comparisons between YARN and Kubernetes using TPC‑DS, Spark, and Terasort workloads, and discusses production experience and future plans.

Big Data, YARN, cloud-native
0 likes · 8 min read
Top Architect
Jan 13, 2022 · Backend Development

Microservice Architecture Roadmap: Core Components and Recommended Tools

This article presents a comprehensive roadmap for adopting microservice architecture, explaining why it is chosen, outlining essential concerns such as Docker, container orchestration, API gateways, load balancing, service discovery, event buses, logging, monitoring, tracing, data persistence, caching, and cloud providers, and recommending popular tools for each component.

Docker · api-gateway · kubernetes
0 likes · 16 min read
HomeTech
Jan 13, 2022 · Cloud Native

AutoKH: A Mixed‑Workload Resource Management Solution on Kubernetes and Hadoop

AutoKH is a cloud‑native mixed‑workload framework that integrates Kubernetes and Hadoop to dynamically schedule online and offline tasks, improve CPU and memory utilization, enforce priority classes, and ensure service stability through operators, CronHPA, and resource‑control components.

CPU Manager · Hadoop · Mixed Workload
0 likes · 19 min read
Alibaba Cloud Native
Jan 12, 2022 · Cloud Native

Achieving Low‑Cost Full‑Link Gray Release with Alibaba Cloud MSE: Laidian Tech’s Journey

Laidian Technology leveraged Alibaba Cloud's Microservices Engine (MSE) to implement low‑cost, non‑intrusive full‑link gray release, improving system stability, accelerating feature rollout, and showcasing the benefits of containerization, Kubernetes integration, and comprehensive service governance in a fast‑growing microservice environment.

Gray Release · MSE · cloud-native
0 likes · 15 min read
Cloud Native Technology Community
Jan 12, 2022 · Cloud Native

Choosing Cloud‑Native Persistent Storage Solutions for Financial Container Platforms

The article examines how banks can select and implement cloud‑native persistent storage on Kubernetes‑based container platforms, covering storage types, CSI integration, DBaaS options, data consistency challenges, and strategies for high‑concurrency fault recovery in the financial sector.

DBaaS · Data Consistency · Financial Services
0 likes · 8 min read
Ops Development Stories
Jan 12, 2022 · Cloud Native

Deploy Kubernetes Inside Kubernetes: Mastering Kubefarm and PXE Boot

This article explains how to use Helm to deploy a full Kubernetes cluster within another Kubernetes cluster, detailing the Kubefarm project, PXE‑bootable server farms, Kubernetes‑in‑Kubernetes architecture, node provisioning via PXE, certificate management with cert‑manager, and advanced networking and webhook integration.

Kubefarm · PXE boot · cloud-native
0 likes · 17 min read
Java High-Performance Architecture
Jan 12, 2022 · Cloud Native

Mastering Service Mesh with Istio: A Hands‑On Guide to Traffic, Security, and Observability

This tutorial explains the fundamentals of service mesh, explores Istio’s architecture and core components, and provides step‑by‑step instructions for installing Istio on Kubernetes, deploying a sample microservice application, and leveraging traffic management, mutual TLS, observability, and advanced use cases such as routing, circuit breaking, and JWT‑based access control.

Istio · Observability · Service Mesh
0 likes · 22 min read
DevOps Cloud Academy
Jan 12, 2022 · Cloud Native

Deploying Apache APISIX as a Kubernetes Ingress Controller with Helm

This article introduces Apache APISIX, a high‑performance API gateway built on OpenResty and etcd, explains its architecture, demonstrates how to install it as a Kubernetes Ingress controller using Helm charts, and shows configuration, routing, and dashboard usage with example manifests and commands.

APISIX · api-gateway · cloud-native
0 likes · 11 min read
ITPUB
Jan 11, 2022 · Cloud Native

How Ctrip Achieved High Availability for Service Mesh Across Multiple Data Centers

Facing large‑scale production demands, Ctrip’s Cloud Container team designed a multi‑data‑center Service Mesh architecture that isolates failures, employs dual‑active IDC, multi‑cluster deployments, and optimized xDS push mechanisms, ensuring high availability, rapid fault recovery, and efficient rollout across hundreds of services.

Istio · Multi-Cluster · Service Mesh
0 likes · 18 min read
Yiche Technology
Jan 11, 2022 · Databases

Elasticsearch Overview, Comparison, Maintenance Challenges, Deployment Strategies, and Automation Management Platform

This document provides a comprehensive technical overview of Elasticsearch, comparing it with Solr and ClickHouse, detailing common operational pain points and configuration solutions, describing containerized and ECK deployments, and outlining a company‑wide automation platform for cluster provisioning, monitoring, index and security management, with future directions for lifecycle and backup strategies.

Cluster Management · automation · kubernetes
0 likes · 31 min read
Open Source Linux
Jan 11, 2022 · Cloud Native

Why Bare‑Metal Kubernetes Beats Virtual Machines: 6 Compelling Advantages

This article explains why deploying Kubernetes on bare‑metal servers offers six key benefits—simpler networking, better cost efficiency for demanding workloads, superior performance, stronger security, workload‑specific optimization, and freedom from vendor lock‑in—making it a smarter choice than virtual‑machine‑based solutions.

Bare Metal · Scalability · cloud-native
0 likes · 7 min read
Laravel Tech Community
Jan 11, 2022 · Cloud Native

Alibaba Sentinel: Open‑Source Cloud‑Native High‑Availability Protection for Distributed Services (Sentinel Go v1.0.4 Release)

The article introduces Alibaba's open‑source Sentinel, a cloud‑native high‑availability protection component for distributed microservices that offers flow control, circuit breaking, concurrency limiting and adaptive overload protection, and highlights the new Sentinel Go v1.0.4 features such as Prometheus metrics, enhanced circuit‑breaker recovery, Apollo data source, and updated Kubernetes CRD support for version 1.22+.

Circuit Breaking · Flow Control · Go
0 likes · 2 min read
58 Tech
Jan 10, 2022 · Artificial Intelligence

Resource Utilization Optimization Practices for the 58.com Machine Learning Platform (WPAI)

This article details the 58.com WPAI machine learning platform's architecture and the optimizations applied to training task scheduling, inference service elastic scaling, and offline‑online resource mixing, demonstrating how these techniques significantly improve GPU/CPU utilization and inference performance across both GPU and CPU environments.

AI · Inference Acceleration · elastic scaling
0 likes · 27 min read
DevOps Cloud Academy
Jan 7, 2022 · Cloud Native

Understanding Containerization in DevOps: Benefits, Pipelines, and Orchestration

This article explains how containerization integrates with DevOps, detailing the advantages of container‑based delivery pipelines, the differences between traditional and containerized workflows, and the role of orchestration platforms like Kubernetes in creating efficient, portable, and scalable software deployments.

Containerization · DevOps · Orchestration
0 likes · 10 min read
Open Source Linux
Jan 6, 2022 · Cloud Native

Choosing the Right CNI Plugin for Kubernetes: A Practical Guide

This article explains what the Container Network Interface (CNI) is, how Kubernetes uses CNI plugins, and provides a detailed comparison of Overlay, Routing, and Underlay plugins based on environment constraints, functional needs, and performance considerations.

CNI · Network Plugins · cloud-native
0 likes · 8 min read