Tagged articles
4063 articles
Page 13 of 41
Alibaba Cloud Native
Alibaba Cloud Native
Mar 18, 2024 · Cloud Native

Implement Full‑Chain Gray Release with Alibaba Cloud MSE and AppStack

This guide walks you through using Alibaba Cloud MSE microservice engine and the Cloud Efficient AppStack platform to set up a gray‑release environment for Spring Cloud applications, covering prerequisite setup, baseline deployment, gray lane creation, pipeline integration, verification, and cleanup.

MSEkubernetesspring-cloud
0 likes · 11 min read
Implement Full‑Chain Gray Release with Alibaba Cloud MSE and AppStack
MaGe Linux Operations
MaGe Linux Operations
Mar 18, 2024 · Cloud Native

Is Your Kubernetes Setup Secure? A Complete Best‑Practice Checklist

This article provides a thorough checklist covering application deployment, service governance, and cluster configuration in Kubernetes, including health probes, graceful shutdown, fault tolerance, resource limits, labeling, logging, scaling, RBAC, network policies, and compliance with CIS benchmarks.

Best PracticesOperationscloud-native
0 likes · 27 min read
Is Your Kubernetes Setup Secure? A Complete Best‑Practice Checklist
Ops Development Stories
Ops Development Stories
Mar 18, 2024 · Cloud Native

13 Essential Kubernetes Tips to Boost Scalability, Security, and Management

Discover 13 practical Kubernetes techniques—including PreStop hooks, automatic secret rotation, ephemeral containers, custom metric autoscaling, init containers, node affinity, taints and tolerations, pod priority, ConfigMaps, debugging tools, resource requests, CRDs, and API automation—to enhance application reliability, scalability, and security in cloud‑native environments.

Scalingkubernetespod management
0 likes · 21 min read
13 Essential Kubernetes Tips to Boost Scalability, Security, and Management
MaGe Linux Operations
MaGe Linux Operations
Mar 16, 2024 · Cloud Native

Scaling Non‑CPU‑Bound Apps with HPA Using cAdvisor Network Metrics

This guide shows how to enable Horizontal Pod Autoscaling for traffic‑driven workloads by leveraging cAdvisor's container network receive and transmit byte counters, converting them to per‑second rates with Prometheus‑adapter, and validating the custom metric through Kubernetes commands and console views.

PrometheuscAdvisorcloud-native
0 likes · 7 min read
Scaling Non‑CPU‑Bound Apps with HPA Using cAdvisor Network Metrics
21CTO
21CTO
Mar 15, 2024 · Cloud Native

Can a Database Become the Next Cloud‑Native Operating System? Inside DBOS

DBOS, a database‑driven cloud‑native operating system founded by Michael Stonebraker and Matei Zaharia, aims to replace traditional OS state management with SQL‑based databases, offering enhanced security, rapid ransomware recovery, and seamless integration with Kubernetes and Firecracker hypervisors.

DBOSDatabase Operating SystemFirecracker
0 likes · 7 min read
Can a Database Become the Next Cloud‑Native Operating System? Inside DBOS
58 Tech
58 Tech
Mar 15, 2024 · Cloud Native

Design and Implementation of a Service Mesh Architecture for HTTP Traffic Governance

This document details the motivation, technology selection, overall architecture, data‑plane and control‑plane design, key implementation challenges such as HTTPS interception, precise URL monitoring, and plugin extensibility, and outlines future directions for scaling service mesh within the organization.

Control PlaneData PlaneHTTP traffic
0 likes · 17 min read
Design and Implementation of a Service Mesh Architecture for HTTP Traffic Governance
Practical DevOps Architecture
Practical DevOps Architecture
Mar 15, 2024 · Operations

Comprehensive Practical Guide to Prometheus Configuration, Optimization, and Source Code Development

This multi‑chapter guide provides in‑depth, hands‑on instruction for configuring and optimizing all Prometheus components, exploring Kubernetes monitoring, source‑code analysis, custom exporter development, high‑availability setups, service discovery, resource‑efficient scraping, and integrating Thanos for long‑term storage.

ObservabilityOperationsPrometheus
0 likes · 4 min read
Comprehensive Practical Guide to Prometheus Configuration, Optimization, and Source Code Development
Liangxu Linux
Liangxu Linux
Mar 14, 2024 · Cloud Native

From chroot to Kubernetes: The 40‑Year Evolution of Containerization

This article traces the history of container technology—from the 1979 chroot command and Linux namespaces to cgroups, LXC, Docker, Kubernetes, and modern cloud‑native services like Huawei CCE—explaining each milestone’s purpose, advantages, and impact on software deployment.

ContainerizationLXCNamespaces
0 likes · 10 min read
From chroot to Kubernetes: The 40‑Year Evolution of Containerization
Architect
Architect
Mar 14, 2024 · Cloud Native

Understanding Kubernetes: Architecture, Core Components, and Deployment Workflow

This article explains how Kubernetes serves as a cloud‑native middle layer that abstracts servers and application services, detailing its control‑plane and node components, the role of pods, containers, and the kubectl tool, and walks through a typical service deployment and request flow.

cloud-nativecontainer orchestrationdeployment
0 likes · 10 min read
Understanding Kubernetes: Architecture, Core Components, and Deployment Workflow
DevOps
DevOps
Mar 13, 2024 · Cloud Native

Integrating CMDB with Cloud‑Native Container Workloads Using Init Containers, Hooks, and Event Monitoring

The article examines how CMDB can be combined with cloud‑native container architectures by leveraging Kubernetes init containers, lifecycle hooks (PostStart and PreStop), and an event‑monitoring‑to‑MQ pipeline, discussing the technical rationale, implementation details, advantages, and challenges of each approach.

CMDBCloudNativeEventMonitoring
0 likes · 9 min read
Integrating CMDB with Cloud‑Native Container Workloads Using Init Containers, Hooks, and Event Monitoring
Liangxu Linux
Liangxu Linux
Mar 13, 2024 · Cloud Native

From chroot to Kubernetes: How Containerization Evolved Over Decades

Tracing the evolution of container technology—from the 1979 Unix chroot command, through Linux namespaces and cgroups, to LXC, Docker, and Kubernetes—this article explains each milestone’s role in isolation, resource control, and cloud-native orchestration, highlighting the shift toward managed cloud container services.

ContainerizationLinuxcloud-native
0 likes · 10 min read
From chroot to Kubernetes: How Containerization Evolved Over Decades
Bilibili Tech
Bilibili Tech
Mar 12, 2024 · Cloud Native

Containerizing Elasticsearch and ClickHouse on Kubernetes: Architecture, Implementation, and Benefits

Bilibili migrated its Elasticsearch and ClickHouse clusters to Kubernetes using custom operators, CRDs, LVM‑based local storage, MacVLAN networking, and pod anti‑affinity, achieving higher resource utilization, isolation, and automation that cut server count, reduced latency spikes, and lowered operational costs dramatically.

ClickHouseElasticsearchLVM
0 likes · 38 min read
Containerizing Elasticsearch and ClickHouse on Kubernetes: Architecture, Implementation, and Benefits
ITPUB
ITPUB
Mar 11, 2024 · Cloud Computing

What 4 Years of Startup Infrastructure Taught Me: AWS, Terraform, GitOps & More

After four years running infrastructure at a fast‑growing startup, the author reviews almost every major decision—from choosing AWS over GCP and adopting EKS, RDS, and Redis, to automating post‑mortems with Slack bots, standardising IaC with Terraform and GitOps, and evaluating SaaS tools like DataDog, PagerDuty, and Notion—highlighting the benefits, regrets, and practical lessons learned.

Cloud ComputingDevOpsaws
0 likes · 22 min read
What 4 Years of Startup Infrastructure Taught Me: AWS, Terraform, GitOps & More
Open Source Linux
Open Source Linux
Mar 11, 2024 · Big Data

Step‑by‑Step Guide to Deploying Flink on Standalone, Yarn, and Kubernetes

This tutorial explains how to install and configure Apache Flink in three deployment modes—Standalone, Hadoop YARN, and Kubernetes—covering node preparation, configuration files, package distribution, job submission, and monitoring through the Flink Web UI, with full command‑line examples and code snippets.

Big DataFlinkStandalone
0 likes · 12 min read
Step‑by‑Step Guide to Deploying Flink on Standalone, Yarn, and Kubernetes
dbaplus Community
dbaplus Community
Mar 10, 2024 · Cloud Native

How We Built Elastic Scaling and Hybrid‑Cloud Auto‑Scaling on Kubernetes

After fully containerizing their platform, the team tackled front‑line development scaling challenges by designing a custom elastic‑scaling solution that combines dual‑threshold and timed scaling, integrates hybrid‑cloud ClusterAutoScale, consolidates middleware resources, and implements a comprehensive K8s observability stack, delivering over 30% additional compute capacity and near‑perfect scaling reliability.

Auto Scalingcloud-nativehybrid cloud
0 likes · 12 min read
How We Built Elastic Scaling and Hybrid‑Cloud Auto‑Scaling on Kubernetes
Ops Development Stories
Ops Development Stories
Mar 8, 2024 · Cloud Native

How to Deploy and Test a Multi‑Cluster Istio Service Mesh with Kind and MetalLB

This guide explains why multi‑cluster deployments are needed for high‑availability, describes Istio's flat and non‑flat network models with single or multiple control planes, and provides step‑by‑step scripts to create Kind clusters, install MetalLB, configure root CAs, deploy Istio, set up gateways, and verify regional load balancing and failover.

IstioMetalLBMulti-Cluster
0 likes · 29 min read
How to Deploy and Test a Multi‑Cluster Istio Service Mesh with Kind and MetalLB
360 Quality & Efficiency
360 Quality & Efficiency
Mar 8, 2024 · Cloud Native

Understanding maxUnavailable and maxSurge in Kubernetes Rolling Updates

This article explains the roles of maxUnavailable and maxSurge in Kubernetes rolling updates, demonstrates their impact on update speed and service stability through multiple practical cases, and provides best‑practice guidelines for configuring these parameters to achieve smooth, reliable deployments.

Best Practicesdeploymentkubernetes
0 likes · 12 min read
Understanding maxUnavailable and maxSurge in Kubernetes Rolling Updates
Architect
Architect
Mar 7, 2024 · Cloud Native

Graceful Shutdown in Kubernetes: Concepts, Case Studies, and Optimizations

This article explains the concept of graceful shutdown, outlines the standard steps, and presents detailed Kubernetes, Spring Boot, and Nacos case studies, followed by optimization techniques, code examples, and practical recommendations for handling MQ, scheduled tasks, and traffic control during service termination.

Graceful ShutdownNacosSpring Boot
0 likes · 12 min read
Graceful Shutdown in Kubernetes: Concepts, Case Studies, and Optimizations
Ops Development & AI Practice
Ops Development & AI Practice
Mar 5, 2024 · Cloud Computing

Build a Cost‑Effective Kubernetes Dev Environment with GitHub Codespaces, DinD & KinD

This guide shows how to combine GitHub Codespaces, Docker‑in‑Docker, and KinD to create a fully functional, cloud‑based Kubernetes development environment that eliminates local setup, improves accessibility, and reduces costs, while providing step‑by‑step instructions and essential commands for deployment and testing.

DevOpsDocker-in-DockerGitHub Codespaces
0 likes · 6 min read
Build a Cost‑Effective Kubernetes Dev Environment with GitHub Codespaces, DinD & KinD
MaGe Linux Operations
MaGe Linux Operations
Mar 5, 2024 · Cloud Native

Master Kubernetes Scheduling: 15 Real-World Scenarios & Configurations

This guide explores fifteen practical Kubernetes scheduling scenarios—from basic node selectors to custom schedulers and pod priority—providing detailed YAML configurations, code snippets, and best‑practice recommendations to help you optimize resource utilization, high availability, and workload placement across your cluster.

NodeSelectorPodAffinityYAML
0 likes · 12 min read
Master Kubernetes Scheduling: 15 Real-World Scenarios & Configurations
MaGe Linux Operations
MaGe Linux Operations
Mar 5, 2024 · Cloud Native

How to Run GPU‑Accelerated AI Workloads on Kubernetes

This article explains how Kubernetes supports GPU workloads for AI and machine learning, covering device plugins, pod GPU requests, oversubscription, security isolation, cloud‑provider node setup, and protecting GPU nodes from non‑GPU pods.

AI workloadsDevice PluginGPU
0 likes · 8 min read
How to Run GPU‑Accelerated AI Workloads on Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Mar 2, 2024 · Operations

How to Diagnose and Fix Constant Kubernetes Pod Restarts (OOM)

When a Kubernetes pod repeatedly restarts, you can pinpoint the cause by inspecting events, describing the pod, and checking previous logs, then identify OOM kills caused by memory limits in the deployment.yaml and resolve it by increasing the memory limit and redeploying the pod.

Memory LimitsOOMPod Restart
0 likes · 3 min read
How to Diagnose and Fix Constant Kubernetes Pod Restarts (OOM)
MaGe Linux Operations
MaGe Linux Operations
Feb 29, 2024 · Operations

Quickly Set Up OpenTelemetry on Kubernetes: Installation, Modes & Config

This guide walks you through deploying OpenTelemetry in Kubernetes, covering the purpose of otel‑collector, installation via manifests or Helm, the three deployment patterns (No‑Collector, Agent, Gateway), running the otel‑demo, and detailed configuration of receivers, processors, exporters, connectors, extensions, and service pipelines.

CollectorObservabilityOpenTelemetry
0 likes · 11 min read
Quickly Set Up OpenTelemetry on Kubernetes: Installation, Modes & Config
Architect
Architect
Feb 28, 2024 · Cloud Native

Lightweight Kubernetes Log Collection with Loki: Deployment and Configuration Guide

This article provides a comprehensive, lightweight solution for collecting Kubernetes logs using Grafana Loki, covering its advantages, component comparison, deployment modes (All‑In‑One, microservices, bare‑metal), required configuration files, ConfigMap and PersistentVolume setup, Promtail installation, Helm deployment, and common troubleshooting steps.

LokiPromtailhelm
0 likes · 20 min read
Lightweight Kubernetes Log Collection with Loki: Deployment and Configuration Guide
vivo Internet Technology
vivo Internet Technology
Feb 28, 2024 · Cloud Native

vivo's Online-Offline Co-location Technology Practice: Data Center Resource Optimization

Vivo’s online‑offline co‑location platform consolidates latency‑sensitive online services and batch offline workloads on shared Kubernetes nodes, using differentiated resource views, priority‑based QoS, and safety watermarks to boost CPU utilization from 13 % to 25 %, adding 20 000 cores and 50 TB memory for peak‑hour offline tasks.

DevOpsResource IsolationSpark on K8S
0 likes · 17 min read
vivo's Online-Offline Co-location Technology Practice: Data Center Resource Optimization
Alibaba Cloud Native
Alibaba Cloud Native
Feb 28, 2024 · Cloud Native

Building a Unified Cloud‑Native Serverless Platform Across Public Cloud and IDC with ACK One & Knative

This guide explains how to design and implement a unified cloud‑native serverless platform that runs seamlessly on public clouds and on‑premise IDC clusters using Alibaba Cloud ACK One, Kubernetes, and Knative, covering architecture, key components, deployment steps, and best‑practice recommendations.

ACK OneKnativeLangChain
0 likes · 11 min read
Building a Unified Cloud‑Native Serverless Platform Across Public Cloud and IDC with ACK One & Knative
Aikesheng Open Source Community
Aikesheng Open Source Community
Feb 27, 2024 · Cloud Native

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

This article provides a comprehensive technical overview of LiteIO, describing its core and CSI components, the complete volume lifecycle within Kubernetes, Disk‑Agent responsibilities, common implementation pitfalls, storage‑pool construction methods, and the design of the node‑disk‑controller, scheduler, and CSI modules.

CSIDistributed SystemsLiteIO
0 likes · 13 min read
Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle
dbaplus Community
dbaplus Community
Feb 26, 2024 · Cloud Native

10 Hard‑Earned Lessons from 3 Years Managing Kubernetes Clusters

After three years of hands‑on Kubernetes administration, the author shares ten practical lessons covering cloud‑hosted clusters, infrastructure‑as‑code, Helm chart usage, service mesh decisions, tool selection, resource limits, stateless design, HPA configuration, and upgrade strategies to help both newcomers and seasoned engineers manage clusters effectively.

Best PracticesCluster Managementcloud-native
0 likes · 7 min read
10 Hard‑Earned Lessons from 3 Years Managing Kubernetes Clusters
AntData
AntData
Feb 22, 2024 · Cloud Native

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

This article provides a comprehensive technical overview of LiteIO, describing its core and CSI components, their interactions, the complete volume lifecycle within Kubernetes, common implementation pitfalls, and configuration examples for storage pools and agents.

CSILiteIOVolume Lifecycle
0 likes · 14 min read
Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle
DevOps Cloud Academy
DevOps Cloud Academy
Feb 22, 2024 · Cloud Native

Blue‑Green Deployment with Host and Path‑Based Routing in Kubernetes

This guide explains how to implement a blue‑green deployment on Kubernetes with host‑ and path‑based routing, covering prerequisites, namespace creation, deployment manifests, service and ingress configuration, traffic switching, updates, verification, and rollback procedures.

Blue‑Green deploymentDevOpscloud-native
0 likes · 8 min read
Blue‑Green Deployment with Host and Path‑Based Routing in Kubernetes
DataFunTalk
DataFunTalk
Feb 22, 2024 · Big Data

Flink on Kubernetes: Kuaishou’s Practice, Migration, and Future Refactoring

This article details Kuaishou’s five‑year evolution of Flink, covering its background, production refactoring to Kubernetes, migration practices, and future improvements, highlighting architecture layers, resource management, observability, and testing strategies for large‑scale stream processing.

Big DataFlinkMigration
0 likes · 12 min read
Flink on Kubernetes: Kuaishou’s Practice, Migration, and Future Refactoring
Liangxu Linux
Liangxu Linux
Feb 21, 2024 · Cloud Native

Why a Default Kubernetes Setting Can Spike CPU Usage and How to Fix It

A Node.js service migrated to containers began experiencing intermittent timeouts and high CPU usage due to the default enableServiceLinks parameter injecting thousands of environment variables, and the analysis shows how to identify, reproduce, and resolve the issue with Kubernetes configuration and code adjustments.

ContainerDebuggingNode.js
0 likes · 14 min read
Why a Default Kubernetes Setting Can Spike CPU Usage and How to Fix It
Alibaba Cloud Native
Alibaba Cloud Native
Feb 21, 2024 · Cloud Native

How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment

This article examines the challenges of data‑intensive AI training on heterogeneous cloud‑native infrastructure and explains how the Fluid framework combined with JindoCache and KubeDL provides distributed caching, metadata acceleration, and seamless POSIX access to dramatically improve I/O performance, GPU utilization, and cost efficiency.

AI trainingData CachingFluid
0 likes · 18 min read
How Fluid & JindoCache Accelerate Large‑Scale AI Training in a Cloud‑Native Environment
JD Retail Technology
JD Retail Technology
Feb 20, 2024 · Operations

Measuring Operations Automation Rate and Building a Self‑Coding Automation Platform

This article explains the challenges of manual operations, defines an automation‑rate metric, introduces the Tai‑Shan Kirin platform for self‑coded operational automation, provides step‑by‑step implementation guidance with code examples, and shares a case study demonstrating significant efficiency and stability gains.

Automation MetricsCRDOperations Automation
0 likes · 19 min read
Measuring Operations Automation Rate and Building a Self‑Coding Automation Platform
Liangxu Linux
Liangxu Linux
Feb 19, 2024 · Cloud Native

How CoreDNS and kubelet Configure /etc/resolv.conf in Kubernetes Pods

This article explains how CoreDNS runs on a Caddy‑based HTTP/2 server in Kubernetes, how kubelet injects the cluster DNS IP into each container’s /etc/resolv.conf, and how different dnsPolicy settings (Default, ClusterFirst, ClusterFirstWithHostNet, None) affect the resolv.conf configuration, including key options and examples.

CoreDNSdnsPolicykubelet
0 likes · 6 min read
How CoreDNS and kubelet Configure /etc/resolv.conf in Kubernetes Pods
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Feb 19, 2024 · Cloud Native

Unveiling Kubernetes: Architecture, Core Components, and Source Code Deep Dive

This article provides a comprehensive overview of Kubernetes, detailing its fundamental concepts, master‑worker architecture, networking model, security mechanisms, extensibility via custom resources, and an in‑depth examination of key source‑code modules such as kube‑apiserver, etcd, controller‑manager, scheduler, kubelet, and kube‑proxy, with links to the official repository.

Containersarchitecturecloud-native
0 likes · 10 min read
Unveiling Kubernetes: Architecture, Core Components, and Source Code Deep Dive
Alibaba Cloud Native
Alibaba Cloud Native
Feb 18, 2024 · Cloud Native

How to Build a Hybrid Cloud Disaster‑Recovery System with Alibaba ACK One

This guide explains how to use Alibaba Cloud's ACK One platform to connect on‑premises and public‑cloud Kubernetes clusters, configure network interconnectivity, create multi‑cluster fleets, optionally deploy applications via GitOps, and manage traffic with a multi‑cluster gateway for seamless same‑city disaster recovery.

ACK OneGitOpsMulti-Cluster
0 likes · 13 min read
How to Build a Hybrid Cloud Disaster‑Recovery System with Alibaba ACK One
Top Architect
Top Architect
Feb 18, 2024 · Backend Development

Why Token Pass‑Through Is Discouraged and Alternative Unified Authorization Designs for Microservices

The article explains why passing tokens between microservices is a poor design, proposes exposing explicit userId parameters, describes unified authentication via an API gateway with Feign, Dubbo or Spring Boot Web implementations, compares their pros and cons, and shows how to integrate these patterns with Kubernetes and internal API path rules.

AuthenticationBackendDubbo
0 likes · 9 min read
Why Token Pass‑Through Is Discouraged and Alternative Unified Authorization Designs for Microservices
MaGe Linux Operations
MaGe Linux Operations
Feb 17, 2024 · Cloud Native

From chroot to Kubernetes: The Evolution of Containerization

Tracing the history of containerization, this article explores how early file isolation with chroot evolved through namespaces and cgroups, leading to LXC, Docker’s lightweight application packaging, Kubernetes orchestration, and finally cloud-native services like Huawei CCE, highlighting each stage’s impact on modern software deployment.

DockerLinuxNamespaces
0 likes · 11 min read
From chroot to Kubernetes: The Evolution of Containerization
DevOps Cloud Academy
DevOps Cloud Academy
Feb 16, 2024 · Cloud Native

Configuring a Kubernetes Pod as a Jenkins Agent

This guide explains how to set up a Kubernetes pod to act as a Jenkins agent, covering prerequisites, deployment YAML, commands to launch and verify the pod and service, and the Jenkins UI configuration needed to connect the pod as a scalable CI/CD worker.

JenkinsPodagent
0 likes · 5 min read
Configuring a Kubernetes Pod as a Jenkins Agent
DevOps Cloud Academy
DevOps Cloud Academy
Feb 14, 2024 · Cloud Native

Weaveworks Shuts Down: What It Means for GitOps and the Future of Flux

Weaveworks, the company that coined the term GitOps, announced its closure due to unstable sales despite $10 million revenue, prompting industry analysis of GitOps’s commercial viability, competition with ArgoCD, and the future stewardship of the open‑source Flux project under CNCF.

Continuous DeliveryFluxGitOps
0 likes · 5 min read
Weaveworks Shuts Down: What It Means for GitOps and the Future of Flux
MaGe Linux Operations
MaGe Linux Operations
Feb 10, 2024 · Backend Development

Mastering the Sidecar Pattern: Log Collection, Request Forwarding, and Interception in Kubernetes

This article explains the sidecar concept, compares it with SDK approaches, and provides detailed Kubernetes examples—including a log‑collection sidecar, a request‑forwarding sidecar, and an HTTP‑intercepting sidecar—complete with YAML manifests and Rust and Scala code to demonstrate implementation and deployment.

RustScalakubernetes
0 likes · 9 min read
Mastering the Sidecar Pattern: Log Collection, Request Forwarding, and Interception in Kubernetes
Architect
Architect
Feb 10, 2024 · Backend Development

Why Token Pass‑Through Is Bad and How to Build Unified Auth in Microservices

The article critiques token pass‑through for microservice authentication, explains why internal APIs should stay stateless, and presents unified authorization patterns using Spring Cloud Gateway with Feign, Dubbo, or a gateway‑less design, plus Kubernetes integration and trade‑offs.

AuthenticationDubbofeign
0 likes · 9 min read
Why Token Pass‑Through Is Bad and How to Build Unified Auth in Microservices
MaGe Linux Operations
MaGe Linux Operations
Feb 9, 2024 · Cloud Native

Mastering Kubernetes Ingress: Controllers, Architecture, and Lua Extensions

This article explains Kubernetes Ingress fundamentals, compares major Ingress controllers such as Nginx, Kong, Traefik, HAProxy and APISIX, and details the internal architecture and Lua‑based extension points of the ingress‑nginx controller, providing a comprehensive guide for managing external traffic in cloud‑native environments.

Controllerscloud-nativeingress
0 likes · 8 min read
Mastering Kubernetes Ingress: Controllers, Architecture, and Lua Extensions
37 Interactive Technology Team
37 Interactive Technology Team
Feb 8, 2024 · Operations

What Are Kubernetes Events and How to Collect Them

Kubernetes events record state changes such as pod scheduling, image pulling, and failures, which can be inspected via kubectl but are retained only an hour, so tools like kube-eventer or kubernetes-event-exporter collect them for long‑term analysis, enabling monitoring of Warning types, failure reasons, and visualization through Grafana dashboards.

Cloud-nativeEventsGrafana
0 likes · 9 min read
What Are Kubernetes Events and How to Collect Them
MaGe Linux Operations
MaGe Linux Operations
Feb 7, 2024 · Backend Development

How to Deploy a High‑Availability RabbitMQ Cluster on Kubernetes with NFS Storage

This guide walks through installing RabbitMQ, explaining its features and typical use cases, then details step‑by‑step deployment of a mirrored‑mode RabbitMQ cluster on Kubernetes using StatefulSets, NFS‑backed persistent storage, RBAC, and verification of cluster health and management operations.

Cluster DeploymentMessage queueRabbitMQ
0 likes · 23 min read
How to Deploy a High‑Availability RabbitMQ Cluster on Kubernetes with NFS Storage
MaGe Linux Operations
MaGe Linux Operations
Feb 6, 2024 · Cloud Native

How to Build a Cilium Dual‑Stack IPv4/IPv6 Kubernetes Cluster with Kind

This guide explains the concepts of IPv4/IPv6 dual‑stack networking, outlines two dual‑stack implementation methods, and provides step‑by‑step instructions to set up a Cilium‑enabled Kubernetes cluster using Kind, configure dual‑stack settings, deploy a demo app, and analyze routing behavior for both IP families.

CNICiliumDual-Stack
0 likes · 9 min read
How to Build a Cilium Dual‑Stack IPv4/IPv6 Kubernetes Cluster with Kind
dbaplus Community
dbaplus Community
Feb 4, 2024 · Operations

How Ant Group Leverages SLO and AIOps for Fine‑Grained Operations

This article details Ant Group's practical implementation of Service Level Objectives (SLO) and AIOps to achieve fine‑grained operations, covering SLO fundamentals, health‑score architecture, GitOps‑based data pipelines, error‑budget alerting, AI‑driven anomaly detection, fault localization techniques, and real‑world case studies on dashboards, Kubernetes SLOs, and emergency response workflows.

Error BudgetFault LocalizationOperations
0 likes · 38 min read
How Ant Group Leverages SLO and AIOps for Fine‑Grained Operations
Alibaba Cloud Native
Alibaba Cloud Native
Feb 4, 2024 · Cloud Native

Build Dynamic Fan‑Out/Fan‑In DAG Workflows with Argo on ACK One

This guide explains how to use Argo Workflow on Alibaba Cloud ACK One to implement dynamic fan‑out/fan‑in DAGs, splitting large log files, running parallel map tasks, and aggregating results with a reduce step, including full YAML definitions and execution steps.

Argo WorkflowDynamic DAGFan-out Fan-in
0 likes · 10 min read
Build Dynamic Fan‑Out/Fan‑In DAG Workflows with Argo on ACK One
MaGe Linux Operations
MaGe Linux Operations
Feb 2, 2024 · Cloud Native

Connect Java Maven Apps to Kubernetes with kubeconfig & ServiceAccount

This guide demonstrates how to set up a Maven project with the Fabric8 Kubernetes Java client, configure minimal kubeconfig or ServiceAccount credentials, and use sample code to list namespaces, illustrating essential steps for connecting Java applications to a Kubernetes cluster with minimal configuration.

Fabric8JavaMaven
0 likes · 6 min read
Connect Java Maven Apps to Kubernetes with kubeconfig & ServiceAccount
Ops Development Stories
Ops Development Stories
Feb 2, 2024 · Cloud Native

Essential kubectl Commands for Efficient Kubernetes Management

This guide compiles a comprehensive set of kubectl and Docker commands for retrieving logs, sorting pods, managing secrets, cleaning resources, debugging, port forwarding, and performing cluster maintenance tasks, helping administrators streamline Kubernetes operations and troubleshoot issues effectively.

CLICluster ManagementDevOps
0 likes · 15 min read
Essential kubectl Commands for Efficient Kubernetes Management
Cloud Native Technology Community
Cloud Native Technology Community
Feb 2, 2024 · Cloud Native

Achieving Sub‑2‑Hour RTO: A Cloud‑Native Disaster Recovery Blueprint for Enterprises

This article examines how a leading global industrial group leveraged a cloud‑native platform to design a disaster‑recovery solution that meets a sub‑2‑hour RTO and a 1‑minute RPO, detailing architecture, data‑layer strategies, middleware replication, application and access‑layer handling, and operational best practices.

ACPGitOpsRPO
0 likes · 17 min read
Achieving Sub‑2‑Hour RTO: A Cloud‑Native Disaster Recovery Blueprint for Enterprises
AntData
AntData
Feb 1, 2024 · Cloud Native

Deploying LiteIO Cloud‑Native Block Storage Service on Kubernetes

This guide explains how to set up the high‑performance, cloud‑native LiteIO block storage service on a Kubernetes cluster, covering prerequisite VM preparation, kernel upgrade, Docker and Kubernetes installation, CRI configuration, LiteIO component deployment for both LVM and SPDK engines, and verification of Pods and PVCs.

CSICloud Native StorageLVM
0 likes · 14 min read
Deploying LiteIO Cloud‑Native Block Storage Service on Kubernetes
System Architect Go
System Architect Go
Jan 31, 2024 · Cloud Native

My CKA Certification Experience and Rapid‑Study Guide

The author shares how they passed the Certified Kubernetes Administrator exam with a 95 score, explains the certification’s scope, provides practical preparation tips, environment requirements, key study resources, and a concise strategy to quickly master the 17 recurring exam questions.

CKACertificationExam Guide
0 likes · 5 min read
My CKA Certification Experience and Rapid‑Study Guide
政采云技术
政采云技术
Jan 30, 2024 · Cloud Native

Understanding the Core Workflow of Kubernetes Informer in client-go

This article explains the internal workflow of the Kubernetes informer package in client-go, covering its architecture, key components such as Reflector, DeltaFIFO, and Indexer, and provides a step‑by‑step code example that demonstrates how informers are created, registered, started, and used to handle watch events efficiently.

ControllerDeltaFIFOGo
0 likes · 19 min read
Understanding the Core Workflow of Kubernetes Informer in client-go
MaGe Linux Operations
MaGe Linux Operations
Jan 30, 2024 · Cloud Native

How to Auto‑Recover Lost s3fs Mounts in a Huawei OBS CSI Plugin

This article explains why a Huawei OBS CSI plugin loses its s3fs process after a restart, causing "Transport endpoint is not connected" errors, and provides a step‑by‑step solution using client‑go to rebuild the mount and trigger kubelet remount via a liveness probe.

CSICloudNativeLivenessProbe
0 likes · 7 min read
How to Auto‑Recover Lost s3fs Mounts in a Huawei OBS CSI Plugin
Beike Product & Technology
Beike Product & Technology
Jan 29, 2024 · Information Security

Kubernetes Security Risks and Hardening Recommendations

This article analyzes Kubernetes security threats from cloud, cluster, and container perspectives, enumerates high‑risk permissions, default privileged accounts, and insecure configurations, and provides concrete hardening steps such as least‑privilege RAM policies, etcd encryption, RBAC tightening, and workload isolation measures.

CloudNativePodSecurityRBAC
0 likes · 31 min read
Kubernetes Security Risks and Hardening Recommendations
Liangxu Linux
Liangxu Linux
Jan 28, 2024 · Cloud Native

Master Kubernetes Troubleshooting: 100 Essential kubectl Commands

This guide compiles 100 practical kubectl commands that help you diagnose cluster information, pods, services, deployments, networking, storage, security, autoscaling, and many other Kubernetes components, providing a handy reference for effective cluster troubleshooting.

Clustercloud-nativecommands
0 likes · 19 min read
Master Kubernetes Troubleshooting: 100 Essential kubectl Commands
Liangxu Linux
Liangxu Linux
Jan 28, 2024 · Cloud Native

Debugging Running Pods in Kubernetes Without Root Access

This guide explains why kubectl exec often fails under security best practices, introduces kubectl debug with ephemeral containers for root‑level troubleshooting, shows how to create and use debug containers, and outlines alternative non‑native methods and tools for inspecting live pods.

EphemeralContainerskubectlkubernetes
0 likes · 10 min read
Debugging Running Pods in Kubernetes Without Root Access
DevOps Operations Practice
DevOps Operations Practice
Jan 28, 2024 · Cloud Native

Five Open‑Source Storage Projects for Kubernetes

This article introduces five major open‑source storage solutions—OpenEBS, Rook, GlusterFS, Ceph, and LongHorn—explaining how each simplifies persistent data management for Kubernetes workloads while offering features such as replication, self‑healing, and multi‑node scalability.

CephLonghornOpenEBS
0 likes · 6 min read
Five Open‑Source Storage Projects for Kubernetes
AntTech
AntTech
Jan 25, 2024 · Cloud Native

LiteIO: Open‑Source High‑Performance Cloud‑Native Block Device Service

LiteIO is an open‑source, high‑performance, cloud‑native block device service that uses NVMe‑oF and SPDK to provide point‑to‑point storage pooling, enabling efficient FinOps, serverless scaling, hot upgrades, zero‑copy I/O, snapshots, and thin provisioning for databases and applications in Kubernetes.

LiteIONVMe-oFSPDK
0 likes · 11 min read
LiteIO: Open‑Source High‑Performance Cloud‑Native Block Device Service
Aikesheng Open Source Community
Aikesheng Open Source Community
Jan 25, 2024 · Cloud Native

Introducing LiteIO: Open‑Source High‑Performance Cloud‑Native Block Device Service

LiteIO is an open‑source, cloud‑native block device service that leverages NVMe‑oF and SPDK to provide high‑performance, scalable storage for Kubernetes‑based workloads, improving storage utilization and enabling FinOps‑driven cost efficiency across large‑scale production environments.

Cloud Native StorageFinOpsLiteIO
0 likes · 12 min read
Introducing LiteIO: Open‑Source High‑Performance Cloud‑Native Block Device Service
Open Source Linux
Open Source Linux
Jan 25, 2024 · Cloud Native

Top 200 Kubernetes Interview Questions and Answers for Mastery

This comprehensive guide presents 200 essential Kubernetes interview questions covering fundamentals, architecture, real‑world scenarios, and advanced topics, complete with concise answers, diagrams, and practical insights to help candidates ace container orchestration interviews.

DevOpsInterviewcloud-native
0 likes · 21 min read
Top 200 Kubernetes Interview Questions and Answers for Mastery
360 Smart Cloud
360 Smart Cloud
Jan 24, 2024 · Cloud Native

Idle Compute Sharing in Dedicated Kubernetes Clusters Using Karmada

The article describes how a company implements an idle compute sharing feature for dedicated Kubernetes clusters, leveraging Karmada to allocate spare CPU and memory to offline workloads, thereby improving resource utilization, reducing costs, and outlining usage scenarios, configuration steps, technical architecture, and future plans.

Idle Compute SharingKarmadacloud-native
0 likes · 9 min read
Idle Compute Sharing in Dedicated Kubernetes Clusters Using Karmada
Volcano Engine Developer Services
Volcano Engine Developer Services
Jan 24, 2024 · Cloud Native

How ByteDance’s Gödel Scheduler Unifies Online and Offline Workloads at Massive Scale

The article details ByteDance’s Gödel Scheduler, a cloud‑native, distributed Kubernetes scheduler that unifies online and offline workloads, describing its architecture, enhanced features, performance gains, roadmap, and open‑source plans, including its multi‑instance design, optimistic concurrency, and rescheduling capabilities for improved throughput and scheduling quality.

Performance OptimizationSchedulerkubernetes
0 likes · 15 min read
How ByteDance’s Gödel Scheduler Unifies Online and Offline Workloads at Massive Scale
Linux Code Review Hub
Linux Code Review Hub
Jan 23, 2024 · Industry Insights

2024 eBPF and Networking Trends Forecast

The article forecasts rapid eBPF adoption across cloud‑native networking, mobile devices, and observability, highlights emerging eBPF marketplaces, discusses performance gains with NetKit and BIG TCP, predicts IPv6‑first Kubernetes clusters, AI‑assisted network troubleshooting, and the growing convergence of platform engineering and networking in 2024.

CiliumeBPFkubernetes
0 likes · 16 min read
2024 eBPF and Networking Trends Forecast
MaGe Linux Operations
MaGe Linux Operations
Jan 22, 2024 · Cloud Native

Deploy and Secure Nacos Config Center on Huawei CCE & Kubernetes

This guide explains how to use Nacos as a centralized configuration center in Spring Boot micro‑services, covering common pitfalls of static configs, best‑practice namespace/group/DataId design, dependency setup, YAML examples, security annotations, role‑based access, Dockerfile tweaks, CCE deployment, database schema, and Kubernetes manifests for test and production environments.

Configuration ManagementDevOpsHuawei CCE
0 likes · 19 min read
Deploy and Secure Nacos Config Center on Huawei CCE & Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
Jan 22, 2024 · Cloud Native

How Kube Queue Optimizes Batch Job Scheduling in Kubernetes

Batch jobs demand efficient resource use, but Kubernetes’ default scheduler struggles with large queues; Kube Queue, a cloud‑native AI suite component, introduces dedicated queues, flexible strategies, and quota management to automate scheduling, support multi‑tenant workloads, and improve cluster utilization.

AIJob Schedulingkubernetes
0 likes · 14 min read
How Kube Queue Optimizes Batch Job Scheduling in Kubernetes
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Jan 22, 2024 · Industry Insights

How Trustworthy Computing Power Measurement Can Transform Cloud‑Native Services

This article examines the urgent need for standardized, trustworthy computing power measurement, outlines narrow and broad measurement frameworks, and details a technical solution that integrates WASM virtual machines and blockchain with Kubernetes to achieve precise, tamper‑proof resource accounting for modern cloud‑native environments.

Wasmcloud-nativecomputing power measurement
0 likes · 14 min read
How Trustworthy Computing Power Measurement Can Transform Cloud‑Native Services