Tagged articles
4063 articles
vivo Internet Technology
Sep 6, 2023 · Cloud Native

Multi-Cluster Management in Kubernetes: Concepts, Practices, and Karmada Exploration

The article explains why enterprises adopt multi‑cluster Kubernetes architectures, reviews community solutions such as Karmada, Clusternet and OCM, and details vivo’s hybrid strategy that combines a unified UI for independent clusters with Karmada‑based federation for resource distribution, elastic scaling, cross‑cluster scheduling, and gray‑release migration.

Karmada, Multi-Cluster, Scheduling
0 likes · 20 min read
DevOps Cloud Academy
Sep 6, 2023 · Cloud Native

The Evolving Role of Developers in Infrastructure as Code and Cloud‑Native Platforms

This article examines how infrastructure management has shifted toward treating infrastructure as code, the growing responsibilities of developers in deploying and maintaining cloud‑native platforms such as Kubernetes, the challenges they face, and the supporting role of platform/DevOps teams and tools like Terraform and ArgoCD.

ArgoCD, DevOps, GitOps
0 likes · 7 min read
Cloud Native Technology Community
Sep 5, 2023 · Cloud Native

Why Kubernetes 1.28 Finally Makes Sidecars First‑Class Citizens

This guide explains what sidecars are in Kubernetes, why they matter, the challenges they introduce, and how the new sidecar KEP in Kubernetes 1.28—introducing an Always RestartPolicy for init containers—formalizes sidecars as first‑class API objects, reshaping service‑mesh implementations and operational practices.

Init Container, Kubernetes 1.28, RestartPolicy
0 likes · 18 min read
Alibaba Cloud Native
Sep 3, 2023 · Cloud Native

Master Knative’s Request‑Based Autoscaling: KPA, Scale‑to‑Zero, and Advanced Strategies

This article explains how Knative implements request‑based autoscaling with KPA, details the scale‑to‑zero mechanism, shows how to handle burst traffic using stable and panic windows, and demonstrates advanced extensions such as resource pools, precise MPA scaling, and predictive AHPA configurations with concrete YAML examples.

Autoscaling, KPA, Knative
0 likes · 18 min read
DevOps Cloud Academy
Sep 1, 2023 · Cloud Native

Understanding Kubernetes Termination Signals and Graceful Shutdown

This article explains how Kubernetes termination signals work, the graceful shutdown workflow, handling of application termination, customization of grace periods, impact on high availability, best practices, and tools such as preStop hooks to ensure reliable container lifecycle management.

Grace Period, Pod Lifecycle, Termination Signals
0 likes · 9 min read
MaGe Linux Operations
Aug 31, 2023 · Cloud Native

How to Achieve Zero‑Downtime Deployments with Kubernetes

Learn how to configure Kubernetes for zero‑downtime applications by syncing container images, ensuring multiple pod replicas, using PodDisruptionBudgets, selecting appropriate deployment strategies, setting up liveness/readiness probes, handling graceful termination, applying pod anti‑affinity, and enabling autoscaling and proper resource limits.

Autoscaling, Probes, Zero Downtime
0 likes · 12 min read
Liangxu Linux
Aug 29, 2023 · Cloud Native

Master Real-Time Multi-Pod Log Viewing in Kubernetes with Kubetail & Stern

This guide introduces two lightweight Kubernetes log‑tailing tools, Kubetail and Stern, explains their installation on various platforms, demonstrates common usage patterns and command‑line options, and provides practical examples for aggregating and filtering logs across multiple pods and containers.

DevOps, cloud-native, kubernetes
0 likes · 10 min read
DevOps Cloud Academy
Aug 29, 2023 · Cloud Native

Achieving Zero‑Downtime Applications with Kubernetes

This article explains why and how to use Kubernetes features such as multiple pod replicas, PodDisruptionBudgets, deployment strategies, health probes, graceful termination, anti‑affinity, resource limits, and autoscaling to build zero‑downtime, highly available applications.

Autoscaling, Deployment Strategies, Health probes
0 likes · 12 min read
Alibaba Cloud Native
Aug 29, 2023 · Cloud Native

How Kruise Rollout Uses Lua Scripts for Extensible Gateway Traffic Scheduling

Kruise Rollout introduces a Lua‑script based, extensible traffic routing solution that enables progressive delivery across diverse gateway resources—such as Istio, Kong, and APISIX—by dynamically modifying VirtualService and DestinationRule objects, simplifying configuration, reducing custom code, and supporting automated canary, blue‑green, and A/B testing deployments.

Gateway API, Istio, Kruise Rollout
0 likes · 14 min read
Efficient Ops
Aug 28, 2023 · Operations

How to Build a Real‑Time ELK Log Analysis Platform for Scalable Operations

This article explains why centralized logging is essential for modern micro‑service systems, outlines an ELK‑based architecture with Filebeat, Kafka, Logstash, Elasticsearch and Kibana, and provides detailed configuration examples for both file‑based and Kubernetes‑based log collection, plus visualization techniques.

ELK, Elasticsearch, Filebeat
0 likes · 10 min read
Volcano Engine Developer Services
Aug 25, 2023 · Cloud Native

How ByteDance Scaled with Multi‑Cloud: Lessons from Their Cloud‑Native Journey

ByteDance’s multi‑cloud evolution, driven by rapid business growth, cost control, and compliance needs, showcases a distributed cloud‑native platform built on open‑source orchestration, unified resource management, and advanced data‑lake solutions, while addressing operational complexity, interoperability, and emerging AI‑driven challenges.

AI, Big Data, cloud-native
0 likes · 14 min read
Wukong Talks Architecture
Aug 25, 2023 · Cloud Native

Cloud-Native Architecture Evolution and Practices at ZEEK Automotive

This article details ZEEK Automotive's transition to a cloud‑native architecture, describing how the company adopted Kubernetes, containerization, micro‑service unification, and full‑link gray release to improve system stability, scalability, and development efficiency across its APP ecosystem.

Automotive, Containerization, DevOps
0 likes · 18 min read
dbaplus Community
Aug 24, 2023 · Operations

Master Kubernetes Logging: Best Practices and Essential Tools

This article explains how Kubernetes logging works, reviews popular log collection agents like Fluentd, outlines best‑practice guidelines such as centralized storage, log rotation, structured logs, and sensitive data handling, and provides useful CLI tools and kubectl commands for efficient log inspection.

Best Practices, Fluentd, kubectl
0 likes · 11 min read
dbaplus Community
Aug 23, 2023 · Information Security

Why APISIX TLS Handshakes Failed: Missing SNI and SSLv2 Pitfalls

During a migration from Nginx to APISIX, the team encountered TLS handshake failures caused by missing SNI fields and legacy SSLv2Hello usage, leading to a detailed investigation, protocol explanations, and configuration fixes to restore secure connections without modifying client code.

APISIX, SNI, SSL
0 likes · 11 min read
Cloud Native Technology Community
Aug 23, 2023 · Cloud Native

How Kubernetes 1.28 Improves Batch Jobs with Pod Replacement Policy and Per‑Index Backoff Limits

Kubernetes 1.28 adds two alpha features—Pod Replacement Policy and per‑index backoff limits—that let batch jobs replace terminating pods more intelligently and cap retries for each indexed pod, reducing resource waste and improving reliability for machine‑learning workloads.

Alpha Feature, Backoff Limit, Batch Jobs
0 likes · 9 min read
Efficient Ops
Aug 22, 2023 · Operations

Persisting Prometheus Alertmanager Alerts with Alertsnitch, MySQL, and Grafana

This article explains how Prometheus stores alerts only as time‑series data, why that limits historical queries, and provides a complete open‑source solution using Alertmanager, Alertsnitch, MySQL, and Grafana to persist, query, and visualize alerts in production environments.

Alert Persistence, Alertmanager, Grafana
0 likes · 10 min read
DataFunTalk
Aug 22, 2023 · Artificial Intelligence

Building Complex Distributed Systems with Ray: An AutoML Case Study and Cloud‑Native Deployment

This article explains how the Ray distributed computing engine simplifies the design, deployment, and operation of complex cloud‑native distributed systems—illustrated through an AutoML service example—by detailing system complexity, Ray’s core concepts, resource customization, runtime environments, monitoring, and ecosystem integrations.

AI, AutoML, Distributed computing
0 likes · 26 min read
Cloud Native Technology Community
Aug 22, 2023 · Cloud Native

Building a Robust Internal Developer Platform on Kubernetes: Components, Governance, and Policies

This article explains how to construct an Internal Developer Platform (IDP) on Kubernetes by outlining its essential components, adding default add‑on tools, establishing governance and policies, and implementing continuous integration/continuous delivery pipelines to provide developers with a safe, efficient, and cost‑effective experience.

Governance, Internal Developer Platform, Policy
0 likes · 7 min read
Alibaba Cloud Native
Aug 21, 2023 · Cloud Native

Optimizing Multi‑Cluster Cloud Native Costs: ZEEK’s ACK FinOps Journey

This article details how ZEEK automotive tackled rapid growth challenges by redesigning its cloud‑native infrastructure, adopting Alibaba Cloud ACK FinOps and ACK One for multi‑cluster management, and implementing cost‑visibility, intelligent allocation, and configuration checks that yielded significant resource savings and operational stability.

FinOps, Multi-Cluster, cost optimization
0 likes · 18 min read
Efficient Ops
Aug 20, 2023 · Cloud Native

Master Kubernetes YAML with kubectl: help, dry‑run, and explain tricks

This guide shows beginners how to simplify Kubernetes YAML creation by leveraging kubectl's help, dry‑run, and explain commands, providing step‑by‑step examples, code snippets, and practical tips to boost efficiency and confidence when deploying applications.

YAML, cloud-native, deployment
0 likes · 6 min read
Cloud Native Technology Community
Aug 17, 2023 · Cloud Native

Kubernetes v1.28 (Planternetes) Release: New Features, Enhancements, and Deprecations

Kubernetes v1.28, codenamed Planternetes, introduces 45 enhancements—including expanded version skew support, non‑graceful node shutdown recovery, improved CRD validation, beta ValidatingAdmissionPolicies, hybrid version proxy, and many features promoted to stable—while also deprecating and removing several older components, and provides download and community resources for adoption.

Deprecations, Planternetes, cloud-native
0 likes · 20 min read
vivo Internet Technology
Aug 16, 2023 · Cloud Native

Building a Scalable Container Monitoring System with Prometheus and VictoriaMetrics at vivo

The vivo Internet Container Team built a scalable, high‑availability container monitoring platform by deploying dual‑replica Prometheus clusters with a custom HA adapter, remoteWrite to VictoriaMetrics, and a Kafka forwarder, while cutting metric cardinality, tuning cAdvisor, and upgrading VictoriaMetrics to eliminate data loss and storage spikes, achieving stable, efficient monitoring.

Container, Metrics Optimization, Prometheus
0 likes · 16 min read
ByteDance Cloud Native
Aug 15, 2023 · Cloud Native

What’s New in Katalyst v0.3.0? Core Enhancements Explained

Katalyst v0.3.0 introduces major upgrades including enhanced KCNR API bandwidth isolation, a more extensible task and async execution framework, advanced mixed‑deployment controls, load‑aware resource prediction, and concurrent unit testing, all aimed at improving cloud‑native resource management efficiency.

Katalyst, kubernetes, resource management
0 likes · 4 min read
Liangxu Linux
Aug 14, 2023 · Cloud Native

Enable Cilium BGP Networking with Bird and FRR in a Vagrant Lab

This guide walks through setting up a simulated Kubernetes cluster with Cilium, configuring BGP using FRR on a router node and Bird on worker nodes, and enabling Cilium's built‑in BGP speaker to achieve cross‑node pod communication and LoadBalancer IP advertisement.

BGP, Bird, Cilium
0 likes · 13 min read
Efficient Ops
Aug 14, 2023 · Cloud Native

How to Secure Multi‑Tenant Kubernetes Clusters: Best Practices and Architecture

This article explains the concept of multi‑tenant Kubernetes clusters, compares soft and hard isolation models, outlines enterprise, SaaS, and KaaS use cases, and provides practical guidance on access control, resource scheduling, and advanced security measures to achieve reliable tenant isolation.

access control, kubernetes, multi-tenancy
0 likes · 14 min read
UCloud Tech
Aug 14, 2023 · Artificial Intelligence

Deploy ChatGLM2‑6B on UCloud K8S: Complete Guide to Large Language Model Inference

This article reviews the architectures, training methods, and key characteristics of major open‑source large language models such as BERT, GPT, T5, LLaMA and ChatGLM, and then provides a step‑by‑step tutorial for deploying ChatGLM2‑6B on UCloud's UK8S platform using UFS storage, Kubernetes manifests, and command‑line tools.

ChatGLM, UCloud, kubernetes
0 likes · 14 min read
Efficient Ops
Aug 13, 2023 · Cloud Native

Mastering Kubernetes CRDs: How to Define and Use Custom Resources

This article explains what Kubernetes CustomResourceDefinitions (CRDs) are, how they extend the platform without modifying core code, and provides step‑by‑step examples for creating CRDs and custom objects using YAML manifests and kubectl commands.

CRD, CustomResourceDefinition, YAML
0 likes · 7 min read
MaGe Linux Operations
Aug 13, 2023 · Cloud Native

How Pods Really Work: Inside Containers, Namespaces, and cgroups

This article explores the inner workings of Kubernetes Pods by examining how containers share network, IPC, mount, and PID namespaces, how cgroups enforce resource limits, and how similar behavior can be reproduced with Docker, providing hands‑on examples and code snippets for deeper understanding.

Containers, Namespaces, Pods
0 likes · 19 min read
Liangxu Linux
Aug 12, 2023 · Cloud Native

How Datree Prevents Misconfigurations in Kubernetes Deployments

Datree is an open‑source CLI that validates Kubernetes YAML files against customizable policies, offering syntax checks, version validation, and resource limits enforcement, allowing developers to catch configuration errors without connecting to a live cluster and providing dashboard‑driven rule management.

CLI, Datree, YAML validation
0 likes · 5 min read
Full-Stack DevOps & Kubernetes
Aug 11, 2023 · Cloud Native

How to Build a Scalable E‑Commerce Platform with Kubernetes

This guide explains how to design a Kubernetes‑based architecture for a large‑scale online store, covering component containerization, auto‑scaling mechanisms, high‑availability deployment, continuous delivery, and monitoring integration to handle traffic spikes reliably.

Auto-scaling, cloud-native, e‑commerce
0 likes · 5 min read
Full-Stack DevOps & Kubernetes
Aug 10, 2023 · Operations

How Kubernetes Powers Modern DevOps Automation and Operations

By integrating Kubernetes with DevOps practices, teams can automate deployment pipelines, achieve dynamic resource allocation, centralize monitoring with tools like Prometheus and Grafana, and treat infrastructure as code, resulting in faster, higher-quality software delivery and improved collaboration between development and operations.

DevOps, Operations, automation
0 likes · 7 min read
政采云技术
Aug 10, 2023 · Cloud Native

Detailed Overview of Kubernetes Gateway API

This article provides a comprehensive guide to Kubernetes Gateway API, covering its evolution from Ingress, core concepts, resource types, configuration examples, role‑based design, cross‑namespace routing, and practical steps for installation, deployment, and management using Istio and other controllers.

Gateway API, Istio, api-gateway
0 likes · 39 min read
ByteDance Cloud Native
Aug 9, 2023 · Cloud Native

How Volcano Engine’s New GPU Sharing Scheduler Boosts AI Workloads by 500%

This article explains Volcano Engine's next‑generation GPU sharing scheduling technology, detailing the two‑layer scheduler, card‑level bin‑pack/spread strategies, system architecture, API definitions, and optimization algorithms that together increase GPU deployment density over 500% and improve utilization by more than 50% for AI workloads.

GPU scheduling, kubernetes, mGPU
0 likes · 13 min read
Efficient Ops
Aug 8, 2023 · Cloud Native

How Does Kubernetes Turn YAML into Live Objects? A Deep Dive into Declarative APIs

This article explains how Kubernetes stores objects via the API and etcd, compares imperative and declarative management with kubectl commands and YAML files, describes the spec/status model, outlines the API server processing pipeline, and details the controller‑based reconciliation loop that drives desired state convergence.

Controller, Declarative API, YAML
0 likes · 14 min read
Liangxu Linux
Aug 6, 2023 · Cloud Native

Unlock Hidden kubectl Tricks: Advanced Commands for Kubernetes Mastery

This guide presents a collection of advanced kubectl techniques—including printing API details, filtering and deleting pods by status, counting pods per node, analyzing pod distribution across machines, and leveraging kubectl proxy—providing practical command examples and explanations for experienced Kubernetes users.

CLI, DevOps, Operations
0 likes · 8 min read
Alibaba Cloud Native
Aug 6, 2023 · Artificial Intelligence

Boost Bloom‑7B1 Inference 2.5× Faster with FasterTransformer on ACK

This guide shows how to accelerate Bloom‑7B1 inference on Alibaba Cloud ACK by converting the model to FasterTransformer format, deploying it with Triton Server, and comparing performance against the original HuggingFace checkpoint, achieving roughly a 2.5‑fold speedup.

Artificial Intelligence, Bloom-7B1, FasterTransformer
0 likes · 17 min read
Ops Development Stories
Aug 3, 2023 · Cloud Native

How to Use Kubernetes as a Config Center and Auto‑Generate Dockerfiles

This article compares traditional independent config centers with Kubernetes as a config center, explains how to simplify environment variable handling, automatically generate Dockerfiles, and manage build and runtime images in Go projects, offering practical tips and code examples for containerized service deployment.

Config Center, Dockerfile, environment variables
0 likes · 11 min read
DevOps Cloud Academy
Aug 2, 2023 · Cloud Native

Backing Up and Restoring etcd in a Kubernetes Cluster

This tutorial walks through installing the etcd client, creating an Nginx deployment for verification, backing up the etcd data store, validating the backup, and restoring the backup to a Kubernetes cluster while handling component shutdown and restart procedures.

DevOps, cloud-native, etcd
0 likes · 14 min read
ByteDance SYS Tech
Aug 1, 2023 · Cloud Native

How ByteFUSE Revolutionizes High‑Performance Cloud‑Native Storage with FUSE and RDMA

ByteFUSE, a user‑space FUSE‑based solution for ByteNAS, delivers low‑latency, high‑throughput, POSIX‑compatible storage across AI training, database backup, and search services by replacing NFS with a cloud‑native architecture that leverages CSI, RDMA, and kernel‑module hot‑upgrade techniques.

Distributed File System, FUSE, Performance Optimization
0 likes · 19 min read
HelloTech
Aug 1, 2023 · Cloud Native

Elastic Scaling Practices in Cloud‑Native Kubernetes Environments

To overcome native HPA limits and business‑specific constraints in a fully containerized, cloud‑native Kubernetes environment, we implemented a dual‑threshold water‑level and scheduled scaling engine, hybrid‑cloud ClusterAutoScale, mixed‑deployment resource prioritization, and comprehensive Prometheus‑based observability, achieving higher utilization, lower costs, and a roadmap toward deeper optimization and AIOps.

Auto Scaling, cloud-native, elastic scaling
0 likes · 10 min read
Open Source Linux
Jul 31, 2023 · Operations

How to Diagnose Kubernetes Pod Network Issues: Tools, Models, and Real-World Cases

This article introduces a systematic approach for troubleshooting Kubernetes pod network problems, covering anomaly classification, essential diagnostic tools such as tcpdump, nsenter, paping and mtr, a step‑by‑step troubleshooting workflow, and detailed case studies that illustrate root‑cause analysis and resolution techniques.

CNI, iptables, kubernetes
0 likes · 27 min read
MaGe Linux Operations
Jul 29, 2023 · Cloud Native

Why Choose K3s Over K8s? A Lightweight Kubernetes Comparison

This article explains what K3s and Kubernetes (K8s) are, compares their architectures, deployment methods, resource requirements, upgrade processes, performance and security, and advises when to choose the lightweight, CNCF‑certified K3s over the full‑featured Kubernetes for edge, development or production environments.

K3s, container orchestration, edge computing
0 likes · 18 min read
MaGe Linux Operations
Jul 27, 2023 · Cloud Native

Unlock Advanced kubectl Tricks to Master Kubernetes Clusters

This article shares practical kubectl tips—including printing API calls, filtering and deleting pods, counting pod distribution, and using kubectl proxy—to help experienced Kubernetes users work more efficiently and troubleshoot clusters with powerful command‑line shortcuts.

DevOps, Tips, cloud-native
0 likes · 7 min read
StarRocks
Jul 27, 2023 · Cloud Native

Deploy StarRocks Quickly with Docker and Kubernetes: Step‑by‑Step Guide

This guide explains how to set up a StarRocks cluster using Docker for rapid testing and the StarRocks Kubernetes Operator for production‑grade deployments, covering architecture basics, required tools, command‑line steps, YAML configuration, and connection methods for both internal and external access.

Docker, StarRocks, kubernetes
0 likes · 11 min read
Ops Development Stories
Jul 27, 2023 · Cloud Native

Why Containerization Lets You ‘Mop Up’ Work: A Real‑World Journey

This informal blog recounts a developer’s hands‑on journey from puzzling over the buzzword “containerization” to building an automated GitLab‑CI, Docker, and Kubernetes pipeline, highlighting practical benefits, challenges, and the humorous “mop‑up” moments that make the transition both enlightening and entertaining.

Containerization, Docker, GitLab
0 likes · 4 min read
Open Source Linux
Jul 27, 2023 · Cloud Native

How Datree Secures Your Kubernetes Configurations: A Hands‑On Guide

Datree is an open‑source CLI tool that validates Kubernetes YAML files, enforces custom policies, and prevents misconfigurations without needing cluster access, offering dashboard‑driven rule management and a step‑by‑step example that shows how to fix memory and CPU limits in a deployment manifest.

CLI, Datree, DevOps
0 likes · 4 min read
DevOps Cloud Academy
Jul 26, 2023 · Cloud Native

Step-by-Step Guide to Deploying Jenkins with ArgoCD on a Kubernetes Cluster

This guide walks through setting up a Kubernetes cluster with NFS storage, loading Jenkins and agent Docker images, configuring ArgoCD, deploying Jenkins via declarative manifests, initializing Jenkins, creating and deploying a static Jenkins agent, and troubleshooting common issues, providing full code snippets and screenshots.

ArgoCD, DevOps, Docker
0 likes · 4 min read
Tongcheng Travel Technology Center
Jul 26, 2023 · Cloud Native

Design and Implementation of a Cloud‑Native Operator Platform for Component Management at Tongcheng Travel

This article details Tongcheng Travel's cloud‑native migration journey, describing the challenges of resource quota management, component versioning, and operational automation, and explains how a Kubernetes Operator platform built with Go, kubebuilder, and a watch‑broadcast mechanism addresses these issues while outlining future expansion plans.

Configuration Management, Go, Operator
0 likes · 9 min read
MaGe Linux Operations
Jul 25, 2023 · Cloud Native

Why Choose Loki for Cloud‑Native Log Management? A Complete Deployment Guide

This article explains why Loki is a lightweight, cloud‑native log aggregation solution, outlines its advantages and supported storage backends, compares log collectors, details Loki's indexing and query mechanisms, and provides step‑by‑step instructions for deploying Loki in Kubernetes with all‑in‑one, read/write, and microservice modes.

Cloud Native Monitoring, Grafana, Loki
0 likes · 15 min read
Liangxu Linux
Jul 24, 2023 · Cloud Native

How Datree Prevents Kubernetes Misconfigurations: A Quick Guide

Datree is an open-source CLI that validates Kubernetes YAML files against customizable policies, allowing users to detect syntax errors, version mismatches, and resource limits without connecting to a live cluster, and it provides a dashboard for managing and editing rules.

CLI, Configuration Validation, Datree
0 likes · 5 min read
dbaplus Community
Jul 24, 2023 · Cloud Native

Do You Really Need Docker Before Learning Kubernetes?

This article explains the relationship between Docker and Kubernetes, clarifies why Docker knowledge isn’t mandatory for K8s beginners, and outlines the essential Docker skills needed to effectively work with Kubernetes clusters.

DevOps, Docker, cloud-native
0 likes · 6 min read
Alibaba Cloud Native
Jul 24, 2023 · Cloud Native

How Tim's Coffee Leveraged Cloud‑Native Architecture and Observability for Rapid Growth

Tim's Coffee transformed its legacy systems into a fully containerized, micro‑service, cloud‑native platform, using Kubernetes, Dubbo, ARMS, Prometheus and Grafana to boost deployment efficiency, scalability, cost savings, and observability, while tackling alert storms and improving development productivity.

Alibaba Cloud, DevOps, kubernetes
0 likes · 12 min read
AI Cyberspace
Jul 24, 2023 · Cloud Native

What Is Cloud Native? A Simple City Analogy and Evolution Explained

An accessible Chinese-to-English guide demystifies cloud native by likening it to a city, outlines its definition, core traits, evolution through three philosophical questions, practical “Wang’s Four Rules,” development stages from cloud‑hosted VMs to containers and serverless, and discusses applicability in private and traditional enterprises.

Cloud Computing, Containerization, DevOps
0 likes · 17 min read
DaTaobao Tech
Jul 24, 2023 · Cloud Native

Tengine-Ingress: High‑Performance Cloud‑Native Ingress Gateway for Alibaba Group

Tengine‑Ingress is Alibaba’s cloud‑native Ingress gateway built on the high‑performance Tengine‑Proxy, replacing the legacy Unified Access with dynamic, loss‑less configuration, per‑domain gray‑rollout, dual‑certificate TLS, real‑time observability, and checksum validation, achieving up to 20 % lower latency, CPU and memory usage while scaling to thousands of pods, and paving the way for a universal API gateway supporting TCP, UDP, gRPC, QUIC/HTTP3 and advanced TLS.

Dynamic Configuration, Observability, TLS
0 likes · 18 min read
MaGe Linux Operations
Jul 23, 2023 · Operations

Debug Kubernetes Pods Without Restart Using Ephemeral Containers

This guide explains why exec‑based pod debugging is limited and demonstrates how to use Kubernetes ephemeral containers to troubleshoot network issues, trace processes, and debug nodes without restarting the pod, including step‑by‑step commands and code examples.

Debugging · Ephemeral Containers · Pod
0 likes · 7 min read
ZhongAn Tech Team
Jul 21, 2023 · Cloud Computing

ZhongAn Insurance's Decade of Digital Transformation: From Cloud-Native Architecture to AIGC Integration

This article explores how ZhongAn Insurance has pioneered digital transformation in the financial technology sector over the past decade, achieving 100% cloud-native architecture, developing core systems like WuJieShan, and integrating AIGC technologies to create a comprehensive insurance ecosystem.

AIGC Integration · Cloud Computing · DevOps
0 likes · 12 min read
Alibaba Terminal Technology
Jul 21, 2023 · Cloud Native

How Tengine-Ingress Boosts Cloud‑Native Traffic with Zero‑Downtime Updates

Tengine-Ingress, Alibaba’s cloud‑native ingress gateway built on Tengine‑Proxy, replaces the legacy Tengine gateway by delivering dynamic, loss‑less configuration updates, high‑availability gray‑release mechanisms, global consistency checks, and significant performance gains in TLS handshake latency, CPU usage, and memory consumption across large‑scale deployments.

cloud-native · high availability · ingress
0 likes · 19 min read
Open Source Linux
Jul 20, 2023 · Cloud Native

How to Retrieve Crash Logs of a Restarted Pod Using kubectl --previous

When a pod crashes and restarts continuously, a plain kubectl logs call returns only the current container's output and misses the crashed instance; adding the --previous flag retrieves the logs of the last terminated container, as explained with commands, file locations, and practical verification steps.

kubectl · kubelet · kubernetes
0 likes · 7 min read
Volcano Engine Developer Services
Jul 19, 2023 · Cloud Native

How Kelemetry Transforms Kubernetes Observability with Object‑Centric Tracing

Kelemetry, an open‑source tracing system from ByteDance, visualizes Kubernetes control‑plane events by treating each object as a span, linking audit logs, events, and component interactions to provide a unified, searchable view that simplifies debugging, performance analysis, and multi‑cluster observability.

Debugging · Observability · Tracing
0 likes · 14 min read
HomeTech
Jul 19, 2023 · Cloud Native

Why Use KubeVirt and How to Deploy It on a Kubernetes Cluster

This article explains the motivations for adopting KubeVirt, introduces its concepts and architecture, details the component design, and provides step‑by‑step instructions—including code snippets—for deploying KubeVirt, CDI, Ceph‑CSI, external snapshotter, creating VMs and exposing VNC, while outlining future directions for cloud‑native virtualization.

CSI · Ceph · DevOps
0 likes · 15 min read
Alibaba Cloud Native
Jul 18, 2023 · Cloud Native

How Meiqia Scaled Its Cloud‑Native Microservices with the Higress Gateway

Meiqia, a global intelligent cloud‑customer‑service provider, replaced a tangled multi‑middleware routing stack with a unified Higress gateway, achieving hot‑reloadable routing, WebSocket stability, multi‑cloud deployment, and a clear control‑plane/data‑plane separation that dramatically reduced costs and operational complexity.

DevOps · Higress · api-gateway
0 likes · 10 min read
Cloud Native Technology Community
Jul 18, 2023 · Cloud Native

2023 Kubernetes Reliability Benchmark Highlights Common Configuration Gaps

The 2023 Fairwinds Kubernetes benchmark, analyzing over 150,000 workloads, reveals that many organizations still miss critical best‑practice configurations such as memory limits, liveness probes, proper image pull policies, replica counts, and CPU limits or requests, leading to increased security risks, uncontrolled cloud costs, and reduced reliability.

Benchmark · Best Practices · Reliability
0 likes · 7 min read
MaGe Linux Operations
Jul 16, 2023 · Cloud Native

How Datree Stops Kubernetes Misconfigurations: A Quick Hands‑On Guide

Datree is an open‑source CLI that validates Kubernetes YAML files against customizable policies, preventing misconfigurations without needing cluster access; this guide explains why it’s needed, how it works, and demonstrates a step‑by‑step example fixing resource limits in a deployment.

CLI · Datree · YAML validation
0 likes · 4 min read
Alibaba Cloud Native
Jul 16, 2023 · Cloud Native

Why Pods Suddenly Fail with “Address not available” and How to Fix It

When a Kubernetes pod that previously created connections stops working and reports “Address not available”, the root cause is often exhaustion of the kernel’s ephemeral (local) port range, which can be resolved by adjusting net.ipv4.ip_local_port_range and aligning it with ServiceNodePortRange settings.

EADDRNOTAVAIL · Pod · Port Range
0 likes · 12 min read
MaGe Linux Operations
Jul 15, 2023 · Cloud Native

How to Secure Your Kubernetes Clusters with DevSecOps Best Practices

This article explains how to integrate security into the DevOps pipeline for Kubernetes, covering DevSecOps concepts, image protection, role‑based access control, network policies, encryption, etcd safeguarding, and disaster‑recovery strategies to keep clusters safe and releases fast.

DevSecOps · NetworkPolicy · RBAC
0 likes · 21 min read