Tagged articles
4062 articles
Page 7 of 41
Linux Ops Smart Journey
Linux Ops Smart Journey
Apr 25, 2025 · Cloud Native

How to Seamlessly Migrate Calico from etcd to Kubernetes Datastore

Learn step‑by‑step how to transition Calico’s data store from etcd to Kubernetes, covering prerequisite checks, locking the datastore, exporting and importing data, reconfiguring calicoctl, applying the new manifests, and unlocking the store, while highlighting benefits and tips for a smooth migration.

CalicoKubernetesNetwork Policy
0 likes · 11 min read
How to Seamlessly Migrate Calico from etcd to Kubernetes Datastore
Cloud Native Technology Community
Cloud Native Technology Community
Apr 24, 2025 · Cloud Native

Simplifying Kubernetes Application Deployment with Korifi and Optimizing Cloud Costs using OpenCost

Korifi provides a Cloud Foundry‑style abstraction layer that simplifies Kubernetes application deployment and management, while OpenCost offers comprehensive, multi‑cloud cost visibility and optimization; together they enable developers to deploy apps easily, monitor pod‑level expenses, and make data‑driven resource allocation decisions.

DevOpsKorifiKubernetes
0 likes · 7 min read
Simplifying Kubernetes Application Deployment with Korifi and Optimizing Cloud Costs using OpenCost
Raymond Ops
Raymond Ops
Apr 23, 2025 · Cloud Native

Secure Kubernetes Secrets with kubeseal: Installation & Usage Guide

This article explains how to install kubeseal and its controller, create and encrypt Kubernetes Secret manifests, apply sealed secrets, and provides useful tips on API routes, custom certificates, and RBAC to securely manage secrets in a cloud‑native environment.

KubeSealKubernetesSealed Secrets
0 likes · 5 min read
Secure Kubernetes Secrets with kubeseal: Installation & Usage Guide
Go Programming World
Go Programming World
Apr 22, 2025 · Artificial Intelligence

Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)

This article presents a comprehensive overview of building an enterprise‑level LLMOPS platform—including concept definitions, the relationship between LLMOPS, MLOps and intelligent agent platforms, four development tiers, architecture layers, core technical concerns, deployment options, and the benefits of cloud‑native AI development.

AI platformDevOpsGo
0 likes · 15 min read
Design and Implementation of an Enterprise‑Grade LLMOPS Platform (EasyAI)
IT Xianyu
IT Xianyu
Apr 21, 2025 · Cloud Native

Step-by-Step Guide to Setting Up a Kubernetes 1.19 Cluster on CentOS 7.9

This guide walks through preparing two CentOS 7.9 servers, installing Docker and Kubernetes 1.19 components, initializing a master node, joining a worker node, and validating the cluster with a sample Nginx deployment, including common troubleshooting tips.

CalicoCentOSCluster Setup
0 likes · 10 min read
Step-by-Step Guide to Setting Up a Kubernetes 1.19 Cluster on CentOS 7.9
Selected Java Interview Questions
Selected Java Interview Questions
Apr 18, 2025 · Cloud Native

One‑Click Deployment of Spring Cloud Microservices Using Jenkins, Docker, and Kubernetes

This guide walks through a complete one‑click Jenkins pipeline that pulls Spring Cloud source code from Git, builds it with Maven, packages the JAR into a Docker image, pushes the image to a registry, and finally deploys or updates the service on Kubernetes, covering SSH‑key setup, pipeline scripting, and Kubernetes resource definitions.

DockerJenkinsKubernetes
0 likes · 16 min read
One‑Click Deployment of Spring Cloud Microservices Using Jenkins, Docker, and Kubernetes
IT Services Circle
IT Services Circle
Apr 17, 2025 · Backend Development

McDonald's Java Backend Interview Experience and Technical Q&A

This article shares a candidate's interview experience at McDonald's China Technology R&D Center, detailing salary expectations, work schedule, and an extensive technical Q&A covering IoC/DI, MyBatis, JVM memory and garbage‑collection algorithms, MySQL storage engines, concurrency, locking, RPC, HTTPS, Docker/Kubernetes commands, and Git workflows, while also including some promotional material.

BackendDockerKubernetes
0 likes · 22 min read
McDonald's Java Backend Interview Experience and Technical Q&A
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Apr 17, 2025 · Cloud Native

Kubernetes Architecture and Core Principles Explained

This article provides a comprehensive overview of Kubernetes, covering its cloud‑native architecture, core components such as API Server, Scheduler, Controller Manager, etcd, kubelet and kube‑proxy, and explains the workflow that enables automated deployment, scaling and management of containerized applications.

DevOpsKubernetesarchitecture
0 likes · 6 min read
Kubernetes Architecture and Core Principles Explained
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 17, 2025 · Cloud Native

OpenKruise 1.8 Release Highlights: In‑Place VPA, StatefulSet Volume Expansion, AI WorkloadSpread, Serverless Probe, SidecarSet Gray‑Release, and Helm Pre‑Delete Hook

OpenKruise 1.8, the latest CNCF‑incubated cloud‑native automation suite, introduces in‑place vertical pod autoscaling, native StatefulSet volume expansion, AI‑aware WorkloadSpread, serverless probe support, sidecar gray‑release capabilities, and a Helm pre‑delete safety hook, all backed by detailed YAML examples and future roadmap.

InPlaceVPAKubernetesOpenKruise
0 likes · 13 min read
OpenKruise 1.8 Release Highlights: In‑Place VPA, StatefulSet Volume Expansion, AI WorkloadSpread, Serverless Probe, SidecarSet Gray‑Release, and Helm Pre‑Delete Hook
vivo Internet Technology
vivo Internet Technology
Apr 16, 2025 · Big Data

Offline Mixed Deployment of Spark Tasks on Kubernetes: Containerization, Scheduling, and Elastic Resource Management

The article explains how the vivo Internet Big Data team containerized offline Spark jobs and deployed them with the Spark Operator on a mixed online‑offline Kubernetes cluster, using elastic scheduling and resource‑over‑subscription to boost CPU utilization by 30‑40% and handle over 100,000 daily tasks.

Big DataContainerizationKubernetes
0 likes · 36 min read
Offline Mixed Deployment of Spark Tasks on Kubernetes: Containerization, Scheduling, and Elastic Resource Management
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 16, 2025 · Artificial Intelligence

Optimizing Multi‑Node Distributed LLM Inference with ACK Gateway and vLLM

This article presents a step‑by‑step guide for deploying and optimizing large‑language‑model inference across multiple GPU‑enabled nodes using ACK Gateway with Inference Extension, vLLM’s tensor‑ and pipeline‑parallel techniques, and Kubernetes resources such as LeaderWorkerSet, PVCs, and custom routing policies, followed by performance benchmarking and analysis.

ACK GatewayKubernetesLLM
0 likes · 19 min read
Optimizing Multi‑Node Distributed LLM Inference with ACK Gateway and vLLM
Ops Development Stories
Ops Development Stories
Apr 15, 2025 · Cloud Native

Boost Kubernetes Management with AI: Introducing the Lightweight k8m Console

This article introduces k8m, a lightweight AI‑enhanced console for Kubernetes that simplifies cluster management, installation, configuration, and daily operations, while offering features such as YAML auto‑translation, AI‑driven event and log diagnostics, command generation, multi‑cluster support, and role‑based access control.

AIDevOpsKubernetes
0 likes · 13 min read
Boost Kubernetes Management with AI: Introducing the Lightweight k8m Console
Cloud Native Technology Community
Cloud Native Technology Community
Apr 11, 2025 · Cloud Native

How Kube-OVN Enables Seamless Live Migration for KubeVirt VMs

This article explains the challenges of live‑migrating KubeVirt virtual machines, how Kube‑OVN addresses network‑bridge limitations and IP changes, provides the required VM annotation, step‑by‑step migration commands, and details the multi‑stage migration process that keeps network interruption under 0.5 seconds with no TCP break.

Kube-OVNKubeVirtKubernetes
0 likes · 7 min read
How Kube-OVN Enables Seamless Live Migration for KubeVirt VMs
21CTO
21CTO
Apr 9, 2025 · Operations

9 Must‑Have Container Monitoring Tools and Best Practices for Modern Cloud‑Native Environments

This article reviews nine practical container‑monitoring solutions—from Last9 and Prometheus to Dynatrace and Elastic Observability—detailing their key features, pricing, and why developers prefer them, and then offers comprehensive best‑practice guidance for metrics, tagging, alerts, and advanced observability strategies in Kubernetes‑driven cloud‑native deployments.

DevOpsKubernetesMetrics
0 likes · 25 min read
9 Must‑Have Container Monitoring Tools and Best Practices for Modern Cloud‑Native Environments
Java Architect Essentials
Java Architect Essentials
Apr 7, 2025 · Backend Development

Comprehensive Guide to Using Ctrip’s Apollo Distributed Configuration Center with Spring Boot

This article provides a detailed tutorial on Apollo, Ctrip’s open‑source distributed configuration center, covering its core concepts, features, model, four‑dimensional management, client design, deployment architecture, high‑availability considerations, step‑by‑step project creation, Spring Boot integration, Kubernetes deployment, and practical testing procedures.

ApolloConfiguration ManagementKubernetes
0 likes · 26 min read
Comprehensive Guide to Using Ctrip’s Apollo Distributed Configuration Center with Spring Boot
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Apr 7, 2025 · Cloud Native

How to Perform a Zero‑Downtime Kubernetes 1.30.x Upgrade

This guide explains how to upgrade a Kubernetes cluster from v1.30.0 to v1.30.1 without service interruption by backing up etcd, checking health, planning the rollout, upgrading master nodes and Calico, and using rolling updates and Istio canary releases for seamless application migration.

CalicoIstioKubernetes
0 likes · 8 min read
How to Perform a Zero‑Downtime Kubernetes 1.30.x Upgrade
FunTester
FunTester
Apr 5, 2025 · Cloud Native

Using Fabric8 OpenShift Client DSL to Manage OpenShift Resources

This guide demonstrates how to use the Fabric8 OpenShift client DSL in Java to initialize an OpenShift client, create and manage resources such as DeploymentConfig, Route, BuildConfig, Project, ImageStream, and various policies, and perform listing and deletion operations with code examples.

Client DSLFabric8Kubernetes
0 likes · 11 min read
Using Fabric8 OpenShift Client DSL to Manage OpenShift Resources
FunTester
FunTester
Apr 4, 2025 · Cloud Native

Using the Kubernetes Java Client: CSR, SharedInformers, ListOptions, DeleteOptions, WatchOptions, LogOptions and Resource Operations

This article explains how to use the Kubernetes Java client to create, approve, and deny CertificateSigningRequests, work with SharedInformers, list and delete resources with ListOptions and DeleteOptions, watch resources, retrieve logs, serialize objects to YAML, and run Pods, providing practical code examples and best‑practice recommendations.

CSRKubernetesSharedInformer
0 likes · 7 min read
Using the Kubernetes Java Client: CSR, SharedInformers, ListOptions, DeleteOptions, WatchOptions, LogOptions and Resource Operations
Cloud Native Technology Community
Cloud Native Technology Community
Apr 2, 2025 · Databases

Three Cloud‑Native Approaches to Deploying PostgreSQL: CloudNativePG, Korifi, and Neon

The article examines three modern, cloud‑native and open‑source methods for deploying PostgreSQL—using the CloudNativePG operator on Kubernetes, leveraging Korifi’s self‑service platform, and adopting Neon’s compute‑storage separation—highlighting trends toward cloud‑native infrastructure, self‑service, and open‑source SaaS alternatives.

CloudNativePGDatabase DeploymentKorifi
0 likes · 6 min read
Three Cloud‑Native Approaches to Deploying PostgreSQL: CloudNativePG, Korifi, and Neon
Java Architect Essentials
Java Architect Essentials
Mar 31, 2025 · Backend Development

Spring Boot vs Quarkus: Performance, Migration, and Real‑World Comparison

This article compares Spring Boot and Quarkus by outlining their architectures, running a reactive‑API benchmark with JMeter and VisualVM, presenting startup, build‑time, memory, CPU and response‑time results for JVM and native images, and offering a step‑by‑step guide for migrating Spring developers to Quarkus with code examples.

KubernetesNative ImagesQuarkus
0 likes · 17 min read
Spring Boot vs Quarkus: Performance, Migration, and Real‑World Comparison
MaGe Linux Operations
MaGe Linux Operations
Mar 31, 2025 · Cloud Native

How to Enable Kubernetes in Docker Desktop (Step‑by‑Step Guide)

This guide walks you through enabling Kubernetes in Docker Desktop on Windows or macOS, covering prerequisite setup, configuring registry mirrors, loading required images via a PowerShell script, activating the K8s service, and accessing the Kubernetes Dashboard for cluster visualization.

Docker DesktopKubernetesPowerShell
0 likes · 8 min read
How to Enable Kubernetes in Docker Desktop (Step‑by‑Step Guide)
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2025 · Cloud Native

Quickly Deploy a Kubernetes Cluster with KubeKey: Step‑by‑Step Guide

This guide walks you through setting the download region, installing KubeKey and required dependencies, generating a configuration file, creating a single‑node or multi‑node Kubernetes cluster with containerd, and verifying the deployment using kubectl, all with clear command examples and screenshots.

CalicoCluster DeploymentKubekey
0 likes · 5 min read
Quickly Deploy a Kubernetes Cluster with KubeKey: Step‑by‑Step Guide
Java Captain
Java Captain
Mar 29, 2025 · Backend Development

Comparative Performance Analysis of Spring Boot and Quarkus for Cloud‑Native Java Microservices

This article compares Spring Boot and Quarkus by describing their architectures, presenting a reactive test application that uses PostgreSQL, detailing a JMeter performance test plan, and analyzing results such as startup time, build size, CPU, memory, and response latency to help developers decide when to migrate from Spring Boot to Quarkus.

KubernetesNative ImagesQuarkus
0 likes · 14 min read
Comparative Performance Analysis of Spring Boot and Quarkus for Cloud‑Native Java Microservices
Ops Development & AI Practice
Ops Development & AI Practice
Mar 27, 2025 · Cloud Native

Master Kustomize: Simplify Kubernetes Configs with Generators and Transformers

Kustomize, built into kubectl, lets you declaratively manage Kubernetes YAML by organizing base resources, dynamically generating ConfigMaps and Secrets, applying transformers for environment‑specific tweaks, and optionally validating output, enabling a clean Base + Overlay workflow that reduces duplication and simplifies multi‑environment configuration.

Configuration ManagementDevOpsKubernetes
0 likes · 8 min read
Master Kustomize: Simplify Kubernetes Configs with Generators and Transformers
Linux Ops Smart Journey
Linux Ops Smart Journey
Mar 27, 2025 · Cloud Native

Stream Kubernetes Events to Elasticsearch with Kafka & Logstash

This guide walks you through deploying the Kubernetes Event Exporter, packaging its Helm chart and Docker image, configuring Kafka TLS secrets, setting up Logstash to ingest events and forward them to Elasticsearch, creating an index template, and verifying the end‑to‑end pipeline.

ElasticsearchEvent ExporterKafka
0 likes · 10 min read
Stream Kubernetes Events to Elasticsearch with Kafka & Logstash
StarRocks
StarRocks
Mar 27, 2025 · Databases

How JD Logistics Boosted Query Speed and Cut Costs with StarRocks Storage‑Compute Separation

JD Logistics transformed its one‑stop self‑service analytics platform, UData, by migrating from an integrated storage‑compute architecture to a storage‑compute separated design powered by StarRocks, achieving sub‑10‑second P95/P99 query latency, reducing storage costs by 90%, and cutting compute expenses around 30% while supporting massive data volumes.

Cost ReductionData PlatformKubernetes
0 likes · 20 min read
How JD Logistics Boosted Query Speed and Cut Costs with StarRocks Storage‑Compute Separation
ITPUB
ITPUB
Mar 26, 2025 · Cloud Native

How KubeBlocks Enables Scalable, Automated Redis on Kubernetes at Kuaishou

This article details Kuaishou's migration of massive Redis clusters to Kubernetes using the KubeBlocks Operator, covering architecture, multi‑layer management requirements, federated cluster deployment, custom controllers, performance and stability considerations, and the resulting operational benefits.

KubeBlocksKubernetesOperator
0 likes · 15 min read
How KubeBlocks Enables Scalable, Automated Redis on Kubernetes at Kuaishou
MaGe Linux Operations
MaGe Linux Operations
Mar 25, 2025 · Cloud Native

Mastering Canary Releases with Argo Rollout: A Step‑by‑Step Guide

This article explains the concept of canary (gray) releases, shows how to set up separate production and canary environments on Kubernetes, configure traffic splitting via Ingress annotations or headers, and automate the whole process with Argo Rollout, including detailed YAML examples and verification commands.

Argo RolloutCanary ReleaseDeployment Automation
0 likes · 17 min read
Mastering Canary Releases with Argo Rollout: A Step‑by‑Step Guide
FunTester
FunTester
Mar 23, 2025 · Operations

The Origin, Development, and Future of Chaos Engineering

Chaos engineering, introduced by Netflix in 2011 to proactively inject failures and test system resilience, has evolved over the past decade into a widely adopted practice integrated with SRE, automation, AI, and Kubernetes, offering best‑practice guidelines and future trends for improving distributed system reliability.

KubernetesReliabilitySRE
0 likes · 8 min read
The Origin, Development, and Future of Chaos Engineering
FunTester
FunTester
Mar 22, 2025 · Cloud Native

Master Kubernetes ReplicationController, ConfigMap, and Secret with Java Client

This guide demonstrates how to use the Fabric8 Kubernetes Java client to load, create, retrieve, update, scale, watch, and delete ReplicationController, ConfigMap, and Secret resources, providing complete code examples and step‑by‑step commands for managing these objects across namespaces in a Kubernetes cluster.

ConfigMapFabric8Kubernetes
0 likes · 14 min read
Master Kubernetes ReplicationController, ConfigMap, and Secret with Java Client
DevOps Cloud Academy
DevOps Cloud Academy
Mar 21, 2025 · Cloud Native

Building a Jenkins CI/CD Pipeline for Java Applications with Docker and Helm

This article provides a step‑by‑step guide to creating a Jenkins CI/CD pipeline for Java applications, covering environment setup, required plugins, project structure, CI stages such as code checkout, dependency installation, testing, SonarQube scanning, Docker image building and pushing, as well as CD stages including Helm chart creation, deployment, health checks, rollback, and maintenance‑window management.

DockerJenkinsKubernetes
0 likes · 11 min read
Building a Jenkins CI/CD Pipeline for Java Applications with Docker and Helm
FunTester
FunTester
Mar 20, 2025 · Cloud Native

Kubernetes Java Client Initialization and DSL Usage Guide

This article demonstrates how to initialize a Kubernetes Java client, configure it with default or custom kubeconfig, and provides comprehensive DSL examples for managing Pods, Services, Deployments, ReplicaSets, including creation, listing, updating, watching, port forwarding, and cleanup, with full code snippets throughout.

DevOpsFabric8Kubernetes
0 likes · 17 min read
Kubernetes Java Client Initialization and DSL Usage Guide
Ops Development Stories
Ops Development Stories
Mar 19, 2025 · Cloud Native

Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices

KubeDoor 1.0 introduces a new architecture for unified multi‑Kubernetes monitoring, offering components for master and agent, flexible deployment options, Helm‑based installation, configurable storage and alerting settings, and detailed guidance on integrating with existing Prometheus/VictoriaMetrics setups while providing automatic peak‑usage data collection.

ClickHouseKubernetesVictoriaMetrics
0 likes · 14 min read
Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices
Tencent Cloud Developer
Tencent Cloud Developer
Mar 19, 2025 · Cloud Native

Kubernetes Monitoring: Why It’s Needed, Core Components, and Metric Exposure

Monitoring Kubernetes is essential to detect resource contention, component failures, and network issues; it involves tracking core component metrics such as API server latency, etcd write times, scheduler delays, as well as node‑level CPU, memory, disk, and network statistics, pod health, and custom application metrics exposed via Prometheus exporters for comprehensive observability.

ExportersKubernetesMetrics
0 likes · 23 min read
Kubernetes Monitoring: Why It’s Needed, Core Components, and Metric Exposure
Python Programming Learning Circle
Python Programming Learning Circle
Mar 18, 2025 · Cloud Native

Automating Kubernetes Operations with the Python Client

This article demonstrates how to use the Python Kubernetes client to programmatically restart deployments, scale them, execute commands inside pods, apply node taints, retrieve cluster metrics, and convert between YAML/JSON and client objects, providing practical code examples for cloud‑native automation.

APIDevOpsKubernetes
0 likes · 8 min read
Automating Kubernetes Operations with the Python Client
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 18, 2025 · Cloud Native

Gray Release of LoRA and Base Models Using ACK Gateway with AI Extension on Kubernetes

This guide explains how to deploy large language model inference services on a GPU-enabled Kubernetes cluster, configure ACK Gateway with AI Extension for intelligent routing and load balancing, and perform gray releases for both LoRA fine‑tuned models and base models such as QwQ‑32B and DeepSeek‑R1, including step‑by‑step commands and validation procedures.

ACK GatewayAI inferenceKubernetes
0 likes · 25 min read
Gray Release of LoRA and Base Models Using ACK Gateway with AI Extension on Kubernetes
MaGe Linux Operations
MaGe Linux Operations
Mar 18, 2025 · Cloud Native

How to Deploy a Kubernetes v1.28.8 Cluster with KubeKey on Ubuntu

This guide walks through configuring three Ubuntu servers, installing KubeKey, creating a Kubernetes v1.28.8 cluster with HAProxy load balancing, deploying a sample nginx workload, and verifying the installation using kubectl and curl, providing all necessary commands and configuration details for a successful deployment.

KubekeyKubernetesUbuntu
0 likes · 13 min read
How to Deploy a Kubernetes v1.28.8 Cluster with KubeKey on Ubuntu
Cloud Native Technology Community
Cloud Native Technology Community
Mar 18, 2025 · Cloud Native

Best Practices for Managing Core Services in Large‑Scale Kubernetes Deployments

Scaling Kubernetes across dozens or hundreds of clusters requires standardized core services—networking, security, observability, and automation—so organizations should adopt templated configurations, GitOps tools, centralized monitoring, and automated certificate management to reduce complexity, improve security, and lower operational overhead.

Cluster ManagementGitOpsKubernetes
0 likes · 8 min read
Best Practices for Managing Core Services in Large‑Scale Kubernetes Deployments
Ops Development & AI Practice
Ops Development & AI Practice
Mar 17, 2025 · Cloud Native

Why EKS API Mode Says ‘Provide Credentials’ Even with Admin Access – The Hidden Access Entry Issue

When using AWS EKS API authentication, many users encounter the “the server has asked for the client to provide credentials” error despite having AdministratorAccess, because the newer API mode requires explicit Access Entries for each IAM identity, a detail often missed in documentation and troubleshooting.

API modeAccess EntriesEKS
0 likes · 9 min read
Why EKS API Mode Says ‘Provide Credentials’ Even with Admin Access – The Hidden Access Entry Issue
Python Programming Learning Circle
Python Programming Learning Circle
Mar 17, 2025 · Cloud Native

Automating Kubernetes Tasks with the Python Client Library

This tutorial demonstrates how to set up a local KinD cluster, configure authentication, use raw curl commands, and employ the official Kubernetes Python client to list pods, create deployments, watch events, and manage RBAC, providing a complete guide for automating Kubernetes operations with Python.

APIDevOpsKubernetes
0 likes · 11 min read
Automating Kubernetes Tasks with the Python Client Library
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 17, 2025 · Cloud Native

Boost LLM Inference with ACK Gateway AI Extension: A Step‑by‑Step Guide

This guide demonstrates how to deploy the QwQ‑32B large language model on an Alibaba Cloud ACK cluster, configure OSS storage, enable the ACK Gateway with AI Extension, set up InferencePool and InferenceModel resources, and benchmark intelligent routing versus standard gateway routing, revealing latency and throughput improvements.

ACK GatewayAI ExtensionKubernetes
0 likes · 16 min read
Boost LLM Inference with ACK Gateway AI Extension: A Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Mar 16, 2025 · Cloud Native

How to Install and Use Portainer for Docker & Kubernetes Management

This guide walks you through installing Portainer, an open‑source Docker and Kubernetes management UI, checking system and Docker versions, running the Portainer server and agent containers, and accessing the web interface to manage local and remote container environments.

Container ManagementDevOpsDocker
0 likes · 5 min read
How to Install and Use Portainer for Docker & Kubernetes Management
Ops Development & AI Practice
Ops Development & AI Practice
Mar 16, 2025 · Cloud Native

Why Quarkus Is Revolutionizing Cloud‑Native Java Development

Quarkus, a Kubernetes‑native Java framework built for GraalVM and HotSpot, delivers millisecond startup, low memory usage, developer‑friendly features, and seamless integration with cloud‑native platforms, making it ideal for microservices, serverless, and modern cloud applications.

Fast StartupKubernetesQuarkus
0 likes · 7 min read
Why Quarkus Is Revolutionizing Cloud‑Native Java Development
MaGe Linux Operations
MaGe Linux Operations
Mar 15, 2025 · Cloud Native

How MetalLB Transforms Load Balancing for Bare‑Metal Kubernetes Clusters

This guide explains Kubernetes Service types, the role of MetalLB in providing LoadBalancer functionality for bare‑metal clusters, step‑by‑step installation, configuration of address pools, testing with a sample service, integration with Ingress, and an overview of the Calico network plugin for pod isolation.

CalicoKubernetesMetalLB
0 likes · 14 min read
How MetalLB Transforms Load Balancing for Bare‑Metal Kubernetes Clusters
Architecture Digest
Architecture Digest
Mar 14, 2025 · Backend Development

Spring Boot vs Quarkus: Performance Comparison, Migration Guide, and Practical Code Samples

This article compares Spring Boot and Quarkus across architecture, startup speed, memory usage, and native image support, presents a JMeter‑based performance test suite, shares migration tips for Spring developers, and provides concrete Java code examples for building reactive microservices on both frameworks.

KubernetesPerformanceQuarkus
0 likes · 13 min read
Spring Boot vs Quarkus: Performance Comparison, Migration Guide, and Practical Code Samples
Practical DevOps Architecture
Practical DevOps Architecture
Mar 14, 2025 · Backend Development

Comprehensive Java Senior Engineer Curriculum: Architecture Design, System Optimization, and Advanced Topics

This curriculum outlines a 25‑week advanced Java engineering program covering performance optimization, JVM internals, concurrency, distributed storage, MySQL and NoSQL databases, Docker, Kubernetes, microservices, Spring framework deep dives, Nginx, Elasticsearch, Redis, messaging systems, CI/CD tools, and hands‑on project implementation to build high‑performance, scalable backend systems.

DockerJVMKubernetes
0 likes · 3 min read
Comprehensive Java Senior Engineer Curriculum: Architecture Design, System Optimization, and Advanced Topics
DevOps Cloud Academy
DevOps Cloud Academy
Mar 13, 2025 · Operations

Release Engineering Best Practices: Branching Models, CI/CD Guidelines, and Deployment Strategies

This article provides a comprehensive overview of release engineering, covering branch models, naming conventions, merge processes, Git commit standards, CI/CD stage design, environment isolation, artifact management, product delivery steps, deployment strategies, and rollback procedures to ensure reliable software releases.

KubernetesOperationsbranching
0 likes · 26 min read
Release Engineering Best Practices: Branching Models, CI/CD Guidelines, and Deployment Strategies
Sohu Tech Products
Sohu Tech Products
Mar 12, 2025 · Cloud Native

Argo Workflows: Container-Native Workflow Engine for Kubernetes

Argo Workflows is an open‑source, container‑native engine that runs on Kubernetes via Custom Resource Definitions, letting users declaratively define complex, step‑or DAG‑based pipelines—including CI/CD, data processing, and machine‑learning jobs—through reusable templates, with a server UI, controller, and pod architecture monitored by Prometheus.

Argo WorkflowsCNCFContainer Native
0 likes · 16 min read
Argo Workflows: Container-Native Workflow Engine for Kubernetes
FunTester
FunTester
Mar 12, 2025 · Operations

Fault Injection Testing: Concepts, Scenarios, Process, and Best Practices

Fault injection testing deliberately introduces failures into a system to assess its resilience, helping identify weak points, improve retry and timeout mechanisms, and ensure robust operation across software, protocol, and infrastructure layers, with practical guidance on processes, tools, and Kubernetes-specific practices.

Fault InjectionKubernetesOperations
0 likes · 8 min read
Fault Injection Testing: Concepts, Scenarios, Process, and Best Practices
dbaplus Community
dbaplus Community
Mar 11, 2025 · Operations

How a Unified White‑Screen Ops Platform Transformed Multi‑Cloud Middleware Management

This article details the challenges of traditional middleware operations, explains how Kubernetes and Operators were leveraged to build a unified, visual, and automated platform that standardizes, automates, and visualizes multi‑cloud resource management, and reports the significant efficiency, cost, and safety gains achieved across dozens of clusters.

KubernetesOperationsOperator
0 likes · 23 min read
How a Unified White‑Screen Ops Platform Transformed Multi‑Cloud Middleware Management
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 11, 2025 · Cloud Native

Implementing Per‑User Rate Limiting with Alibaba Cloud Service Mesh (ASM) Traffic Scheduling Suite

This article explains how to use Alibaba Cloud Service Mesh (ASM) traffic‑scheduling suite to implement rich traffic‑control scenarios such as per‑user rate limiting, request queuing and priority scheduling in a Kubernetes environment, providing step‑by‑step deployment, configuration and verification instructions.

ASMKubernetesTraffic Scheduling
0 likes · 14 min read
Implementing Per‑User Rate Limiting with Alibaba Cloud Service Mesh (ASM) Traffic Scheduling Suite
Ops Development Stories
Ops Development Stories
Mar 10, 2025 · Cloud Native

What Are Kubernetes Core Components and How Do They Work?

This article provides a comprehensive overview of Kubernetes fundamentals, covering core control‑plane and node components, key object differences such as Pod vs Deployment, Service types, ConfigMap vs Secret, scheduling, health checks, scaling, security, storage, and troubleshooting techniques.

ContainersKubernetesPod
0 likes · 19 min read
What Are Kubernetes Core Components and How Do They Work?
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 9, 2025 · Cloud Computing

Deploy QwQ-32B LLM Inference on Alibaba Cloud ACS with vLLM: Step‑by‑Step Guide

This guide walks you through using Alibaba Cloud Container Compute Service (ACS) to provision GPU resources, prepare the QwQ-32B model, configure persistent storage, deploy the model with vLLM, set up OpenWebUI, verify the service, and optionally benchmark its performance, all with detailed commands and YAML examples.

ACSAlibaba CloudBenchmark
0 likes · 17 min read
Deploy QwQ-32B LLM Inference on Alibaba Cloud ACS with vLLM: Step‑by‑Step Guide
Infra Learning Club
Infra Learning Club
Mar 9, 2025 · Cloud Native

How to Fix nvidia-smi Missing GPU Process Info Inside Containers

The article explains why nvidia-smi cannot display GPU processes when run inside a container, analyzes the underlying pid‑namespace isolation and kernel‑level restrictions, and provides three practical solutions—including using hostPid, custom kernel interception modules, and the nvitop tool—plus a workaround for gpu‑operator deployments.

GPUKernel ModuleKubernetes
0 likes · 8 min read
How to Fix nvidia-smi Missing GPU Process Info Inside Containers
Ops Development & AI Practice
Ops Development & AI Practice
Mar 7, 2025 · Cloud Native

Mastering Kubernetes StatefulSets: How to Run Stateful Apps Reliably

This article explains Kubernetes StatefulSets, covering their core concepts, guarantees such as stable network IDs and persistent storage, the controller’s components, deployment workflow, typical use cases, best‑practice recommendations, and a detailed comparison with Deployments to help you manage stateful workloads effectively.

KubernetesStatefulSetcloud-native
0 likes · 8 min read
Mastering Kubernetes StatefulSets: How to Run Stateful Apps Reliably
Practical DevOps Architecture
Practical DevOps Architecture
Mar 7, 2025 · Cloud Native

Kubernetes DNS Resolution Issues and Troubleshooting Guide

This article explains common Kubernetes DNS resolution failures, both for external domains and internal service discovery addresses, and provides a step‑by‑step troubleshooting workflow that includes checking CoreDNS, examining resolv.conf, adjusting DNS settings, and recreating CoreDNS when necessary.

ClusterCoreDNSDNS
0 likes · 6 min read
Kubernetes DNS Resolution Issues and Troubleshooting Guide
ITPUB
ITPUB
Mar 6, 2025 · Cloud Native

Mastering Portainer: Simplify Docker and Kubernetes Management with Easy Deployment

This guide explains what Portainer is, compares its Community and Business editions, details its core architecture, provides step‑by‑step installation using Docker, Docker‑Compose, and Docker‑Stack, and demonstrates key features such as dashboards, container, image, service, volume, and user management for Docker and Kubernetes environments.

Container ManagementDockerKubernetes
0 likes · 43 min read
Mastering Portainer: Simplify Docker and Kubernetes Management with Easy Deployment
Cloud Native Technology Community
Cloud Native Technology Community
Mar 6, 2025 · Information Security

Shift-Down Security: Embedding Security into Cloud‑Native Platforms

The article introduces the Shift‑Down Security model for Kubernetes, explaining how moving security controls into the platform complements Shift‑Left practices, reduces vulnerabilities and configuration errors, and enables collaborative, automated, and adaptive protection across development, operations, and security teams in cloud‑native environments.

DevSecOpsKubernetesShift-Down Security
0 likes · 14 min read
Shift-Down Security: Embedding Security into Cloud‑Native Platforms
vivo Internet Technology
vivo Internet Technology
Mar 5, 2025 · Cloud Native

Beidou Container Operations Management Platform: Architecture, Automation, and Capabilities

The Beidou Operations Management Platform, created by vivo’s Internet Server team, unifies management of over twenty Kubernetes clusters and tens of thousands of nodes, automates scaling, inspections, event collection, and Helm‑based application deployment, achieving more than 90% UI‑driven operations and dramatically improving stability and operational efficiency.

Container ManagementDevOpsKubernetes
0 likes · 20 min read
Beidou Container Operations Management Platform: Architecture, Automation, and Capabilities
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 5, 2025 · Cloud Native

Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud

This article describes how JoinQuant built a cloud‑native quantitative research platform on Alibaba Cloud, identified performance, cost, data‑management, and security challenges, and solved them with Fluid’s JindoRuntime data‑caching, elastic scaling, and Python‑driven workflows, achieving dramatic speed and cost improvements.

Data CachingFluidKubernetes
0 likes · 18 min read
Using Fluid Cloud‑Native Data Caching to Boost Performance and Elasticity of a Quantitative Research Platform on Alibaba Cloud
Practical DevOps Architecture
Practical DevOps Architecture
Mar 5, 2025 · Cloud Native

Kubernetes DNS Resolution Issues and Troubleshooting Guide

This guide explains common Kubernetes DNS problems—including failure to resolve external domains, inter‑pod service discovery addresses, and related impacts on applications like Nginx reverse proxies—and provides step‑by‑step troubleshooting procedures such as checking CoreDNS, inspecting resolv.conf, and customizing dnsPolicy and dnsConfig in pod specifications.

CoreDNSDNSKubernetes
0 likes · 6 min read
Kubernetes DNS Resolution Issues and Troubleshooting Guide
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Mar 4, 2025 · Cloud Native

Koordinator v1.6 Release: Advanced Heterogeneous Device Scheduling and GPU Management Features

The Koordinator v1.6 release introduces a suite of innovations—including GPU topology‑aware scheduling, end‑to‑end GPU & RDMA joint allocation, strong GPU isolation, differentiated GPU scoring, fine‑grained resource reservation, mixed‑workload QoS, and extensive scheduler and rescheduler optimizations—to efficiently manage heterogeneous resources in Kubernetes clusters for AI and high‑performance computing workloads.

GPU schedulingHeterogeneous ResourcesKoordinator
0 likes · 24 min read
Koordinator v1.6 Release: Advanced Heterogeneous Device Scheduling and GPU Management Features
Ops Development Stories
Ops Development Stories
Mar 4, 2025 · Operations

Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes

This guide walks Kubernetes administrators through the full lifecycle of Process Exporter—from lightweight deployment and RBAC setup, through Prometheus Operator integration and Grafana dashboard creation, to detailed configuration and alerting—enabling precise process‑level monitoring and rapid root‑cause analysis.

DaemonSetGrafanaKubernetes
0 likes · 15 min read
Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes
21CTO
21CTO
Mar 2, 2025 · Backend Development

How Swift on Kubernetes Boosted Performance 4× and Cut Costs by 66%

Cultured Code’s case study shows that replacing a Python 2 application with a Swift‑based service running on AWS‑hosted Kubernetes increased average response speed fourfold while reducing compute costs to one‑third, highlighting the appeal and challenges of using Swift for backend development.

Backend DevelopmentCost ReductionKubernetes
0 likes · 4 min read
How Swift on Kubernetes Boosted Performance 4× and Cut Costs by 66%
Ops Development & AI Practice
Ops Development & AI Practice
Feb 27, 2025 · Cloud Native

Boost Kubernetes Efficiency with Offline‑Online Hybrid Deployment

This article explains how to combine online services and offline tasks within a single Kubernetes cluster using offline‑online hybrid deployment, detailing its benefits such as cost savings and higher resource utilization, and walks through practical implementation methods like CronJobs, HPA, priority classes, node affinity, custom schedulers, and the open‑source Koordinator project, while also addressing associated challenges.

KubernetesOffline TasksOnline Services
0 likes · 6 min read
Boost Kubernetes Efficiency with Offline‑Online Hybrid Deployment