Tagged articles
3133 articles
Page 8 of 32
DataFunSummit
DataFunSummit
Aug 7, 2024 · Cloud Native

Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: A Case Study of Uber Presto

This whitepaper examines the industry trend of moving data‑intensive analytics workloads to cloud‑native environments, analyzing how cloud storage cost models affect performance optimization, and presents an Uber Presto case study that reveals fragmented I/O patterns and proposes cost‑effective optimization strategies.

I/O optimizationcloud storage costcloud-native
0 likes · 3 min read
Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: A Case Study of Uber Presto
Alibaba Cloud Native
Alibaba Cloud Native
Aug 7, 2024 · Operations

How iLogtail Achieves Million‑Scale Observability with SRE Practices

This article details how Alibaba Cloud's iLogtail agent, serving tens of thousands of hosts and containers, overcomes unique stability challenges through a comprehensive SRE approach that spans design, development, testing, gray‑release, operations, and customer‑support, ultimately boosting reliability and reducing incident rates.

ObservabilitySREcloud-native
0 likes · 32 min read
How iLogtail Achieves Million‑Scale Observability with SRE Practices
ByteDance Cloud Native
ByteDance Cloud Native
Aug 7, 2024 · Artificial Intelligence

Deploy Stable Diffusion in 5 Minutes with Volcengine’s Continuous Delivery CP

Learn how to quickly launch a Stable Diffusion WebUI service in just five minutes using Volcengine’s cloud‑native continuous delivery platform, which abstracts Kubernetes complexities, provides pre‑configured AI templates, serverless VCI deployment, automatic scaling, API gateway access, and includes a Python client for image generation.

AI deploymentPythonServerless
0 likes · 14 min read
Deploy Stable Diffusion in 5 Minutes with Volcengine’s Continuous Delivery CP
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Aug 7, 2024 · Cloud Native

Top 10 Must‑Know Kubernetes Features to Boost Your Cloud‑Native Apps

This article introduces ten essential Kubernetes technologies—including Helm Charts, Operators, Service Mesh, StatefulSets, CRDs, Network Policies, Pod Disruption Budgets, HPA, Persistent Volumes, and Ingress Controllers—explaining their core concepts, key capabilities, and how they improve application reliability, scalability, and security.

DevOpsService MeshStatefulSets
0 likes · 8 min read
Top 10 Must‑Know Kubernetes Features to Boost Your Cloud‑Native Apps
Go Programming World
Go Programming World
Aug 4, 2024 · Fundamentals

Applying the Adapter Pattern in Go: From Payment Systems to Multi‑Cloud Management and Model Training Platforms

This article explains the Adapter design pattern, illustrates its real‑world analogy, demonstrates a complete Go implementation for payment processing, and shows how the pattern can be used in production scenarios such as a multi‑cloud management platform and a model‑training service, highlighting its role in unifying incompatible interfaces.

Adapter PatternDesign PatternsGo
0 likes · 13 min read
Applying the Adapter Pattern in Go: From Payment Systems to Multi‑Cloud Management and Model Training Platforms
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 2, 2024 · Operations

How Alibaba Cloud’s ARMS Go Agent Enables Zero‑Intrusion Monitoring for Go Microservices

This article explains how Alibaba Cloud's ARMS team collaborated with the language and compiler team to create an OpenTelemetry‑based, compile‑time automatic instrumentation solution for Golang applications, detailing its background, technical workflow, key optimizations, and practical impact on cloud‑native monitoring.

ARMSAutomatic InstrumentationGo
0 likes · 13 min read
How Alibaba Cloud’s ARMS Go Agent Enables Zero‑Intrusion Monitoring for Go Microservices
Practical DevOps Architecture
Practical DevOps Architecture
Aug 2, 2024 · Cloud Native

In-depth Understanding of Kubernetes Networking: Architecture Design and Performance Optimization

This comprehensive course provides a deep dive into Kubernetes networking, covering core concepts, network models, policies, plugins, traffic management, security, multi‑tenant, cross‑cloud, and hybrid‑cloud designs, and offers hands‑on performance optimization techniques across more than 60 video modules.

CNICalicoCilium
0 likes · 4 min read
In-depth Understanding of Kubernetes Networking: Architecture Design and Performance Optimization
Huolala Tech
Huolala Tech
Aug 1, 2024 · Big Data

How Huolala’s Big Data Team Cut Costs and Boosted Efficiency with an Elastic Architecture

Huolala’s three‑year‑old big data team shares how they tackled cost, operations, and analysis inefficiencies by building a layered, elastic infrastructure, adopting ARM servers, automating workflows, embracing cloud‑native practices, and implementing multi‑engine routing, achieving 20‑30% cost savings and higher performance.

cloud-nativecost optimizationelastic scaling
0 likes · 12 min read
How Huolala’s Big Data Team Cut Costs and Boosted Efficiency with an Elastic Architecture
Top Architect
Top Architect
Jul 31, 2024 · Backend Development

Why Adopt Microservices? Principles, Splitting Strategies, DDD Relationship, and Common Implementation Techniques

The article explains why microservices have become a mainstream architectural choice, outlines the shortcomings of monolithic systems, presents core design principles and various service‑splitting methods, discusses the synergy with Domain‑Driven Design, compares microservices with other architectures, and lists typical technologies used in practice.

Backend ArchitectureDDDService Splitting
0 likes · 23 min read
Why Adopt Microservices? Principles, Splitting Strategies, DDD Relationship, and Common Implementation Techniques
FunTester
FunTester
Jul 31, 2024 · Cloud Native

Improving Test Environment Stability with Containerized One-Box and Soft‑Isolation Solutions

The article analyzes why test environments are inherently less stable than production, identifies frequent changes as the root cause, and proposes two container‑based approaches—One‑Box for small services and soft isolation for large microservice systems—plus automated health and business inspections to achieve reasonable, cost‑effective stability.

ContainerizationOperationscloud-native
0 likes · 13 min read
Improving Test Environment Stability with Containerized One-Box and Soft‑Isolation Solutions
Linux Ops Smart Journey
Linux Ops Smart Journey
Jul 30, 2024 · Cloud Native

Unveiling Kubernetes: Inside the Cosmic Architecture Powering Cloud Native Apps

Amid the digital transformation era, Kubernetes has become essential for modern cloud computing, and this article demystifies its inner workings by detailing its master and node components, service discovery, storage orchestration, networking, high availability, flexible resource management, and thriving ecosystem.

architecturecloud-nativecontainer orchestration
0 likes · 5 min read
Unveiling Kubernetes: Inside the Cosmic Architecture Powering Cloud Native Apps
Baidu Geek Talk
Baidu Geek Talk
Jul 29, 2024 · Cloud Native

How Baidu’s Jarvis2.0 Redefined Cloud‑Native Microservice Governance

This article examines Baidu's Jarvis2.0 platform, detailing how its multi‑runtime architecture, unified control plane, and automated deployment pipelines transformed a sprawling microservice ecosystem of over 1,000 services into a highly efficient, cloud‑native system that dramatically reduced release times, improved stability, and enabled seamless multi‑language support.

BaiduGovernanceJarvis2.0
0 likes · 15 min read
How Baidu’s Jarvis2.0 Redefined Cloud‑Native Microservice Governance
58 Tech
58 Tech
Jul 29, 2024 · Databases

HBase Cloud Migration: Architecture, Challenges, and Solutions

This technical report details the background, architecture, construction, core issues, migration plans, and future roadmap of moving 58's HBase clusters to a cloud‑native environment, highlighting cost reduction, operational automation, and performance optimizations.

Big DataDatabasesHBase
0 likes · 22 min read
HBase Cloud Migration: Architecture, Challenges, and Solutions
Architect
Architect
Jul 28, 2024 · Backend Development

Why Microservices? Benefits, Principles, and Practical Implementation Guide

This article provides a comprehensive analysis of microservices, covering why they are adopted, the drawbacks of monolithic architectures, core characteristics, splitting principles and strategies, the relationship with Domain‑Driven Design, comparisons with other architectural styles, and a detailed overview of the common technical stack used to build and operate microservice systems.

Backend ArchitectureDDDService Splitting
0 likes · 22 min read
Why Microservices? Benefits, Principles, and Practical Implementation Guide
MaGe Linux Operations
MaGe Linux Operations
Jul 26, 2024 · Operations

How to Install and Use Velero for Kubernetes Backup and Migration

This guide introduces Velero, an open-source Kubernetes backup and migration tool, explains its architecture and workflow, and provides step-by-step instructions for installing the client and server, configuring object storage with MinIO, and managing backups and restores.

Velerocloud-nativekubernetes
0 likes · 10 min read
How to Install and Use Velero for Kubernetes Backup and Migration
Linux Ops Smart Journey
Linux Ops Smart Journey
Jul 25, 2024 · Cloud Native

Master Docker & Containerd Image Push/Pull: Secure Registry Configuration

This guide explains how to configure Docker and Containerd to correctly set image push and pull parameters for both HTTP and HTTPS registries, covering insecure registry settings, certificate management, host resolution, and command‑line examples using docker, ctr, crictl, and nerdctl to ensure efficient and secure container image handling.

DockerImage Registrycloud-native
0 likes · 12 min read
Master Docker & Containerd Image Push/Pull: Secure Registry Configuration
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 19, 2024 · Cloud Native

Integrating Dubbo Applications with Alibaba Cloud Service Mesh (ASM): Best Practices and Transition Scenarios

This guide explains how to connect Dubbo microservices to Alibaba Cloud Service Mesh (ASM) to obtain mTLS, observability, and traffic management, covering prerequisites, step‑by‑step deployment, code examples, a registry‑based transition approach, and recommended best‑practice configurations.

ASMDubboService Mesh
0 likes · 29 min read
Integrating Dubbo Applications with Alibaba Cloud Service Mesh (ASM): Best Practices and Transition Scenarios
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jul 17, 2024 · Operations

How NetEase Cloud Music Automated Massive Service Upgrades with a Custom Platform

This article presents a comprehensive case study of NetEase Cloud Music's automatic upgrade platform, detailing the background challenges, technical architecture, sidecar versus component upgrades, workflow orchestration, operational safeguards, performance metrics, and future roadmap for large‑scale microservice migrations.

OperationsUpgradeautomation
0 likes · 17 min read
How NetEase Cloud Music Automated Massive Service Upgrades with a Custom Platform
Ctrip Technology
Ctrip Technology
Jul 16, 2024 · Backend Development

Implementation and Optimization of the QUIC Protocol in the Trip.com App

This article details the deployment of QUIC in Trip.com’s mobile app, covering multi‑process architecture, containerized upgrades, service discovery, health monitoring, push‑pull resilience, full‑link tracing, congestion‑control algorithm redesign, and the resulting performance and reliability improvements achieved across global users.

Backend DevelopmentCongestion ControlQUIC
0 likes · 25 min read
Implementation and Optimization of the QUIC Protocol in the Trip.com App
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 16, 2024 · Cloud Native

Auto‑Instrumenting Java in GraalVM Native Images: A Practical OTel Solution

This article explains how Alibaba Cloud engineers tackled the challenges of long startup times and high memory usage in Java serverless workloads by designing a static Java Agent instrumentation that works with GraalVM Native Image, preserving OpenTelemetry observability while dramatically improving performance.

Java AgentOpenTelemetrycloud-native
0 likes · 13 min read
Auto‑Instrumenting Java in GraalVM Native Images: A Practical OTel Solution
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 16, 2024 · Cloud Native

How to Seamlessly Migrate Your Self‑Hosted Prometheus + Thanos to Alibaba Cloud Managed Prometheus

This guide explains why many users still run self‑built Prometheus + Thanos, outlines the common deployment scenarios and pain points, and provides detailed step‑by‑step migration procedures—including metric collection, visualization, and alerting—for moving to Alibaba Cloud's fully managed Prometheus service across Kubernetes, ECS, and IDC environments.

Alibaba CloudPrometheuscloud-native
0 likes · 14 min read
How to Seamlessly Migrate Your Self‑Hosted Prometheus + Thanos to Alibaba Cloud Managed Prometheus
Top Architecture Tech Stack
Top Architecture Tech Stack
Jul 16, 2024 · Cloud Native

Designing Fault‑Tolerant Microservices Architecture: Patterns and Practices

The article explains how to build reliable microservices by isolating failures, applying graceful degradation, change‑management, health checks, self‑healing, fallback caching, retry strategies, rate limiting, fast‑fail principles, circuit breakers, and failure‑testing to ensure high availability in distributed cloud‑native systems.

Circuit BreakerOperationsRate Limiting
0 likes · 14 min read
Designing Fault‑Tolerant Microservices Architecture: Patterns and Practices
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 5, 2024 · Cloud Native

Koordinator v1.5.0 Release: New Features and Enhancements

Koordinator v1.5.0, the 13th major release since its open‑source debut, introduces pod‑level NUMA alignment, Terway network QoS, core scheduling, and numerous performance and stability improvements, while also being accepted as a CNCF Sandbox project and outlining future roadmap plans.

Core SchedulingNUMAScheduling
0 likes · 14 min read
Koordinator v1.5.0 Release: New Features and Enhancements
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 4, 2024 · Cloud Native

How to Install and Use Go Agent for Cloud‑Native Application Monitoring on ACK

This guide explains how Go Agent solves the manual instrumentation problem in Go microservices by injecting monitoring code at compile time, details its implementation using AST analysis and -toolexec, lists supported SDKs and product features, and provides step‑by‑step instructions for installing the ARMS component, compiling Go binaries, granting permissions, and enabling monitoring on Alibaba Cloud ACK clusters.

ARMSGoapplication monitoring
0 likes · 16 min read
How to Install and Use Go Agent for Cloud‑Native Application Monitoring on ACK
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 29, 2024 · Backend Development

Evolution of System Architecture: From Monolithic to Cloud‑Native Microservices

This article traces the historical evolution of backend system architecture—from early single‑machine monolithic deployments through database isolation, static‑dynamic separation, front‑back separation, clustering, caching, message queues, sharding, SOA, microservices, containerization, CI/CD, and finally cloud‑native designs—highlighting the motivations, challenges, and key technologies at each stage.

System architectureci/cdcloud-native
0 likes · 33 min read
Evolution of System Architecture: From Monolithic to Cloud‑Native Microservices
Ops Development Stories
Ops Development Stories
Jun 28, 2024 · Cloud Native

Multi-Cluster Kubernetes: Benefits, Federation, Karmada, and Practical Tips

This article explains why organizations adopt multi‑cluster Kubernetes for high availability, hybrid‑cloud scaling, and fault isolation, outlines the preparatory steps, compares Federation v1 and v2, introduces Karmada as a CNCF project, and shares practical non‑federated deployment, monitoring, traffic management, and migration techniques with code examples.

DevOpsFederationKarmada
0 likes · 18 min read
Multi-Cluster Kubernetes: Benefits, Federation, Karmada, and Practical Tips
TAL Education Technology
TAL Education Technology
Jun 27, 2024 · Cloud Native

Case Study: Integrating the AiFenxi BI Platform with Apache APISIX Gateway for Improved Performance and Stability

This case study details how the AiFenxi business intelligence platform integrated Apache APISIX as a high‑performance API gateway within Tencent Cloud TKE, addressing latency, scalability, and security challenges, and outlines the architectural changes, deployment steps, and resulting performance improvements.

APISIXBI platformOperations
0 likes · 7 min read
Case Study: Integrating the AiFenxi BI Platform with Apache APISIX Gateway for Improved Performance and Stability
Practical DevOps Architecture
Practical DevOps Architecture
Jun 24, 2024 · Backend Development

Comprehensive Backend Development and Cloud Native Training Curriculum

This document outlines a detailed training curriculum covering backend development, cloud-native technologies, microservice frameworks, databases, messaging systems, containerization, orchestration, performance tuning, and related tools, presented as a series of numbered sessions with practical labs and theory.

DevOpsPerformance tuningcloud-native
0 likes · 7 min read
Comprehensive Backend Development and Cloud Native Training Curriculum
DataFunSummit
DataFunSummit
Jun 22, 2024 · Cloud Native

Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of migrating data‑intensive analytics workloads to cloud‑native environments, revealing how cloud storage’s unique cost model demands finer‑grained performance optimization, and presents Uber Presto case‑study findings that expose fragmented I/O patterns and associated financial impacts.

Cost ModelI/O optimizationcloud-native
0 likes · 3 min read
Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto
Architect's Guide
Architect's Guide
Jun 22, 2024 · Databases

Understanding TiDB: NewSQL Features, Architecture, and Comparison with MySQL and NoSQL

This article provides a comprehensive overview of TiDB, a cloud‑native NewSQL database that combines the scalability of NoSQL with the strong consistency of traditional relational databases, detailing its architecture, high‑availability design, HTAP capabilities, MySQL compatibility, and practical application scenarios.

HTAPMySQL compatibilityNewSQL
0 likes · 23 min read
Understanding TiDB: NewSQL Features, Architecture, and Comparison with MySQL and NoSQL
Alibaba Cloud Observability
Alibaba Cloud Observability
Jun 20, 2024 · Cloud Native

How to Achieve Unified Multi‑Cluster Monitoring with Alibaba Cloud Prometheus and ACK One

This article explains how enterprises can use Alibaba Cloud's ACK One platform together with the Prometheus‑based Observability service to build a unified, cloud‑native monitoring solution for heterogeneous, multi‑region Kubernetes clusters, addressing scalability, cost, and operational challenges.

ACK OneMulti-Cluster MonitoringPrometheus
0 likes · 12 min read
How to Achieve Unified Multi‑Cluster Monitoring with Alibaba Cloud Prometheus and ACK One
dbaplus Community
dbaplus Community
Jun 17, 2024 · Cloud Native

Why Terraform Should Retire: Embracing Kubernetes Operator Alternatives

The article critiques Terraform’s architectural quirks, drift‑management challenges, licensing controversy, and community split, then argues that adopting Kubernetes operator‑based infrastructure management offers a more modular, collaborative, and cloud‑agnostic solution for modern DevOps teams.

Operatorcloud-nativeinfrastructure-as-code
0 likes · 13 min read
Why Terraform Should Retire: Embracing Kubernetes Operator Alternatives
Volcano Engine Developer Services
Volcano Engine Developer Services
Jun 14, 2024 · Operations

How ByteDance Built an EB‑Scale Log Service: Design & Optimization

This article details the evolution of ByteDance's TLS (Tinder Log Service) from a Loki‑based prototype to an EB‑scale, cloud‑native log system, covering its core properties, data organization, architecture, caching, hybrid storage, private codec, ecosystem compatibility, intelligent features, and real‑world case studies.

ByteDanceData StorageTLS
0 likes · 24 min read
How ByteDance Built an EB‑Scale Log Service: Design & Optimization
Alibaba Cloud Observability
Alibaba Cloud Observability
Jun 13, 2024 · Cloud Native

Kickstart Your Observability Journey with Alibaba Cloud Monitoring

This guide introduces new Alibaba Cloud users to the fundamentals of cloud observability, explaining the metric‑trace‑log stack, the layered monitoring pyramid, and step‑by‑step how to set up out‑of‑the‑box resource monitoring, dashboards, alerts, and advanced integration options.

Alibaba CloudMetricscloud monitoring
0 likes · 7 min read
Kickstart Your Observability Journey with Alibaba Cloud Monitoring
TAL Education Technology
TAL Education Technology
Jun 11, 2024 · Cloud Native

Containerizing Elasticsearch: Architecture Upgrade, API Gateway Integration, and Cloud‑Native Migration

This article details how a large‑scale Elasticsearch deployment was transformed from physical servers to a Kubernetes‑based, containerized architecture, addressing cost, scalability, API compatibility, security, observability, and multi‑cloud migration using the 极限网关 API gateway.

ElasticsearchMigrationapi-gateway
0 likes · 11 min read
Containerizing Elasticsearch: Architecture Upgrade, API Gateway Integration, and Cloud‑Native Migration
Alibaba Cloud Native
Alibaba Cloud Native
Jun 9, 2024 · Cloud Native

Deploy a SpringAI Chatbot in Minutes with Alibaba Cloud AppStack

This guide walks you through obtaining free ECS resources and a DashScope API‑key, initializing a SpringAI application with the AppStack template, deploying it via the built‑in pipeline, and verifying the AI chatbot runs successfully, while also showing optional template‑based batch upgrades.

ChatbotSpringAIappstack
0 likes · 6 min read
Deploy a SpringAI Chatbot in Minutes with Alibaba Cloud AppStack
DataFunSummit
DataFunSummit
Jun 8, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry shift of moving data‑intensive analytics to cloud‑native platforms, analyzes how cloud storage cost models affect performance optimization, and presents Uber Presto case‑study findings that reveal fragmented access patterns and the financial impact of traditional I/O strategies in the cloud.

Cost ModelI/O optimizationcloud-native
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
Tencent Cloud Developer
Tencent Cloud Developer
Jun 7, 2024 · Cloud Native

Multi-AZ High‑Availability Architecture of Tencent Cloud TDMQ for Apache Pulsar

Tencent Cloud TDMQ for Apache Pulsar achieves multi‑AZ high availability by containerizing ZooKeeper, BookKeeper and Brokers, using managed ZK, persistent cloud disks and elastic NICs, enforcing quorum and rack‑aware replicas, and planning cross‑region bidirectional replication to ensure seamless disaster recovery and continuous messaging.

Multi‑AZPulsarTDMQ
0 likes · 15 min read
Multi-AZ High‑Availability Architecture of Tencent Cloud TDMQ for Apache Pulsar
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 6, 2024 · Cloud Native

Argo Workflows vs Jenkins: Building Cloud‑Native CI/CD Pipelines on ACK One Serverless

Argo Workflows, a cloud‑native Kubernetes job orchestrator, offers superior autoscaling, concurrency, cost efficiency, and seamless integration with the Argo ecosystem compared to Jenkins, and the article provides a detailed tutorial on deploying a Go‑based CI pipeline on ACK One Serverless Argo with BuildKit, NAS caching, and parameterized workflow templates.

Argo WorkflowsBuildKitci/cd
0 likes · 15 min read
Argo Workflows vs Jenkins: Building Cloud‑Native CI/CD Pipelines on ACK One Serverless
Cloud Native Technology Community
Cloud Native Technology Community
Jun 5, 2024 · Cloud Native

Which Multi‑Cluster Networking Solution Is Right for Your Kubernetes Fleet?

With Kubernetes clusters expanding across multiple environments, choosing the right cross‑cluster networking method is crucial; this article compares six approaches—underlay networks, overlay CNI meshes, Submariner, Skupper, KubeSlice, and others—detailing their core principles, advantages, and limitations to help you select the best fit.

CNIMulti-Clustercloud-native
0 likes · 11 min read
Which Multi‑Cluster Networking Solution Is Right for Your Kubernetes Fleet?
ByteDance Cloud Native
ByteDance Cloud Native
Jun 3, 2024 · Cloud Native

How to Build High‑Availability Kubernetes Clusters with Volcengine VKE & VCI

This guide explains how Volcengine's VKE (Kubernetes Engine) and VCI (Elastic Container Instance) enable high‑availability, multi‑AZ deployments, covering cluster creation, control‑plane distribution, virtual node configuration, inventory‑aware scheduling, and practical YAML examples for resilient cloud‑native workloads.

Container ServiceServerlessVolcengine
0 likes · 14 min read
How to Build High‑Availability Kubernetes Clusters with Volcengine VKE & VCI
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 31, 2024 · Cloud Native

Best Practices for Deploying AI Model Inference on Knative

This guide explains how to efficiently deploy AI model inference services on Knative by externalizing model data, using Fluid for accelerated loading, configuring secrets, ImageCache, graceful shutdown, probes, autoscaling parameters, mixed ECS/ECI resources, shared GPU scheduling, and observability features to achieve fast scaling, low cost, and high elasticity.

AI Model InferenceAutoscalingBest Practices
0 likes · 19 min read
Best Practices for Deploying AI Model Inference on Knative
360 Smart Cloud
360 Smart Cloud
May 27, 2024 · Databases

Deployment, Optimization, and Management of TiDB Service in 360 Zhihui Cloud

This article details the product models, usage scenarios, and a series of performance and operational optimizations—including query plan health checks, space reclamation, resource isolation, cloud‑native deployment, cross‑region high availability, and unified monitoring—implemented for the TiDB service operated by 360 Zhihui Cloud since its launch in April 2023.

DatabasePerformance OptimizationTiDB
0 likes · 17 min read
Deployment, Optimization, and Management of TiDB Service in 360 Zhihui Cloud
ITPUB
ITPUB
May 26, 2024 · Cloud Native

Containerizing Elasticsearch & ClickHouse on Kubernetes: Bilibili’s Scalable, Low‑Cost Solution

This article details Bilibili’s journey of containerizing Elasticsearch and ClickHouse on Kubernetes, covering the challenges of stateful services, architectural decisions, custom operators, storage and network solutions, deployment steps, observability enhancements, and the resulting cost, quality, and efficiency gains.

ClickHouseElasticsearchLVM
0 likes · 38 min read
Containerizing Elasticsearch & ClickHouse on Kubernetes: Bilibili’s Scalable, Low‑Cost Solution
iQIYI Technical Product Team
iQIYI Technical Product Team
May 24, 2024 · Operations

High Availability and Disaster Recovery Practices of iQIYI's Video Relay Service (VRS)

iQIYI’s Video Relay Service ensures uninterrupted video playback by employing a two‑region, three‑center hybrid cloud architecture, multi‑layer storage, cross‑AZ retry mechanisms, protective rate‑limiting and degradation paths, layered monitoring, and rigorous stress‑testing and chaos engineering to achieve high availability and disaster recovery.

Backend ArchitectureVideo Streamingcloud-native
0 likes · 18 min read
High Availability and Disaster Recovery Practices of iQIYI's Video Relay Service (VRS)
MaGe Linux Operations
MaGe Linux Operations
May 23, 2024 · Cloud Native

How to Slash Server Load: Optimizing Go Apps on Kubernetes

This article explains how to reduce server load and improve performance of Go applications on Kubernetes by using lightweight base images, fine‑tuning resource limits, adjusting garbage‑collection settings, employing connection pools, configuring health probes, and leveraging autoscaling, load balancing, and caching techniques.

GoPerformance Optimizationcloud-native
0 likes · 19 min read
How to Slash Server Load: Optimizing Go Apps on Kubernetes
DaTaobao Tech
DaTaobao Tech
May 22, 2024 · Cloud Native

AONE Serverless Quality Assurance: Design, Testing, and Monitoring

The article explains how AONE Serverless separates development and operations domains to enable independent iteration and lower costs, details a QA workflow—functional regression, performance testing, monitoring verification, reverse‑engineered interfaces, automated API traffic replay, and isolated pressure testing— and reports deployment build time cuts of 17% and overall deployment reductions up to 44%, while outlining challenges and future plans for layered automation and plugin‑based extensions.

Serverlesscloud-nativequality assurance
0 likes · 9 min read
AONE Serverless Quality Assurance: Design, Testing, and Monitoring
Huolala Tech
Huolala Tech
May 21, 2024 · Information Security

How Huolala Built a Comprehensive Security Asset Map for Cloud‑Native Environments

Huolala’s Information Security team built a comprehensive security asset library and visualization framework, detailing asset pain points, mapping methodology, detection and drawing modules, and measurable outcomes, to enhance asset visibility, risk assessment, and continuous security operations in a cloud‑native environment.

Graph DatabaseInformation SecuritySecurity Operations
0 likes · 12 min read
How Huolala Built a Comprehensive Security Asset Map for Cloud‑Native Environments
Continuous Delivery 2.0
Continuous Delivery 2.0
May 21, 2024 · Operations

Platform Engineering: Enhancing Developer Experience and Operational Efficiency

The article explains how platform engineering, as an evolution of DevOps, aims to reduce developers' cognitive load by providing integrated, user‑friendly tools and workflows, discusses real‑world practices such as ByteDance's CI/CD pipeline, and outlines future trends toward process‑orientation, intelligence, and scalability.

DevOpsDeveloper ExperiencePlatform Engineering
0 likes · 4 min read
Platform Engineering: Enhancing Developer Experience and Operational Efficiency
DataFunSummit
DataFunSummit
May 20, 2024 · Big Data

Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture

This article explains how CloudLakehouse’s Multi‑Cluster elastic architecture enables high‑concurrency, low‑latency real‑time analytics on data lakes by addressing storage‑compute separation, dynamic caching, and automated scaling, providing a cost‑effective solution for customer‑facing data products.

Multi-ClusterReal-time Analyticscloud-native
0 likes · 18 min read
Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture
360 Smart Cloud
360 Smart Cloud
May 20, 2024 · Cloud Native

Design and Implementation of Managed (Dedicated) Kubernetes Clusters

This article introduces the concept of managed Kubernetes clusters, compares them with traditional setups, details the architecture and implementation process, discusses challenges such as cross‑cluster webhook routing, certificate issuance, container runtime integration, and outlines the diverse features of the dedicated cluster solution.

CertificateManaged Clustercloud-native
0 likes · 13 min read
Design and Implementation of Managed (Dedicated) Kubernetes Clusters
DataFunSummit
DataFunSummit
May 19, 2024 · Cloud Native

Design and Implementation of a Cloud‑Native Recommendation System Architecture

This article explains how to design and implement a recommendation system by leveraging a four‑layer cloud‑native stack, covering virtualization, micro‑service migration, service governance, elasticity, cloud‑native business capabilities, and chaos‑engineering‑based stability practices to achieve cost‑effective, high‑performance, and reliable recommendation services.

Virtualizationarchitecturechaos engineering
0 likes · 10 min read
Design and Implementation of a Cloud‑Native Recommendation System Architecture
Architect's Tech Stack
Architect's Tech Stack
May 18, 2024 · Operations

Graceful Shutdown in Kubernetes and Spring Boot Microservices: Best Practices and Optimizations

This article explains the concept of graceful shutdown, outlines essential steps, examines Kubernetes pod termination and Spring Boot integration with Nacos, and provides practical optimizations—including PreStop hooks, terminationGracePeriod settings, and actuator shutdown—to ensure reliable service termination without data loss.

Graceful ShutdownNacosOperations
0 likes · 11 min read
Graceful Shutdown in Kubernetes and Spring Boot Microservices: Best Practices and Optimizations
DataFunTalk
DataFunTalk
May 18, 2024 · Artificial Intelligence

Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions

This article details the background, goals, and evolution of Tencent's FinTech AI development platform, outlines the technical challenges faced in feature engineering, model training, and inference services, and presents the comprehensive solutions and future plans implemented to improve efficiency, stability, and scalability.

FinTechcloud-nativeinference
0 likes · 13 min read
Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions
Selected Java Interview Questions
Selected Java Interview Questions
May 16, 2024 · Backend Development

Rethinking Microservices in 2023: From Google’s “Microservices 2.0” to Amazon’s Monolith Shift

In 2023, leading cloud giants like Google and Amazon publicly questioned the value of microservices, presenting new “Microservices 2.0” principles and monolithic alternatives that promise dramatically lower latency, cost, and operational complexity while reshaping backend architecture debates.

cloud-nativemicroservicesmonolith
0 likes · 12 min read
Rethinking Microservices in 2023: From Google’s “Microservices 2.0” to Amazon’s Monolith Shift
360 Smart Cloud
360 Smart Cloud
May 15, 2024 · Cloud Native

Polefs: A Cloud‑Native Distributed Cache File System for AI Training Workloads

The article outlines the challenges of massive AI training data, defines storage performance requirements, and presents Polefs—a cloud‑native distributed cache file system with unified storage, metadata acceleration, and read/write caching designed to improve GPU utilization and reduce data redundancy.

AIDistributed File SystemGPU utilization
0 likes · 14 min read
Polefs: A Cloud‑Native Distributed Cache File System for AI Training Workloads
Yang Money Pot Technology Team
Yang Money Pot Technology Team
May 14, 2024 · Cloud Native

Optimizing CI/CD Pipeline and Release Strategies for Microservices in a Cloud‑Native Environment

This article details a comprehensive overhaul of a company's CI/CD workflow for Java, Python, Go, and Node.js microservices, introducing automated pipelines, parallel builds, rolling, canary, and blue‑green deployments on Kubernetes with Istio to improve release speed, stability, and observability.

ci/cdcloud-nativerelease-management
0 likes · 15 min read
Optimizing CI/CD Pipeline and Release Strategies for Microservices in a Cloud‑Native Environment
dbaplus Community
dbaplus Community
May 13, 2024 · Cloud Native

Do You Really Need Kubernetes? Real‑World Dev Opinions and Practical Tips

This article compiles diverse Zhihu answers discussing whether Kubernetes is necessary, weighing its automation benefits and scaling power against configuration complexity, resource costs, and team readiness, while offering concrete kubectl commands and guidance for making an informed adoption decision.

Operationsci/cdcloud-native
0 likes · 9 min read
Do You Really Need Kubernetes? Real‑World Dev Opinions and Practical Tips
MaGe Linux Operations
MaGe Linux Operations
May 10, 2024 · Cloud Native

How to Set Up a Secure Kubernetes Dashboard Without RBAC

Learn step-by-step how to download the Kubernetes dashboard YAML, prepare offline images, modify the deployment to use a private registry, configure the API server address, create the service, verify its status, and access the dashboard via a browser, all without RBAC.

Dockercloud-nativedashboard
0 likes · 3 min read
How to Set Up a Secure Kubernetes Dashboard Without RBAC
360 Tech Engineering
360 Tech Engineering
May 10, 2024 · Cloud Native

Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360

This article shares the architecture design, cluster deployment, storage selection, multi‑region mode, service discovery, performance tuning, monitoring, alerting, and future plans of a production‑grade Apache Pulsar platform running on Kubernetes, providing valuable insights for engineers adopting Pulsar.

Apache PulsarDistributed MessagingPerformance Optimization
0 likes · 17 min read
Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360
360 Smart Cloud
360 Smart Cloud
May 9, 2024 · Cloud Native

Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360

This article shares 360's practical experience deploying Apache Pulsar on Kubernetes, covering architecture design, storage choices, multi‑cluster modes, service discovery, performance tuning, monitoring, alerting, and future plans, offering valuable guidance for engineers building cloud‑native messaging platforms.

Apache PulsarDistributed MessagingPerformance Optimization
0 likes · 19 min read
Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360
JD Cloud Developers
JD Cloud Developers
May 9, 2024 · Operations

Unlocking IT Architecture Governance: From Theory to JD.com’s Real‑World Success

This article explains the origins and concepts of IT architecture and governance, compares traditional and internet‑era enterprise structures, reviews major standards such as COBIT, ITIL and IT4IT, outlines practical governance methods, and showcases JD.com’s concrete implementation across infrastructure, applications, and organizational processes.

COBITEnterprise ArchitectureIT Governance
0 likes · 30 min read
Unlocking IT Architecture Governance: From Theory to JD.com’s Real‑World Success
Alibaba Cloud Observability
Alibaba Cloud Observability
May 8, 2024 · Cloud Computing

Mastering FinOps: A Practical Guide to Cloud Cost Management and Optimization

FinOps combines finance and DevOps to create a culture of transparent, predictable cloud spending, guiding enterprises through preparation, strategy alignment, shared responsibility, cost insight, optimization tactics, and ongoing control, enabling them to balance agility with disciplined cost management in cloud-native environments.

Cloud Cost ManagementFinOpscloud-native
0 likes · 12 min read
Mastering FinOps: A Practical Guide to Cloud Cost Management and Optimization
Liangxu Linux
Liangxu Linux
May 7, 2024 · Cloud Native

Unlocking Docker: From Containers to Compose, Swarm, and Kubernetes

This article explains Docker’s role as a container platform, covering basic concepts like images and Dockerfiles, the architecture of Docker Engine, and how Docker Compose, Swarm, and Kubernetes extend container orchestration across multiple services and hosts.

ContainersDockerDockerfile
0 likes · 13 min read
Unlocking Docker: From Containers to Compose, Swarm, and Kubernetes
DevOps Cloud Academy
DevOps Cloud Academy
May 6, 2024 · Cloud Native

How to Deploy a Highly Available Application on Kubernetes

This article explains key Kubernetes configurations—such as pod replicas, pod anti‑affinity, deployment strategies, graceful termination, probes, resource allocation, scaling, and disruption budgets—to achieve high availability and zero‑downtime deployments for containerized applications in production.

ProbesScalingcloud-native
0 likes · 20 min read
How to Deploy a Highly Available Application on Kubernetes
DataFunTalk
DataFunTalk
May 6, 2024 · Big Data

OPPO Next‑Generation Big Data & AI Integrated Architecture on Functional Cloud

This article presents OPPO’s next‑generation big‑data and AI integrated architecture on functional cloud, detailing a cloud‑native elastic compute framework, a unified data‑lake solution, real‑time feature platforms, machine‑learning data acceleration, and hybrid‑cloud deployments, highlighting performance gains and cost reductions.

Big Datacloud-nativeelastic computing
0 likes · 11 min read
OPPO Next‑Generation Big Data & AI Integrated Architecture on Functional Cloud