Tag

system optimization

1 views collected around this technical thread.

Practical DevOps Architecture
Practical DevOps Architecture
Mar 14, 2025 · Backend Development

Comprehensive Java Senior Engineer Curriculum: Architecture Design, System Optimization, and Advanced Topics

This curriculum outlines a 25‑week advanced Java engineering program covering performance optimization, JVM internals, concurrency, distributed storage, MySQL and NoSQL databases, Docker, Kubernetes, microservices, Spring framework deep dives, Nginx, Elasticsearch, Redis, messaging systems, CI/CD tools, and hands‑on project implementation to build high‑performance, scalable backend systems.

DockerJVMJava
0 likes · 3 min read
Comprehensive Java Senior Engineer Curriculum: Architecture Design, System Optimization, and Advanced Topics
DeWu Technology
DeWu Technology
Feb 26, 2025 · Backend Development

Migrating to Rust: A Case Study in High-Performance Computing

Migrating a Java computing layer to Rust yielded dramatic performance gains—30% lower CPU usage, 70% less memory—and greater stability, as the authors explain how Rust’s ownership, borrowing, lifetimes, and concurrency, combined with optimized data handling, FFI integration, Tokio async, Docker deployment, and monitoring, outweigh the steep learning curve and ecosystem gaps.

FFIHigh Performance ComputingMemory Management
0 likes · 22 min read
Migrating to Rust: A Case Study in High-Performance Computing
Baidu Tech Salon
Baidu Tech Salon
Jan 8, 2025 · Artificial Intelligence

Evolution of Video Search Ranking Architecture Toward an End‑to‑End Large‑Model Framework

The paper describes transforming a tightly coupled, multi‑stage video search ranking pipeline into a modular, end‑to‑end large‑model architecture that decouples recall, employs a graph‑engine parallel framework and elastic compute allocation, thereby boosting performance, flexibility, personalization and lowering long‑term operational costs.

Parallel Computingelastic resourcesend-to-end
0 likes · 10 min read
Evolution of Video Search Ranking Architecture Toward an End‑to‑End Large‑Model Framework
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jan 2, 2025 · Artificial Intelligence

Xiaohongshu's Self-developed RLHF System for Multimodal Large Language Models: Design, Optimization, and Performance

Xiaohongshu’s team unveiled a self‑developed RLHF system that trains multimodal large language models using heterogeneous and homogeneous network architectures, extensive PPO optimizations, and Medusa speculative sampling, achieving over 50% throughput gains, reduced hardware needs, and 5‑20% performance improvements on zero‑shot benchmarks.

MedusaPPOPRM
0 likes · 21 min read
Xiaohongshu's Self-developed RLHF System for Multimodal Large Language Models: Design, Optimization, and Performance
System Architect Go
System Architect Go
Dec 15, 2024 · Operations

A Structured Framework for Optimizing System Components

This article presents a comprehensive framework for optimizing system components such as databases, caches, message queues, and search engines, covering environment configuration, data structure organization, client‑side usage, and inter‑component collaboration, illustrated with MySQL, Redis, Kafka, and Elasticsearch examples.

Message Queuescachingcomponent tuning
0 likes · 6 min read
A Structured Framework for Optimizing System Components
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Dec 10, 2024 · Backend Development

Understanding High Concurrency in Nginx: Configuration Tips and Best Practices

This article explains what constitutes high concurrency for Nginx, outlines the hardware and software factors that affect its performance, and provides concrete configuration examples such as worker_processes, worker_connections, and other tuning directives to help achieve stable high‑traffic handling.

Nginxbackend performanceconfiguration tuning
0 likes · 4 min read
Understanding High Concurrency in Nginx: Configuration Tips and Best Practices
Sohu Tech Products
Sohu Tech Products
Sep 11, 2024 · Backend Development

Optimizing an Audit System Processing Chain for Traffic Spikes Using Chain of Responsibility and Sliding‑Window Detection

To prevent audit pipelines from choking during traffic surges, the article proposes augmenting the classic Chain of Responsibility with a sliding‑window anomaly detector, a statistically‑driven dynamic detector, a rule‑based backflow mechanism, and AI‑powered traffic forecasting, enabling adaptive routing and proactive scaling.

JavaSliding Windowbackend
0 likes · 16 min read
Optimizing an Audit System Processing Chain for Traffic Spikes Using Chain of Responsibility and Sliding‑Window Detection
Lobster Programming
Lobster Programming
Apr 24, 2024 · Backend Development

Boost Your System’s Concurrency: Proven Strategies from Load Balancing to CDN

Learn practical ways to enhance a system’s concurrent handling capacity, including traffic distribution via load balancers, database sharding and clustering, message queues, caching layers, CDNs, code and database optimizations, and scaling hardware resources such as servers, CPUs, and memory.

Load Balancingcachingconcurrency
0 likes · 4 min read
Boost Your System’s Concurrency: Proven Strategies from Load Balancing to CDN
37 Interactive Technology Team
37 Interactive Technology Team
Jul 21, 2023 · Cloud Native

Automated Performance Testing Architecture Using Alibaba Cloud PTS

The article explains how automated performance testing using Alibaba Cloud PTS—integrated with VPC‑based Kubernetes clusters, GitLab CI/CD pipelines, and cloud APIs—overcomes the inaccuracy, coordination delays, and resource waste of manual testing by providing on‑demand, scalable load generation, scenario orchestration, real‑time metrics, and cost‑effective reporting.

Alibaba Cloud PTSCI/CDCloud Computing
0 likes · 4 min read
Automated Performance Testing Architecture Using Alibaba Cloud PTS
DataFunTalk
DataFunTalk
Jul 3, 2023 · Artificial Intelligence

Exploring Large‑Scale Graph Neural Network Applications and System Optimizations at Tencent

This article reviews Tencent's research on large‑scale graph neural networks, covering three memory/computation paradigms, the SAGA extension of the GAS model, various system optimizations such as graph partitioning and feature transmission, and future challenges including non‑message‑passing models and geometric information.

AIGraph Neural Networksdistributed systems
0 likes · 14 min read
Exploring Large‑Scale Graph Neural Network Applications and System Optimizations at Tencent
Tencent Cloud Developer
Tencent Cloud Developer
Jan 9, 2023 · Operations

Linux I/O Optimization: Zero-Copy Techniques

The article explains Linux I/O optimization through zero‑copy techniques—such as mmap + write, sendfile, and splice—detailing memory hierarchy, the benefits of reducing user‑kernel copies, the suitability of async + direct I/O for large file transfers, real‑world uses like Kafka and Nginx, and inherent platform limitations.

DMALinux I/OMMU
0 likes · 32 min read
Linux I/O Optimization: Zero-Copy Techniques
Architect
Architect
Jul 8, 2022 · Backend Development

Understanding Feed Stream Architecture: Models, Storage, and Optimization

This article explains the concept of feed streams, compares push and pull implementation models, discusses storage options such as MySQL, Redis SortedSet and Cassandra, and presents optimization techniques including online‑push/offline‑pull strategies, pagination methods, and deep‑paging solutions for large‑scale systems.

CassandraRedisbackend architecture
0 likes · 9 min read
Understanding Feed Stream Architecture: Models, Storage, and Optimization
IT Architects Alliance
IT Architects Alliance
Dec 12, 2021 · Operations

System Performance Issue Analysis and Optimization Process for Business Applications

The article outlines a comprehensive process for diagnosing and optimizing performance problems in production business systems, covering causes such as high concurrency, data growth, hardware constraints, and detailing analysis of hardware, OS, database, middleware, JVM settings, code inefficiencies, and the role of monitoring and APM tools.

Database TuningJVMPerformance analysis
0 likes · 13 min read
System Performance Issue Analysis and Optimization Process for Business Applications
IT Architects Alliance
IT Architects Alliance
Nov 20, 2021 · Operations

Analysis and Optimization of Business System Performance

This article outlines a comprehensive approach to diagnosing and optimizing performance problems in production business systems, covering analysis processes, hardware, OS, database, middleware, JVM tuning, code inefficiencies, and monitoring techniques to identify root causes and improve system reliability.

Database TuningJVM Tuningmonitoring
0 likes · 16 min read
Analysis and Optimization of Business System Performance
Architect
Architect
May 18, 2021 · Big Data

Design and Optimization of Baidu's Image Processing and Ingestion Platform (Imazon) for Multimodal Retrieval

This article details Baidu's multimodal retrieval architecture, explaining the separation of online and offline services, the design of the Imazon image processing and ingestion platform, its technical indicators, large‑scale streaming and batch pipelines, optimization practices for high throughput, and the underlying content‑relationship engine.

DAGImage Processingbig data
0 likes · 13 min read
Design and Optimization of Baidu's Image Processing and Ingestion Platform (Imazon) for Multimodal Retrieval
DataFunTalk
DataFunTalk
Feb 21, 2021 · Artificial Intelligence

Advances in Pre‑Ranking for Large‑Scale Advertising: The COLD Framework and Its Technical Evolution

This article reviews the development history, technical routes, and recent breakthroughs of pre‑ranking (coarse ranking) in large‑scale advertising systems, focusing on Alibaba's COLD (Computing‑power‑cost‑aware Online and Lightweight Deep) framework, its model design, engineering optimizations, experimental results, and future research directions.

COLDadvertisingmachine learning
0 likes · 20 min read
Advances in Pre‑Ranking for Large‑Scale Advertising: The COLD Framework and Its Technical Evolution
Qunar Tech Salon
Qunar Tech Salon
Jan 21, 2021 · Backend Development

Design and Implementation of Qunar's High‑Concurrency Instant Messaging System

The article details Qunar's in‑house instant messaging platform, covering its purpose, protocol choices (XMPP with protocol‑buffer optimization), architecture based on ejabberd, message flow, reliability mechanisms, extensions such as bots and HTTP APIs, as well as extensive system‑level tuning for high‑concurrency TCP connections.

ErlangInstant MessagingXMPP
0 likes · 15 min read
Design and Implementation of Qunar's High‑Concurrency Instant Messaging System
Architecture Digest
Architecture Digest
Jan 17, 2021 · Operations

System Performance Issue Analysis and Optimization Process

This article outlines a comprehensive process for diagnosing and optimizing performance problems in production business systems, covering hardware, OS, database, middleware, JVM tuning, code inefficiencies, monitoring tools, and the limitations of pre‑release testing, with practical guidelines and visual references.

APMDatabase TuningJVM
0 likes · 16 min read
System Performance Issue Analysis and Optimization Process
DataFunTalk
DataFunTalk
Jan 4, 2021 · Artificial Intelligence

Personalized Computing‑Power Allocation for Alibaba Display Advertising: Transformers Engine and DCAF Algorithm

The article presents Alibaba's display‑advertising team’s three‑stage computing‑power efficiency evolution, introduces the DCAF personalized power‑allocation algorithm with its Lagrangian formulation, and describes the AllSpark dynamic‑control framework that together enable a flexible, resource‑aware Transformers engine achieving significant business gains during high‑traffic events.

algorithmic co-designcomputing powerdeep learning
0 likes · 21 min read
Personalized Computing‑Power Allocation for Alibaba Display Advertising: Transformers Engine and DCAF Algorithm
Top Architect
Top Architect
Sep 10, 2020 · Operations

Elasticsearch Performance Tuning Guide: Configuration, System, and Usage Optimizations

This article provides a comprehensive guide to improving Elasticsearch performance and stability by covering configuration file tweaks, system‑level settings, and usage‑level optimizations such as hot‑thread analysis, pending tasks, field storage, translog handling, refresh intervals, shard management, and best practices for routing and alias usage.

Cluster ConfigurationPerformance TuningSearch Engine
0 likes · 20 min read
Elasticsearch Performance Tuning Guide: Configuration, System, and Usage Optimizations