Past Memory Big Data
Author

A popular big-data architecture channel followed by over 100,000 developers. It publishes articles on Spark, Hadoop, Flink, Kafka, and more. Visit the Past Memory Big Data blog at https://www.iteblog.com, or search "Past Memory" on Google or Baidu.

58 articles · 0 likes · 22 views · 0 comments
Recent Articles

Past Memory Big Data
Aug 31, 2022 · Databases

How to Begin Contributing to an Apache Top‑Level Open Source Project

This guide walks readers through the complete process of joining an Apache top‑level project, using Apache Doris as an example: reading the README, joining mailing lists and chat groups, finding a first issue, forking the repository, making code changes, submitting a pull request, and passing community review.

Apache Doris · Community · Database
0 likes · 12 min read
Past Memory Big Data
Aug 25, 2022 · Industry Insights

Enterprise BI 2022: Essential Insights and Free White Paper Download

The 2022 Enterprise BI Platform White Paper analyzes digital transformation challenges for large enterprises, introduces a Five‑Force BI model, presents case studies from a Fortune‑500 bank, a beverage chain, and a top internet firm, and offers a downloadable full report.

BI Methodology · Case Studies · Digital Transformation
0 likes · 3 min read
Past Memory Big Data
Aug 23, 2022 · Big Data

JD Tech’s Event‑Tracking Data Governance and One‑Stop Platform: Practices and Innovations

The article explains why event‑tracking data needs governance, outlines a full‑link governance methodology, describes the organizational setup, and details the features of JD Tech’s one‑stop tracking management platform, including metadata unification, one‑click validation, real‑time dashboards, visualization tools, and H5‑native data integration.

H5-native integration · data governance · event tracking
0 likes · 16 min read
Past Memory Big Data
Aug 15, 2022 · Big Data

How Pinterest Scaled a Hadoop Upgrade Across 17k Nodes

Pinterest’s Monarch batch‑processing platform, which runs on more than 17,000 YARN nodes in AWS, was upgraded from Hadoop 2.7.1 to 2.10.0 using a phased, cluster‑by‑cluster strategy that combined minimal downtime, extensive validation, and custom patches to handle compatibility and dependency issues.

AWS EC2 · Big Data · Cluster Upgrade
0 likes · 18 min read
Past Memory Big Data
Aug 12, 2022 · Backend Development

Apache DolphinScheduler 3.0.0 Released: Biggest Changes Yet

On August 10, 2022, Apache DolphinScheduler 3.0.0 was officially released. It introduces a brand‑new Vue3‑based UI that is dozens of times faster, extensive AWS support, custom time‑zone handling, task groups, native data‑quality checks, service splitting for container‑native deployment, numerous new task types, Python API enhancements, and a long list of bug fixes and documentation updates.

3.0.0 · AWS integration · Apache DolphinScheduler
0 likes · 16 min read
Past Memory Big Data
Aug 9, 2022 · Big Data

Master the Complete Big Data Ecosystem in One Article

This article provides a comprehensive overview of the big data ecosystem, detailing nine core technology categories, from data collection and storage to computation, analysis, scheduling, and underlying infrastructure, along with tool comparisons and selection guidelines that help readers quickly build a complete big data knowledge system.

Big Data · Data Analysis · Data Collection
0 likes · 12 min read
Past Memory Big Data
Jul 22, 2022 · Big Data

Choosing Modern Data Architecture: Data Fabric vs. Data Mesh

The article compares Data Fabric and Data Mesh as modern data‑architecture approaches, explains their technical and organizational differences, discusses the ongoing debate between data lakes, warehouses, and lakehouses, and highlights how each option fits varying data‑type and usage scenarios.

Data Architecture · Data Fabric · Data Lake
0 likes · 4 min read