Tag

ARROW

1 views collected around this technical thread.

DataFunTalk
DataFunTalk
Jul 23, 2024 · Big Data

Practical Experience with Apache Kyuubi and Apache Celeborn in Big Data Platforms

This article shares detailed practical experiences from DingXiangYuan's big‑data platform on using Apache Kyuubi and Apache Celeborn, covering architecture, flexible configuration, AuthZ fine‑grained permissions, small‑file and Z‑Order optimizations, Arrow‑based large result transmission, and operational tips such as connection‑level issues and Netty cache handling.

ARROWApache CelebornApache Kyuubi
0 likes · 17 min read
Practical Experience with Apache Kyuubi and Apache Celeborn in Big Data Platforms
Python Programming Learning Circle
Python Programming Learning Circle
Jan 13, 2024 · Fundamentals

Comprehensive Guide to Python Time Handling Modules (time, datetime, pytz, dateutil, Arrow, ISO 8601)

This article provides a thorough overview of Python's time‑related libraries—including the built‑in time and datetime modules, third‑party pytz, dateutil, and Arrow—explaining core concepts such as timestamps, time zones, daylight‑saving time, and ISO 8601 parsing with clear code examples and usage patterns.

ARROWDateTimePython
0 likes · 27 min read
Comprehensive Guide to Python Time Handling Modules (time, datetime, pytz, dateutil, Arrow, ISO 8601)
DataFunSummit
DataFunSummit
Nov 25, 2023 · Big Data

Practical Experience with Apache Kyuubi and Celeborn on the DXY Big Data Platform

This article presents a comprehensive technical overview of how DXY's big data platform leverages Apache Kyuubi and Celeborn to unify Spark entry points, configure flexible task isolation, implement fine‑grained AuthZ, optimize small files and Z‑Order sorting, and accelerate large result set transmission with Arrow, while also discussing operational challenges and upcoming features.

ARROWApache KyuubiCeleborn
0 likes · 17 min read
Practical Experience with Apache Kyuubi and Celeborn on the DXY Big Data Platform
Efficient Ops
Efficient Ops
Feb 15, 2023 · Operations

How China Agricultural Bank’s ARROW Platform Mastered DevOps Continuous Delivery

The article details China Agricultural Bank’s ARROW platform achieving third‑level DevOps continuous delivery certification, outlining its end‑to‑end pipeline, quality gates, metric‑driven improvements, and how these practices boost code quality, delivery speed, and support the bank’s digital transformation.

ARROWContinuous DeliveryDevOps
0 likes · 8 min read
How China Agricultural Bank’s ARROW Platform Mastered DevOps Continuous Delivery
Python Programming Learning Circle
Python Programming Learning Circle
Sep 10, 2021 · Fundamentals

Comprehensive Guide to Python Time Handling: time, datetime, pytz, dateutil, and Arrow

This article provides a detailed overview of Python's time‑handling ecosystem, covering fundamental concepts of seconds, GMT, UTC and leap seconds, the low‑level time module, high‑level datetime classes, timezone manipulation with pytz, flexible parsing with dateutil, and the user‑friendly Arrow library, complete with code examples and usage notes.

ARROWDateTimedateutil
0 likes · 24 min read
Comprehensive Guide to Python Time Handling: time, datetime, pytz, dateutil, and Arrow
DataFunTalk
DataFunTalk
Dec 24, 2019 · Big Data

Deep Dive into PySpark Implementation: Multi‑Process Architecture, Java Integration, RDD/SQL Interfaces, Executor Communication, and Pandas UDF

This article explains PySpark's multi‑process architecture, how the Python driver uses Py4J to call Java/Scala APIs, the implementation of RDD and DataFrame interfaces, executor‑side process communication and serialization with Arrow, and the design of Pandas UDFs, while also discussing current limitations and future directions.

ARROWPySparkPython
0 likes · 13 min read
Deep Dive into PySpark Implementation: Multi‑Process Architecture, Java Integration, RDD/SQL Interfaces, Executor Communication, and Pandas UDF