Apache Arrow 기반 Zero-copy 아키텍처로 단일 노드 TB급 데이터 처리 구현
Single-Node Data Engineering: DuckDB, DataFusion, Polars, and LakeSail
Single-Node Data Engineering: DuckDB, DataFusion, Polars, and LakeSail
Apache Data Lakehouse Weekly: April 16–22, 2026
What is Apache Arrow? Erasing the Serialization Tax
PySpark to Pandas/scikit-learn: A Practical Migration Guide for Data Engineers Learning ML
Apache Data Lakehouse Weekly: April 3–9, 2026
Parquet Content-Defined Chunking