The Battle of the Compressors: Optimizing Spark Workloads with
Hello! Hope you’re having a wonderful time working with challenging issues around Data and Data Engineering. In this article let’s look at the different compression algorithms Apache Spark offers…
Spark Release 3.3.0 is Here. Fourth release of 3.X is out with tons…, by Senior Brogrammer
Spark catalyst optimizer and query optimization, by krishnaprasad k
Spark Series: Partition Discovery & Production Learning, by Archana Goyal
PySpark — Read Compressed gzip files, by Subham Khandelwal
Load Data using EMR Spark with Apache Iceberg, by Vishal Khondre
The Battle of the Compressors: Optimizing Spark Workloads with ZStd, Snappy and More for Parquet, by Siraj
Optimizing Apache Spark File Compression with LZ4 or Snappy, by Matthew Salminen
Spark catalyst optimizer and query optimization, by krishnaprasad k
Optimizing Apache Spark File Compression with LZ4 or Snappy, by Matthew Salminen
Picking the right compression for high volume data transfer, by Murali Suraparaju
Bucketing: Are you leveraging it in a right way ?, by Aditya Sahu, Curious Data Catalog