Optimizing Spark Timestamp Columns in Parquet Files
How to improve compression and query performance for high-cardinality timestamp columns in Apache Spark by switching from INT96 to INT64 encoding
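As a quick illustration of the switch the subtitle describes: since Spark 2.3, the `spark.sql.parquet.outputTimestampType` setting controls how timestamp columns are physically encoded when writing Parquet. A minimal sketch, assuming a Spark version where the legacy `INT96` encoding is still the default:

```
# spark-defaults.conf (sketch): write timestamps as INT64 with microsecond
# precision (TIMESTAMP_MICROS) instead of the legacy INT96 encoding
spark.sql.parquet.outputTimestampType  TIMESTAMP_MICROS
```

The same option can also be set per session, e.g. `spark.conf.set("spark.sql.parquet.outputTimestampType", "TIMESTAMP_MICROS")`, so it can be applied to individual write jobs without changing cluster-wide defaults.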