Data Engineer building scalable systems

Welcome to my digital garden, where I share learnings and ideas on data and software engineering.

Recent Posts

Building a Second Brain with LLMs

How I built a working knowledge graph of my own work in Obsidian using Andrej Karpathy's LLM Wiki approach.

#ai #knowledge-management #llm #obsidian #claude-code #productivity

slidesmd — Make Your PowerPoint Presentations Queryable by AI

Auto-index your .pptx files into a single agents.md that any AI can read. Drop a file in the folder and agents.md updates automatically. Ask Claude or a local Ollama model anything across all your decks.

#ai #python #cli #productivity #ollama #powerpoint

ClipAI CLI — Local AI Text Assistant for the Terminal

A terminal-native AI text assistant powered by Ollama. No API key, no cloud — runs entirely on your machine with clipboard, pipe, and interactive TUI modes.

#cli #ollama #ai #python #terminal #productivity

Optimizing Spark Timestamp Columns in Parquet Files

How to improve compression and query performance for high-cardinality timestamp columns in Apache Spark by switching from INT96 to INT64 encoding.

#apache-spark #parquet #performance #data-engineering

Effective use of Spark driver

... and stop crushing it!

#apache-spark #data-engineering #pandas #pyspark #performance

Schema Evolution in Databricks Delta Lake

Adapting to changing data...

#delta-lake #databricks #schema-evolution #data-engineering #apache-spark