about
blog
code
resume
Oct 2 2024
Minimal tmux Configuration
terminal
bash
cli
Sep 23 2024
Create a class for applying functions in sequence
python
pyspark
Jul 29 2024
Create Empty Dataframe of Date Sequence
Apr 3 2024
Generate Colour Gradient
python
visualization
Jan 22 2024
Read TOML variables into python globals
python
Jan 3 2024
Distribute Cluster Evaluation
clustering
machine-learning
Aug 18 2023
Databricks Fixtures for PyTest
pyspark
databricks
python
Aug 4 2023
How to Comment Multiple Lines in Vi
data-science
linux
vim
Jul 23 2023
Bash Script for TODO finding
cli
bash
linux
productivity
Jul 23 2023
Drop Columns with High Missing Values in Spark
python
pyspark
feature-engineering
Jun 20 2023
Emacs Shell Shortcuts
bash
cli
May 15 2023
Mapping a Pandas Column
python
data-science
pandas
May 14 2023
Decorator Template
python
metaprogramming
May 3 2023
Changing File Extensions
bash
cli
May 1 2023
Kedro 0.17.X Hook Spec
python
kedro
Mar 31 2023
Setting Up Spark for PyTest
python
data-science
pyspark
Mar 16 2023
Randomly Populating Pyspark Columns
python
data-science
Jan 26 2023
Dictionary for Automatically Loading Tables
python
data-science
Dec 21 2022
Metaclass for Auto Initialization
python
metaprogramming
Dec 8 2022
XGBoost Evaluation Classes
python
xgboost
Nov 30 2022
TensorFlow Custom Loop
Sep 12 2022
Parsing XML with untangle
Jul 24 2022
Sync Script
Jul 21 2022
PySpark Fill Rates
bash (4)
cli (4)
clustering (1)
data-science (6)
databricks (1)
feature-engineering (1)
kedro (1)
linux (2)
machine-learning (2)
metaprogramming (6)
pandas (1)
productivity (1)
pyspark (5)
python (19)
terminal (1)
vim (1)
visualization (1)
xgboost (1)