I am a data scientist at Microsoft, formerly at Wave Apps.
I am a data scientist at Microsoft, formerly at Wave Apps.
Blog part 2 (this time with ipython notebooks (and Rmarkdown)!):
Change Tracking in Delta Lake 2.0
Spark's randomSplit function and nondeterminism
Run every trail in Cougar Park
Businesses spend a profound amount money non-ideologically in political races.
IID failures lead to over-confidence in A/B test results.
Confidence bounds on metrics from sampled data sets.
PCA on county-level statistics in the US.
Examining the outliers between median income and education levels by county in the US.