After the GA of Apache Kudu in Cloudera CDH 5.10, we t...
Crazy Progress Bars
|| ||If you use computers, you’ve experienced progress...
Becoming a Data Scientist Podcast Episode 15: David Meza
David Meza is Chief Knowledge Architect at NASA, and t...
Data Cleaning, Categorization and Normalization
Definition of Clean Data Happy families are all alike;...
Journal: PLXtrum - realtime machine learning for predicting note onset
This system detects in real time when the note is comi...
Streaming Columnar Data with Apache Arrow
** Fri 27 January 2017 Over the past couple weeks, Non...
Where Predictive Modeling Goes Astray
I recently reread Yarkoni and Westfall’s in-progress p...
Development update: High speed Apache Parquet in Python with Apache Arrow
** Wed 25 January 2017 Over the last year, I have been...
Doing magic and analyzing seasonal time series with GAM (Generalized Additive Model) in R
As I wrote in the previous post, I will continue in de...
Radiocarbon dating
|| |Carbon dating, more specifically Carbon-14 dating ...