Introducing pydbgen: A random dataframe/database table generator
Nov 05

When you start learning data science, often your biggest worry is not the algorithms or techniques but getting access to raw data. While there are many high-quality, real-life datasets available on the web for trying out cool machine learning techniques, I’ve found that the same is not true when it comes to learning SQL.

Revisiting the Unix philosophy in 2018
Nov 05

In 1984, Rob Pike and Brian W. Kernighan published an article called “Program Design in the Unix Environment” in the AT&T Bell Laboratories Technical Journal, in which they argued for the Unix philosophy, using the example of BSD’s cat -v implementation. In a nutshell, that philosophy is: build small, focused programs, in whatever language, that do only one thing but do this thing well, communicate via...
