The World's Oldest Intern

The story of a serial change artist

Naive Bayes Classifier for identifying my DNS exfiltration packets – w00t

3 months of Hacker School for this!

Machine Learning baby steps with Hilary Mason’s video series

Jessica speaking my language – Buffer Overflows and Exploits

Learning sklearn while my dataset loads into pandas dataframe

my attempt at getting a job

Sometimes a simple check of length is all you need

MD5 hashed DNS requests to “”

SHA1 hashed DNS requests to “”

Twisted local DNS server with all my DNS being evaluated

Here is just the query name request

DNS Server receiving hashed message and message parsed out

Training set and testing set

Machine learning is about learning some properties of a data set and applying them to new data. This is why a common practice in machine learning to evaluate an algorithm is to split the data at hand in two sets, one that we call a training set on which we learn data properties, and one that we call a testing set, on which we test these properties. (from the sklearn website)
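As a rough illustration of that split, here is a minimal sketch using scikit-learn's train_test_split. The toy arrays, the 80/20 ratio, and the random seed are all made up for the example; none of them come from my actual DNS dataset.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 samples with 2 features each, plus binary labels.
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1] * 5)

# Hold out 20% of the samples for testing; random_state makes the shuffle repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

print("training samples:", len(X_train))
print("testing samples:", len(X_test))
```

The model only ever sees X_train and y_train while learning; X_test and y_test are kept aside to check how well it generalizes.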


sklearn is a Python module integrating classic machine learning algorithms in the tightly-knit world of scientific Python packages (numpy, scipy, matplotlib).

Fitting data

The main API implemented by scikit-learn is that of the estimator. An estimator is any object that learns from data; it may be a classification, regression or clustering algorithm or a transformer that extracts/filters useful features from raw data.
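To make the estimator idea concrete, here is a small sketch that fits a Gaussian Naive Bayes classifier and then asks it for predictions. The single feature (query name length) and every number below are invented for illustration; this is not my real training data or the exact model I ended up with.

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

# Hypothetical feature: length of the DNS query name, one value per sample.
# Label 0 = ordinary lookup, label 1 = suspiciously long (hashed) query name.
lengths = np.array([[12], [15], [18], [20], [14], [45], [52], [48], [50], [47]])
labels = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])

# Every estimator exposes fit(); classifiers also expose predict().
clf = GaussianNB()
clf.fit(lengths, labels)

# Classify two unseen (made-up) query name lengths.
predictions = clf.predict(np.array([[16], [49]]))
print(predictions)
```

Swapping GaussianNB for another classifier leaves the fit and predict calls unchanged, which is the point of the shared estimator API.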

