Exploring Cohort Analysis – Part One

Simply put, cohort analysis is a technique for analyzing activity over time by a common characteristic.  Mostly used in sales and marketing, cohort analysis can be used in tasks such as analyzing customer loyalty, customer cost acquisition, marketing campaign effectiveness and to explore many other aspects of sales.


I am using the superstore sales created by Michael Martin found here or here.  This excel file contains three sheets of which only the first one, Orders, will be used in this analysis.


The store providing their sales data does monthly advertising campaigns and wants to track what impact these advertising campaigns have on the amounts of orders placed over time.  They want to use this information to evaluate their different campaigns and improve their efforts.

Given the superstore sales data and the requirements, lets present the number of orders placed per customer join date.  Presenting the number of orders per join date will show the effectiveness of advertising campaigns leading to such date.


Any database server can be used to follow along.  The code used here can easily be revised to work on any vendor’s product like MySQL, etc.  For visualization purposes, Tableau can be easily changed replace by LibreOffice or similar.

Continue reading


Installing Oracle DB 12c in Linux OEL6 (In One Sitting)

Because I will have to do this again in no time, here I jot down the quickest way for me to install 12c.  This is the usual series of following steps, running into an error, searching for a fix, lather, rinse repeat.

Continue reading

Oracle 12c Primer – Up And Running

UPDATE – Post has been revised since posted for accuracy.  Thanks Matt.

Alright.  Having recently installed Oracle DB 12c, I was disoriented for four days before things started to click.  No joke, this latest release of Oracle latest database server is the biggest release-to-release departure of old to new I have witnessed so far… I’ll like to think I have a long way to go.

I’ll go over the items that most baffled me and attempt to explain what these changes mean and how to catch up to Oracle 12c.  By the end of this post, maybe I can convince you there quite a few niceties that come from free for all our trouble.  Hope it saves you some time as well.


Continue reading

I Too Found Paul Revere

I am coming from this Reddit post and from here… This post exists solely because I too wanted to find Paul Revere.  I am not even going to use ‘big data’.  Mainly because this data is not big, I do not have a graph database and because I think it would be fun to see what plain old SQL and Tableau can show us.  For truly insightful posts, just visit Kieran Healy, the author of the original post instead.

Kieran makes a very compelling argument on what is possible for surveillance programs the general population may (or not) understand.  His post is really fascinating.  Follow along and you can learn to monitor your citizens as well. 

Heck, I bet you could build your own surveillance network with freely available online services.  Imagine what you could cook up with Google Alerts, Amazon S3, IFTTT and a bit of time!

Continue reading