Quantified Self: Clustering data from my Habit Tracker

4 min readAug 13, 2020

I’ll be contributing to the Quantified Self movement by sharing the result of this clustering project.

The data exported from Loop — Habit Tracker. This is my friend for several months in my 8 to 5 job before. It helped me record the progress of my routine (what gets measured, gets done).

A habit is an action we do subconsciously or when there is no mental effort to it, otherwise it is a routine. These are the routine that I was trying to develop:

Morning Meditation, Morning readings, Midday Sleep, Out on time, Arthour (personal projects), Before sleep readings, Before sleep meditation.

Using the exported data from the app, I have different column and rows. Columns are the habit that I want to track, and rows are the binary values that show whether the routine was fulfilled or not (1-yes , 0-no) .

Pearson Correlation

Using correlation analysis, I discovered the strengths of relationships among my routines. This is worth exploring to establish a connection between these habits.

From the interpretation above we can see that:

Out on time routine (+0.63) has a moderate to strong positive linear relationship to Before sleep reading routine
Midday Sleep and Morning Readings routine (+0.73) have a strong positive linear relationship to each other.
Arthour and Before Sleep routine (+0.77) have a strong positive linear relationship to each other.
Before sleep meditation and Morning Meditation routine (+0.62) have a strong positive linear relationship to each other.

CLUSTERING

Clustering is an unsupervised learning algorithm that finds clusters or group in a set of data. These groups should have similar properties or features. This method is a common technique for statistical data analysis used in many fields.

K-modes Clustering. According to the documentation

K-modes is used for clustering categorical variables. It defines clusters based on the number of matching categories between data points.This is in contrast to the more well-known k-means algorithm, which clusters numerical data based on Euclidean distance.)

For simplicity, I used 3 number of clusters, this is a subjective judgement about the number of really distinctive clusters described here (I tried more K but it hurts my data).

These are different clusters derived using K-modes. I arbitrarily named my routines as: The Idealist, The Supervisor, The Visionary.

CLUSTER 1: The Idealist

This group has no Out on time and has a plenty of Midday Sleep routine. I remember I stay late at work to read or finish something that needed the next day.

CLUSTER 2: The Supervisor

This cluster has the least number of routines. These days could be tiring.

CLUSTER 3: The Visionary

This cluster has the most number of accomplished routines.

Analyzing such routines helps me understand better how my day looks like back then. From this data, I can design more meaningful routine towards a better day.

Originally published at https://www.linkedin.com.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Written by Carlos Abiera

58 Followers

20 Following

Carlos C. Abiera currently manages the operations of Montani Int. Inc. and leads the REV365 data team. He has keen interests in data and behavioral sciences.

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

More from Carlos Abiera

Navigating Low and High Context Culture Communication

Carlos Abiera

Navigating Low and High Context Culture Communication

The way people communicate with one another varies from culture to culture. In a diverse company composed of different members from…

Jun 19, 2021

Interpreting Google Search Console Data using Google Data Studio

Carlos Abiera

Interpreting Google Search Console Data using Google Data Studio

In integrating Google Search Console and Data Studio, you’ll encounter these two types of data connection: Site Impression and URL…

Sep 20, 2021

Different Ways to Segment Customers using the Recency, Frequency, Monetary Model.

Carlos Abiera

Different Ways to Segment Customers using the Recency, Frequency, Monetary Model.

If you are new to RFM analysis, the work of Avinash Navlani and Joao Correia is a go-to place for you. They offer an in-depth introduction…

Aug 26, 2021

Contributor Safety: Vulnerability, Trust and Accountability

Carlos Abiera

Contributor Safety: Vulnerability, Trust and Accountability

Amy Edmonson, a professor at Harvard Business School and author of “The Fearless Organization” says that an organization that wishes to…

Sep 9, 2021

See all from Carlos Abiera

Recommended from Medium

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jessica Stillman

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Oct 30, 2024

731

I Wrote On LinkedIn for 100 Days. Now I Never Worry About Finding a Job.

Alexander Nguyen

I Wrote On LinkedIn for 100 Days. Now I Never Worry About Finding a Job.

Everyone is hiring.

Sep 21, 2024

973

Lists

Predictive Modeling w/ Python

20 stories1856 saves

What is ChatGPT?

9 stories521 saves

Practical Guides to Machine Learning

10 stories2225 saves

Generative AI Recommended Reading

52 stories1691 saves

An abstract illustration of a vast, dreamy desert landscape under a starry night sky. A small figure sits by a campfire, dwarfed by the large silhouette of a serene face blending into the sand dunes, creating a surreal and contemplative atmosphere.

The Startup

Jano le Roux

How This 17-Year-Old Quietly Built a $1.12M/Month AI App

I stumbled upon his exact strategy from A to Z and it's brilliant.

Dec 3, 2024

164

4 Lifechanging ChatGPT Features You May Not Know About (Feb. 2025)

Jordan Gibbs

4 Lifechanging ChatGPT Features You May Not Know About (Feb. 2025)

ChatGPT has been releasing a ton of powerful features recently… Are you caught up?

Feb 16

The Medium Blog

The Medium Newsletter

Why writing is just like running

Botanical journaling + beating writer’s block (Issue #284)

3d ago

Website Builder by Google Sites Templates

Google Sites Templates & Design by Harry Jung

How to Create the Perfect Illustration Images for Google Sites

Creating eye-catching illustration images is crucial for making your Google Sites stand out. But finding the right visuals can be both…

Feb 23

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams