Honest status: this is a designed and analytically validated project, not a fielded one. The interaction is prototyped, the study is pre-registered, and the analysis pipeline is built and validated on simulated data with known parameters. No participants have been run yet, so there are no empirical findings about real users. The numbers below are method validation, not results.
The wrong number
Digital-wellbeing tools almost all put one number in front of you: hours and minutes of use. I started from a measurement problem with that number.
It is poorly measured — a 2021 meta-analysis (Parry et al., Nature Human Behaviour) found self-reported phone use correlates only modestly with logged use, and the gap is worst exactly for the "problematic" use these apps care about. And even measured well, it barely predicts anything: Orben & Przybylski (2019), across 350,000+ people, found technology use explains on the order of 0.4% of variance in wellbeing.
The field is built on a metric that is both noisy and nearly non-predictive — while the variable that should matter, how a person appraises their own use, sits almost entirely unmeasured.
This continues a thread that runs through my research: subjective appraisal tends to predict psychological outcomes more strongly than objective behavioral measures — the same structure as fear-of-crime research, where perceived risk predicts fear far better than actual victimization risk. Driftline applies that lens to digital life.
Why the existing apps fall short
I ran a competitive analysis of the intentionality-app category (One Sec, ClearSpace, Opal, ScreenZen, Intently, and others). Two patterns repeat: they measure quantity (the noisy, non-predictive number), and they hand the work back to the user through timers, blocks, and willpower — treating the problem as a discipline failure rather than a product engineered to capture attention.
Importantly, the "pause before you open the app" mechanic is already well-established and even has a published study behind it, so I deliberately did not claim that as a contribution. The real gap is narrower: no app in the category treats appraisal as longitudinal data, pairs an intentionality read with an affect read, or closes the loop into a measurable research instrument.
The concept & design
Driftline replaces the minutes dashboard with two objects. First, an appraisal moment — a two-tap, experience-sampling check-in that captures a session on two axes:
Second, the mirror — instead of reporting volume, it reflects the shape of your sessions back to you. Each session is a point in a 2×2 field, and two perceptual encodings carry the meaning without a legend: warmth = affect (restored sessions glow warm, depleting ones go cold) and blur = drift (sessions you drifted into render literally out of focus; chosen ones are sharp).
In the working prototype, the headline insight runs counter to the usual story: the most depleting sessions aren't mindless scrolling but intentional late-night work — and some "drift" is genuinely restorative. A minutes-based tool can never surface that, because minutes don't know how a session felt or whether it was chosen.
The design intent is that the measurement itself is the intervention — a brief, non-judgmental moment of noticing — while the reflection surfaces something non-obvious. A functional interactive prototype of this loop is built (React); the production target is React Native / Expo.
The research design
The concept is only worth building if the underlying hypothesis holds, so I designed a study to test it rather than asserting it: a within-person experience-sampling study — signal-contingent prompts ~4×/day for 14 days, with baseline and exit measures. Because each participant is their own control across pings, every stable confounder (personality, baseline wellbeing, chronic use level) is automatically held constant.
I then wrote a full pre-registration using the field-standard ESM template (Kirtley et al., 2021), which locks the decisions most vulnerable to after-the-fact flexibility:
- 01A single primary hypothesisWithin-person appraisal → wellbeing; secondary hypotheses Holm-corrected, exploratory ones labeled — so no result can be retrofitted into a headline.
- 02Person-mean-centering, fixed in advanceSeparates within-person effects (causal-relevant) from between-person effects (confound-prone) — the move that makes the analysis credible.
- 03Locked exclusion rules & a stopping ruleInclusion thresholds and a fixed sample/stopping rule set before any data exist; no optional stopping.
- 04Multilevel mediationA 1-1-1 model of use → appraisal → wellbeing — the repeated-measures evolution of the mediation analysis from my master's thesis.
A subtle but critical decision: the appraisal of a session and the wellbeing outcome are measured as separate items with different referents, to avoid a circular model that predicts wellbeing from wellbeing.
The analysis pipeline
Read this first: the numbers in this section come from synthetic data with effects I planted, run to prove the pre-registered models work and to size the real study. They are not findings about real people.
I built the full analysis pipeline in R (lme4 / lmerTest) and validated it by simulating a dataset to the pre-registration's exact shape (50 people × 14 days × 4 pings, ~65% compliance) with known true effects, then checking recovery. Every pre-registered estimate landed on its planted value within sampling error.
What this demonstrates
Honest status & roadmap
| Literature synthesis & competitive analysis | Done |
| Concept & interaction design | Done |
| Working interactive prototype (core loop) | Built (React) |
| Study design | Done |
| Pre-registration (OSF, public) | Registered |
| Analysis pipeline (R), validated on simulated data | Done |
| Production instrument (React Native / Expo) | Planned |
| Real-world fielding (participants) | Not started |
| Empirical findings | None yet |
Next steps
Build the production instrument with scheduled prompts and data export; run a small pilot to confirm timing and item comprehension; field the study; run the confirmatory analysis on real data and add the results here.
The pre-registration is public and timestamped: osf.io/7ugtn.
Reflection
Designing the study taught me more than running a quick survey would have. Forcing myself to separate appraisal from outcome, to pre-register a single primary hypothesis, and to validate the analysis on simulated data before touching real participants is a discipline I'll carry into applied research work — it's the difference between a finding you can defend and one you merely hope for.