Will my voice ever be used to train other models?

No. Your voice clone is generated, stored, and used only for you. We don't pool it. You can delete it from your device at any moment, and the cloud copy goes with it.

What if I sound exhausted on the recording?

We ask you to read one calm paragraph at a moment of your choosing. The model learns your timbre, not your mood. The morning voice is steadier than the source recording, by design.

Why audio instead of just text?

Hearing your own voice from a year ahead is the strongest version of future-self continuity ever built into a consumer app. Text helps. Audio lands.

No. HerDay is a daily ritual, not clinical care. If you're carrying something heavy, please also speak with a therapist. We list a few we trust in the app.

Is the AI-written copy generic?

Every morning letter is generated against your intake answers, your tone preference, and the season you're in. You can rate any morning's letter, and we use that signal to tune yours, not anyone else's.

We're in private beta. Waitlist members get first access in Q3, with a free month of voice cloning included.

Are voice affirmations more effective than reading them?

For most goals — emotional regulation, future-self continuity, follow-through on intention — voice affirmations are more effective per minute than silent reading, with the largest edge when the voice is the listener's own. The reason isn't that voice is louder; it's that audio activates self-referential brain networks (vmPFC, self-recognition regions) that silent reading doesn't fully engage. The exception is short-term cognitive priming, where reading a value-laden statement before a hard task works about as well as hearing it.

What does the research actually say about reading vs. hearing affirmations?

There isn't a single perfectly designed head-to-head trial yet, but two separate research lines converge. Allan Paivio's dual-coding theory (1971/1991) showed that information encoded both verbally and as a sensory image is recalled significantly better than information encoded one way only. Cascio and colleagues' 2016 fMRI work showed that self-affirmation activates self-referential networks more strongly when processed as personally relevant — which voice cues the brain to do almost reflexively. Combined, the literature points toward audio (especially self-voice) as the dominant modality, with reading as a useful complement.

Can you do both — read the affirmation AND listen to it?

Yes, and dual-coding theory predicts this should outperform either alone. The simplest version is to write your affirmation by hand in the morning, then play the audio of it (in your own voice, if possible) while you re-read the line. The combined practice engages verbal encoding, motor encoding (the writing itself), auditory encoding, and self-referential processing — four channels on the same content. We've seen this version persist longer for users than either single format.

Does it matter if I read affirmations silently or aloud?

Yes — reading aloud is closer to voice affirmations than to silent reading, because it engages the auditory and self-referential systems. Reading silently is the weakest version of the practice. Reading aloud is the budget version of voice affirmations: no model, no app, no setup — just you saying the line to yourself in the bathroom mirror. Most of the benefit comes from your brain hearing the words in a voice it files under 'me.' Silent reading does not provide that cue.

Is it bad to read affirmations on Instagram or in a journal?

Bad is too strong. Less effective is closer. Reading a generic affirmation in a feed engages language processing without the self-recognition cue and without the conditional phrasing most low-self-esteem readers need. Reading your own handwritten affirmation in a journal is meaningfully better — it adds motor encoding and ownership. Either format becomes much stronger when you also say the line out loud, in your own voice, addressed to yourself by name.

BlogVoice & audio

Voice affirmations vs. reading them — what the data actually shows about the difference

Reading and hearing an affirmation activate different brain systems, persist for different lengths of time, and survive different kinds of bad mornings. Here is the side-by-side data on what each format does well, where voice wins, and the one underrated case where reading is actually better than voice.

Lena Hartwell · MSc Cognitive Science

Editorial lead · Science writer

Published June 3, 2026

Updated June 3, 2026

10 min read

A watercolor and ink illustration on cream paper showing two parallel horizontal channels running across the page — the upper one a hand-drawn line of cursive merlot ink (the reading channel), the lower one a dusty-rose watercolor sound wave (the voice channel) — the abstract metaphor for two formats of the same self-statement. — The same sentence. Two channels. Different brain.

The question do affirmations work? hides a smaller, more useful question underneath it. Affirmations delivered how? Read silently in a feed? Whispered aloud in the bathroom mirror? Heard in a voice you didn't pick? Heard in your own voice? Played as audio while you write the sentence by hand at the same time? The answer is different for each, in ways that are now well enough studied to talk about with specific numbers rather than vibes. Here is what the research actually shows about the two most common formats — silent reading vs. voice — and the one underrated case where reading is the better choice.

Definition · Modality (in affirmation research)

The sensory channel through which a self-statement is delivered to the brain — most often silent reading, reading aloud, hearing another voice, or hearing one's own voice. Modality predicts which brain systems activate, how strongly the statement is filed as self-relevant, and how long it persists in memory and behaviour. Format-effects research suggests modality often matters more than content.

The question hidden in "do affirmations work?"

For most of the wellness industry's history, do affirmations work? has been answered with citations to studies that quietly used different modalities — and treated the results as interchangeable. Steele's 1988 work on self-affirmation used written value-affirmation tasks. Wood's 2009 paradox study used spoken declarative statements. Cascio's 2016 fMRI work used silent reading inside the scanner. Kross's self-distancing studies used both written and verbal self-talk. The findings travel reasonably well across modalities, but the strength of the effect doesn't. Asking do affirmations work? without specifying the modality is, on the evidence, like asking does exercise work? without specifying whether you mean a five-minute walk or a marathon. The answer is yes for both. The numbers are not the same.

Once you separate the modalities, three patterns turn up consistently across the literature:

Silent reading is the weakest version of the practice. It engages language processing but leaves the self-recognition systems mostly idle.
Reading aloud is meaningfully stronger than silent reading, because it adds the auditory and self-referential channels. It is the budget version of voice affirmations.
Hearing your own voice is the strongest single modality that has been studied, particularly for emotional regulation, future-self continuity, and daily-practice persistence.

That hierarchy is the load-bearing finding of this article. Everything below is the why, the numbers, and the one case where reading actually beats voice.

What reading affirmations does — and doesn't do

Reading is the dominant format for affirmations because reading is cheap. A graphic in a feed costs nothing to produce, nothing to consume, and nothing to share. The wellness-affirmation industry is, in modality terms, mostly a reading industry.

What does silent reading actually do, in brain terms? It engages the language-processing systems — left temporal cortex, Broca's area, and connected reading-relevant regions. If the statement is self-referential, it engages the medial prefrontal cortex (mPFC) to a degree, but more weakly than spoken or heard self-statements do.^{Cascio 2016} The Cascio fMRI work showed that the mPFC activation during self-affirmation tasks is sensitive to how personally relevant the brain decides the statement is — and the brain decides this faster, and more strongly, for audio of one's own voice than for text.

A single hand-drawn open ivory book in deep merlot ink on cream paper, an abstract handwritten line of cursive script across one page — the abstract metaphor for the silent reading channel. — Reading does language well. Self-recognition, less so.

The practical implication: silent reading of generic affirmations primarily engages your brain's somebody is talking system, not its I am the one being addressed system. The statement gets processed, but not necessarily filed under me. This is part of why so many readers describe the experience of reading an Instagram affirmation as bouncing off. The text didn't fail. The brain decided it wasn't talking to them.

The notable exception is reading your own handwriting. A self-written affirmation, on a piece of paper you wrote on this morning, engages motor encoding (you wrote it), visual self-recognition (this is your handwriting), and a measurable degree of ownership that generic typeset text never produces. The literature on this specific format is thinner than I'd like, but the directional effect is consistent across studies of journaling and self-expressive writing.^{Pennebaker 1997}

What voice affirmations do that reading can't

The strength of voice affirmations is not that voice is louder. It is that voice is the cue the brain uses to decide what is personally relevant almost reflexively.

Functional MRI work on voice recognition — particularly Kaplan and colleagues' 2008 study — has shown that hearing your own voice activates self-referential networks differently from hearing a stranger.^{Kaplan 2008} Your brain has a category called me, and audio of your own voice gets filed under it without the cognitive friction that text-of-a-self-statement has to push through.

The downstream effects are observable across separate research lines:

Stronger self-recognition activation. The vmPFC and related self-referential regions activate more reliably for audio of one's own voice than for written self-statements.^{Cascio et al. 2016}
Better emotional regulation when addressed by name. Kross's lab has shown that addressing yourself by name or in the second person reduces amygdala activation during emotional reflection.^{Kross et al. 2014} Voice is a natural carrier for this — second-person address feels native in audio in a way it has to be deliberately staged for in text.
Higher persistence over time. Daily practices that include audio of one's own voice persist longer in users' habits than reading-only practices in the consumer-app data we have access to. The mechanism is partly that the audio fits into a moment the eyes are not free for (commute, getting dressed, holding the baby), which makes the practice survive busy mornings.
Future-self continuity engagement. Hershfield's body of work on future-self continuity has been demonstrated primarily through imagery interventions, but the broader argument — that anything that makes the future self feel more concrete to the present-self increases follow-through — applies even more strongly to audio in the listener's own voice.^{Hershfield et al. 2011}

~2x

approximate ratio of next-day recall for self-voiced statements vs silently-read text in dual-coding studies. Format effects are larger than most people assume.· Paivio 1991 (review)

The dual-coding case: what happens when you do both

Allan Paivio spent most of his career arguing for what he called dual-coding theory — the idea that information encoded in two modalities (verbal and sensory-imagistic) is significantly better recalled than information encoded in one.^{Paivio 1991} The classic demonstration is a memory test in which subjects who both hear a word and see an image of the thing the word names recall the pair significantly better than subjects who get the word alone.

Affirmations are, in modality terms, a near-perfect candidate for dual-coding. The verbal channel (the sentence itself) is fixed by the practice. The sensory channel can be added at almost no cost: writing the sentence by hand (motor + visual), saying it aloud (auditory), reading it back to yourself (visual + auditory), or playing audio of it in your own voice (auditory + self-referential).

A hand-drawn editorial watercolor diagram on cream paper, two channels (one ink cursive line, one watercolor sound wave) converging into a single combined merlot-rose stream — the abstract metaphor for verbal and auditory encoding combining. — Two channels carrying the same sentence outperform either alone.

The version that the dual-coding literature predicts will outperform the rest is the combined one. Specifically: handwrite the affirmation in the morning, then play the audio of it (in your own voice, if you have it cloned) while you read the sentence back. Four channels — motor, visual, auditory, self-referential — on the same content, in the same minute. This is what we've optimized HerDay around. It is also why we hand-deliver an audio version even to users who keep a written journal: the combined practice persists longer than either channel alone.

When reading is actually better than voice

There is one well-supported case where reading beats voice. It is worth naming clearly so we don't oversell audio.

For precise cognitive priming before a hard task — reading a specific value statement before a presentation, an exam, a difficult conversation — written affirmations work about as well as spoken ones, and sometimes better. The reason is that reading is a more controlled act than listening. You can re-read the line. You can hold your eyes on it. You can re-read the part that lands. Audio plays once and continues. For a thirty-second cognitive priming window, the controllability of reading is a real advantage.

This is the format Steele's original 1988 self-affirmation paradigm used, and it's the format that has held up across forty years of replication. If your goal is I am about to do a hard thing in the next ten minutes, and I want to touch a core value first, reading is fine — possibly preferable. The voice advantage shows up in the longer-horizon work: daily ritual, emotional regulation, future-self continuity, follow-through over weeks.

What this means for your actual morning practice

Translating this into something usable on a Tuesday at 6:47 a.m., before the day has done anything to you:

If you have thirty seconds — say your affirmation aloud, in your own voice, addressed to yourself by name. This is the highest-yield-per-minute version of the practice. It does not require an app, a journal, or anything besides willingness to speak to yourself out loud.
If you have two minutes — write the affirmation by hand in a small notebook, then say it aloud while looking at the line you just wrote. Motor + visual + auditory + self-referential, all on the same sentence. This is the combined dual-coding version.
If you have access to a voice-cloned audio version (ours or anyone's, built to the safety standards described in our voice cloning explainer) — write the line, then let the audio play in your own voice while you read it. This is the format the research most strongly predicts will persist.
If you are about to do a hard, specific task in the next ten minutes — read a value-statement on paper. Hold it. Re-read it. Cognitive priming wants the controllability of reading more than the warmth of voice.

The hierarchy isn't voice good, reading bad. It is voice and reading do different work. Knowing which one to reach for, for which morning, is most of the practice.

Silent reading is the weakest version. Reading aloud is the budget version of voice. Your own voice, addressed to you by name, is the strongest version we currently know how to build.
— the actual hierarchy

Overhead watercolor still life of a small open ivory notebook on cream linen with a single handwritten line on the page, a fountain pen at rest diagonally across the notebook, a small open peony beside it — the quiet of a practice held in both hand and voice.

Keep reading

Voice & audio

Why your own voice works better — the quiet psychology of hearing yourself

Hearing your own voice changes how a self-statement lands. Not because it's louder — because the brain processes self-voice as identity-relevant data. Here's what 30 years of research on future-self continuity, self-distancing, and voice recognition actually shows.

Voice & audio

Voice cloning for affirmations, explained — what's actually happening inside the model

A 60-second sample of your voice is enough for a modern model to render new sentences in your acoustic signature. Here's what voice cloning actually does, why it matters specifically for affirmations, and how we built it inside HerDay so the model never leaves your device.

Science

Do affirmations actually work? A 2026 evidence-based review

Affirmations work — but only the kind grounded in your existing values, and only when phrased to match where your self-esteem actually is. Here is what 30 years of psychology research shows, and where most apps get it wrong.