Vocabulary Size: How Many Words You Actually Need to Read
Somewhere online you've read that 1,000 words covers 85% of everyday language. It's true. It's also the most misleading true statistic in language learning.
Because 85% doesn't feel like understanding most of a text. It feels like reading a page where every seventh word has been blacked out — and the blacked-out words are the ones carrying the story.
Try the method first: Generate your first bilingual story and see what reading at your level actually feels like.
Where the 1,000-Word Number Comes From
Languages are lopsided. A tiny group of words does almost all the work: the, of, and, to, she, was. In English, the 100 most frequent words account for roughly half of everything ever written. The most frequent 1,000 word families push that to somewhere around 80–85% of ordinary text.
So the statistic is real. Learn 1,000 well-chosen words and you will, in a literal counting sense, recognize about six out of every seven words on a page.
The trap is the quiet leap from coverage to comprehension — from "I recognize 85% of the words" to "I understand 85% of the text." Those are not the same thing. They're not even close. And the gap between them is easiest to show, not explain.
What 85% Coverage Actually Looks Like
Here's a paragraph at 85% coverage. Every word you'd know is intact. Every word you wouldn't is blacked out:
Last night, Ana finally ████ the letter. She read it twice, ████ it on the table, and stared out the ████ window. Whatever she had been ████ for all these months, it wasn't this. In the morning she would have to ████ her sister — but not yet.
You can follow the grammar perfectly. Someone did something with a letter, felt something, will do something tomorrow. But did Ana open the letter or burn it? Is she relieved or devastated? Does she call her sister or avoid her?
That's the cruelty of frequency: the 1,000 words you learn first are mostly the connective tissue — pronouns, articles, prepositions, the fifty most common verbs. The words that get blacked out are the nouns and verbs that carry the plot. You end up fluent in the skeleton of a language and blind to its meaning.
So if 85% reads like that, what number does the job?
The Number Researchers Actually Use: 98%
Studies of reading comprehension keep landing on the same threshold: comfortable, unassisted reading starts at about 98% coverage. That's one unknown word in fifty — roughly one per paragraph instead of one per line. At 95%, reading is possible but effortful, a puzzle you solve rather than a story you follow. Below that, comprehension falls off a cliff.
And here's the part nobody puts in the motivational infographic: reaching 98% coverage of novels or news takes somewhere in the region of 8,000–9,000 word families. Around 5,000 gets you to a workable 95% for many everyday texts.
A word family, if the term is new: a base word plus its relatives. Decide, decides, decided, decision, undecided — one family, not five words. That's the small mercy in the number. The large problem remains.
Sit with that for a second. Eight thousand. If you drilled twenty new flashcards a day and forgot none of them — which you will — that's over a year of daily drilling just to reach the starting line of comfortable reading.
If that were the whole story, it would be a good reason to quit. It isn't the whole story.
The Good News Hiding in the Bad Math
Nobody who reads comfortably in a second language got there by memorizing 8,000 words from a list. Neither did you, in your first language. By age six you knew thousands of words, and nobody ever showed you a flashcard for puddle or grumpy. You met them inside situations, over and over, until they were simply yours.
Frequency, the same force that makes the 1,000-word claim misleading, quietly works in your favor here. The words you most need next are — by definition — the ones that appear most often in what you read. You don't have to hunt for word number 1,001. It's already waiting in the next story, and the one after that, and it will keep showing up until it sticks.
This is why vocabulary growth through reading compounds. Picture it over three months. In January, Tomás reads a short story about a missed train and stumbles on andén — platform. He half-learns it. In February it shows up in a story about a farewell, and this time he only hesitates. By March, when a character is waiting on the platform in the rain, the word is invisible — it's just part of the scene, the way platform is in English. He never reviewed it once. The language reviewed it for him.
Multiply that by every frequent word in the language, running in parallel, and you have the actual mechanism by which readers reach 8,000 words. Every story you finish makes the next story slightly more readable, which lets you read more, which teaches you more words. The learners who get there never counted. They read their way there.
Which leaves one honest problem: how do you read your way to a big vocabulary when you don't have the vocabulary to read yet?
How to Read at 1,000 Words Without Drowning
The bootstrap problem is real. Authentic novels at 85% coverage are miserable — you saw the blacked-out paragraph. Two things fix it.
First, read text matched to your level, not to natives. A story written for your vocabulary puts you at 90%+ coverage from the first line. The unknown words are spaced out enough that context can do its job — you meet one new word inside twenty familiar ones, and its meaning is often sitting right there in the scene.
Second, make the unknown 10% cost a glance, not a dictionary trip. This is where the blacked-out boxes stop being a wall. In a bilingual story, the translation sits alongside the text. When you hit empujó and it doesn't land, your eyes flick across, catch pushed, and return — the story never stops moving. The word arrives with its scene attached: who pushed, what, why it mattered. That's the same way puddle arrived when you were four.
Do that daily, and the math flips. The words you look up most often are the frequent ones, so they come back fastest — and each one you absorb nudges your coverage up. You're not studying toward the day you can finally read. You're reading now, and the vocabulary is the exhaust.
Generate a story at your level and count how few words you actually need to look up. It's usually fewer than the statistics made you fear.
So the honest answer to "how many words do you need?" is: more than the internet promised, fewer than the research scared you with — and it's the wrong question anyway. Word counts are a side effect, not a goal. Readers who make it never know their number. They just know that this month's stories feel easier than last month's, and that somewhere along the way, the black boxes disappeared.
Related reading:
- How Bilingual Stories Improve Vocabulary Retention
- Choosing Your Reading Level: How to Tell If a Story Is Too Easy or Too Hard
- Comprehensible Input: Why Stories Beat Flashcards
Start reading bilingual stories for free
Start reading bilingual stories for freeNot sure of your reading level? Take the free CEFR test →