Spaced Interleaved Retrieval

Encoded knowledge decays unless it is reconstructed, spaced, varied, and repaired over time. Spaced Interleaved Retrieval is the user’s main system for that work.

Summary

SIR combines three principles:

Retrieval: recall or use knowledge before checking the source.
Spacing: revisit material after delays that gradually widen.
Interleaving: mix topics, formats, difficulty levels, and angles of use.

The goal is to prove that knowledge can be reconstructed and used under varied conditions.

In the user’s system, SIR sits under Retrieval and is coordinated by Self-Regulation.

For complex problem solving and knowledge work, interleaving should be treated as reconstruction practice, not only mixed scheduling. Interleaving for Complex Problem Solving is the more general operating note for using interleaving to expose approach gaps, variable relationships, and weak problem frames.

What It Tests

SIR should test multiple levels of mastery:

Facts and labels.
Explanations of individual concepts.
Relationships between concepts.
Judgments about which relationships matter most.
Application in unfamiliar cases.
Procedure execution under realistic constraints.

If retrieval only tests recognition or isolated recall, it will miss many of the gaps that matter for real performance.

Use different methods depending on the target knowledge.

Need	Useful Retrieval Format
Isolated facts	Narrow flashcards, image occlusion, quick written recall.
Concept explanation	Teach the concept without notes, then check.
Relationships	Answer relational questions or create a map from memory.
Big-picture structure	Closed-book mindmap or chunkmap brain dump.
Evaluation	Compare competing explanations and justify which relationship matters more.
Procedural skill	Solve varied problems, execute the skill, or simulate the target task.
Exam readiness	Timed mixed questions, full answer generation, and error review.

SIR Methods

This list is a working menu of retrieval methods. The goal is to choose methods by knowledge type and mastery target, not to use every method for every subject.

Declarative Methods

Use these when the target knowledge is mostly conceptual, factual, explanatory, or relational.

Method	Best Use	Watch For
3Cs	Basic labels, diagrams, isolated terms, and fill-in-style details.	Too low-order for complex understanding.
Simple flashcards	Narrow facts, definitions, constants, labels, and small details.	Exploding card counts usually indicate weak encoding.
Simple relational flashcards	Specific relationships between a small number of concepts.	Do not force every relationship into cards.
Evaluative flashcards	High-value questions that require judgment about relationships.	Better created from a strong map or question set.
Isolated generated questions	Single-concept recall or explanation.	Can become too low-order if overused.
Relational generated questions	Compare, contrast, connect, or explain interaction between concepts.	Needs full answers, not mental familiarity.
Evaluative generated questions	Judge importance, relevance, priority, or conditions across concepts.	Time-consuming but high value.
Linear brain dump	Rapid recall of facts, terms, and broad coverage.	Weak at representing relationships.
Mindmap brain dump	Reconstruct the topic structure and relationships from memory.	May miss tiny details unless followed by detail checks.
Isolated teaching	Explain one concept or process out loud.	Often less efficient than narrower tools for pure facts.
Relational teaching	Explain how concepts interact.	Strong early-to-mid retrieval method.
WPW	High-volume, multi-order reteaching across whole and parts.	Requires strong encoding and can expose many gaps.
Direct practice questions	Answer from memory, then check.	Can hide uncertain reasoning if checking is too fast.
Extended practice questions	Answer, mark uncertainty, build a better answer, then check.	More time-intensive but better for gap finding.
Advanced group practice	Create, exchange, answer, compare, and discuss challenging questions.	High setup cost; best for important topics.
Evaluative discussion	Argue and judge relationships, importance, and edge cases.	Must be directed; undirected discussion is usually weak.
Chunkmap reconstruction	Rebuild or improve the chunk structure from memory.	If hard, the original encoding may have been shallow.
Feynman-style explanation	Explain simply, find gaps, refine the explanation.	Useful as micro-retrieval, not a full system by itself.
Memory systems	Ordered lists, labels, vocabulary, or narrow rote demands.	Avoid using them as a substitute for encoding.

Procedural Methods

Use these when the target knowledge is mostly skill execution, problem solving, production, language use, coding, calculation, lab work, or performance.

Method	Best Use	Watch For
Retrieved execution, simple	Perform a basic procedure from memory.	Stop once it becomes repetitive beyond mastery.
Retrieved execution, integrative	Combine multiple component skills into a larger function.	Move here early once simple execution is stable.
Retrieved execution, applied	Start from a target output and work backward to execute.	Best for realistic transfer.
Simple challenges	Solve prompted basic problems.	Good entry point, but not enough for mastery.
Integrative challenges	Combine multiple processes to solve a less predictable task.	Requires both procedural skill and declarative understanding.
Edge-case challenges	Handle atypical, unfamiliar, or boundary-condition problems.	Excellent for high-level transfer.
Variable modification	Change values or constraints while keeping the same basic problem type.	Good for early variation.
Variable addition	Add new conditions that change the problem structure.	Requires stronger conceptual judgment.

Methods To Avoid As Defaults

Passive rereading.
Passive rewriting.
Passive relistening.
Practice questions answered only mentally.
Checking answers before fully generating an attempt.
Undirected group discussion.
Repeating already-mastered procedures without variation.

REBIM (Repetitive Execution Beyond Initial Mastery) is the specific failure mode for procedural practice. REBIM is repeating a skill or process that can already be performed to high proficiency — coding the same simple scripts, solving the same difficulty of problems, or running the same conversation pattern in a language already well-handled. The benefit is negligible compared to what could be achieved with variation, edge cases, or applied challenges in the same time. REBIM typically feels productive because it produces smooth performance, which is indistinguishable from progress to the learner.

Encoding and Retrieval: The Tradeoff

Encoding and retrieval are not separate processes — the research treats them as fundamentally intertwined. In practice they are addressed separately because the techniques differ, but they influence each other directly: stronger encoding produces higher retention, which means less retrieval is required to maintain it. Weaker encoding produces more knowledge decay, which means retrieval sessions must be more frequent and more extensive.

As encoding skill develops, retrieval sessions become shorter and find fewer gaps. In the early stages — when encoding quality is still low — retrieval sessions will find many gaps, take longer, and feel laborious. This is the expected state, not a sign the system is failing.

Finding many gaps during retrieval is a sign the session is working, not that it is ineffective. A retrieval session that finds few gaps when mastery is still low is the sign of a session that is not probing deeply enough.

Concrete Spacing Schedule

The exact schedule matters less than session quality and consistency. A practical default that works across most academic contexts:

When	Session type
Same day	Encode and do a light retrieval pass on new material.
Next day (or within 2 days)	First gap-finding pass for yesterday’s content.
End of week	Interleaved retrieval for all material from the last two weeks.
End of month	Interleaved retrieval for all material from the last four weeks.
3–4 weeks before assessment	Comprehensive retrieval pass for all relevant content.

With this schedule, any given piece of material is encoded and re-encoded approximately six times in a month — through prestudy, first encoding, same-day retrieval, mid-week retrieval, end-of-week retrieval, and end-of-month retrieval. At strong encoding levels, the end-of-week pass for a full week’s material takes a few hours; the end-of-month pass for four weeks takes roughly half a day.

The schedule can be followed loosely. Exact timing matters less than ensuring each layer happens.

Scheduling Heuristic

The exact schedule is less important than session quality. A useful default for declarative-heavy material:

Timing	Main Job
Same day	Micro-retrieval while encoding.
2-5 days	First gap-finding pass.
1-2 weeks	Relationship and question-based pass.
1 month	Larger mixed challenge.
2 months and beyond	Maintenance, transfer, and higher-order challenge.

For procedural-heavy topics, move sooner toward varied execution practice. Declarative setup may still matter, but the system should quickly shift toward doing.

Session Rules

Retrieve before looking.
Write, speak, draw, or perform the answer fully.
Mark uncertainty as a gap even if the final answer is correct.
Check against the source, answer sheet, or worked example.
Decide whether the gap needs retrieval repetition or re-encoding.
Update the next session based on what failed.

Gap Types

Gap	Symptom	Repair
Fact gap	Missing term, label, number, definition, or step	Narrow flashcard or quick recall drill.
Relationship gap	Cannot explain how ideas influence each other	Rebuild local chunk map.
Structure gap	Cannot reconstruct the topic’s big picture	Re-Skin the BHS map.
Transfer gap	Can answer familiar questions but fails variations	Add interleaving and examples.
Confidence gap	Correct answer but shaky reasoning	Generate full answer sheet and compare.
Execution gap	Concept is known but performance fails	Practice the target procedure with feedback.

Relationship To Encoding

SIR improves the encoding system because retrieval failures reveal whether Bear Hunter System produced a usable structure.

If retrieval finds many lower-order gaps, do not automatically add hundreds of flashcards. First ask whether the encoding structure was weak. Better chunking can reduce the amount of isolated memorization needed.

LLM Use

LLMs can help SIR by:

Generating question variations.
Creating mixed practice sets.
Producing counterexamples.
Checking answer completeness.
Asking follow-up questions after a weak answer.
Converting a map into retrieval prompts.
Generating questions at varied difficulty levels across mastery levels.

Use AI for: saving time on low-cognitive tasks (collecting keywords, big-picture orientation); consolidating multiple sources into a single source to reduce split attention; generating test questions; asking questions to explore a topic like a tutor.

Avoid using AI for: checking the truth of nuanced or complex claims (LLMs cannot reliably arbitrate on nuanced topics); bypassing the mental modeling work of chunking, evaluating relationships, and judging importance — these are the processes that create encoding. If AI does the structural thinking, the learner coasts into passive understanding mode without building the schema. For beginners especially, avoid using AI to chunk and organize information until there is a reliable internal sense of when cognitive load is passive versus active.

The practical test: retrieve first, fully generate an answer, then use the model for feedback and calibration. When AI becomes the primary thinker and the learner becomes the editor, the benefit of retrieval practice disappears.

How It Should Feel

SIR should feel like controlled strain. The point is to expose what can and cannot be reconstructed under spacing, variation, and discrimination.

Good signs:

recall feels effortful but possible;
gaps become specific;
interleaving makes similar ideas easier to distinguish;
spacing reveals what is actually stable;
and failures point back to either retrieval practice or re-encoding.

Warning sign: SIR has become recognition when checking notes feels easier than reconstructing first.

Open Questions

What is the user’s real weekly retrieval capacity?
Which current subjects are declarative-heavy, procedural-heavy, or mixed?
Should this vault include an SIR scheduler template?

Logos52

Browse

Spaced Interleaved Retrieval

Summary

What It Tests

Default Retrieval Menu

SIR Methods

Declarative Methods

Procedural Methods

Methods To Avoid As Defaults

Encoding and Retrieval: The Tradeoff

Concrete Spacing Schedule

Scheduling Heuristic

Session Rules

Gap Types

Relationship To Encoding

LLM Use

How It Should Feel

Open Questions

Graph View

Table of Contents

Backlinks

Logos52

Browse

Spaced Interleaved Retrieval

Summary

What It Tests

Default Retrieval Menu

SIR Methods

Declarative Methods

Procedural Methods

Methods To Avoid As Defaults

Encoding and Retrieval: The Tradeoff

Concrete Spacing Schedule

Scheduling Heuristic

Session Rules

Gap Types

Relationship To Encoding

LLM Use

How It Should Feel

Related Concepts

Open Questions

Graph View

Table of Contents

Backlinks