CPRR Methodology
Conjecture-Proof-Refutation-Refinement

Table of Contents

1. Introduction

CPRR (Conjecture-Proof-Refutation-Refinement) is a research methodology that treats hypotheses as first-class artifacts with explicit lifecycle management, evidence tracking, and refutation gates.

2. Core Workflow

diagram-cprr-workflow.png

3. State Machine

diagram-cprr-states.png

4. Conjecture Structure

diagram-cprr-conjecture.png

5. Evidence Types

diagram-cprr-evidence.png

6. Integration with Issue Tracking

diagram-cprr-integration.png

7. Refutation Gates

diagram-cprr-gates.png

8. CPRR vs Traditional Approaches

diagram-cprr-comparison.png

9. Tooling Requirements

diagram-cprr-tools.png

10. Example Workflow

  1. Conjecture: "Algorithm X has O(n log n) complexity"
  2. Evidence: Benchmark data, theoretical analysis
  3. Refutation attempt: Edge case with O(n²) behavior
  4. Refinement: "Algorithm X has O(n log n) for sorted input"
  5. New conjecture: Refined hypothesis
  6. Proof: Formal verification or exhaustive testing

11. Applications

  • Research hypothesis management
  • Experimental software development
  • Formal methods integration
  • Multi-agent coordination
  • Knowledge base maintenance

12. Multi-Agent Team Patterns

The four-agent pattern recurs across the literature. Each framework names the roles differently but the structure converges:

Framework Role 1 Role 2 Role 3 Role 4 Year
CPRR Conjecture Proof Refinement Refutation 2026
Debate (Du et al.) Proposer Challenger Judge Arbiter 2023
ChatEval Prosecutor Defender Judge Recorder 2023
MetaGPT Architect Engineer QA PM 2023
CrewAI Researcher Writer Reviewer Editor 2024
CAMEL Instructor Assistant Critic User 2023
AutoGen Planner Coder Executor Critic 2023
DeerFlow Coordinator Researcher Coder Reporter 2025

The canonical decomposition:

  1. Architect / Planner — decides what to build
  2. Builder / Coder — writes the implementation
  3. Reviewer / Critic — finds problems
  4. Evaluator / Judge — accepts or rejects

CPRR differs from the others in two ways: the refutation gate can update the spec (the contract is mutable), and the evidence is persisted in git notes rather than ephemeral agent memory.

12.1. Dot-product decomposition

When a spec has N requirements and M implementations (e.g. the compliance harness: 13 requirements × 10 languages), the work is the dot product of two dimensions. The orchestrator parallelizes across languages (columns) but serializes across requirements (rows) because the gate pipeline order is normative.

12.2. Persistence strategies

Strategy Framework Survives compaction?
Ephemeral memory AutoGen, CrewAI No
JSON files MetaGPT Yes (manual)
Database DeerFlow Yes
git commits CPRR (proof) Yes
git notes CPRR (refutation) Yes
beads (bd) Walsh workflow Yes
aq gossip Walsh workflow Advisory only
PreCompact hook Claude Code Yes

12.3. Related

13. Resources

  • Lakatos, "Proofs and Refutations" (1976)
  • Popper, "The Logic of Scientific Discovery" (1959)
  • Du et al., "Improving Factuality and Reasoning in Language Models through Multiagent Debate" (2023)
  • Hong et al., "MetaGPT: Meta Programming for Multi-Agent Collaborative Framework" (2023)