---
id: "action-convert-markdown"
type: "action-item"
source_timestamps: ["00:03:18", "00:04:14"]
tags: ["preprocessing", "cost-saving"]
related: ["concept-markdown-conversion", "claim-pdf-markdown-savings", "entity-openbrain"]
speakers: ["Nate B. Jones"]
action: "Convert heavy files to Markdown before LLM ingestion."
outcome: "Reduces document token footprint by up to 20x."
sources: ["s45-claude-limit-chatgpt-habit"]
sourceVaultSlug: "s45-claude-limit-chatgpt-habit"
originDay: 45
---
# Convert All Documents to Markdown

## Action
Before uploading **any** PDF, Word document, or presentation to an LLM, run it through a Markdown converter to strip formatting metadata and reduce token bloat by up to 20x.

## Outcome
Reduces document token footprint by up to **20x** (e.g., 100K → 5K tokens for ~4,500 words of actual prose). Compounds further across multi-turn chats due to LLM statelessness.

## Tools
- [[entity-openbrain-d45]] — open-source Markdown conversion plugins referenced by the speaker
- Alternatives: PyMuPDF, Unstructured.io, Marker (per enrichment overlay)

## Why
See [[concept-markdown-conversion]] for mechanism and [[claim-pdf-markdown-savings]] for validation. Step 1 of [[framework-clean-conversation]]; first audit question of [[framework-stupid-button-audit]].