a builder's codex
codex · patterns · Post-training data as moat vs. distribution/ecosystem as moat

Post-training data as moat vs. distribution/ecosystem as moat

Position A — Post-training on proprietary data is the moat

Position B — Software/data is not the moat; ecosystems and distribution are

Conditions distinguishing them

Resolution / synthesis

Genuine layered contradiction. Sharma's claim that post-training is the moat contradicts Spiegel's claim that software (which includes ML weights) is not the moat. They cannot both be the dominant moat for the same business.

Resolution by layer:

The cards together describe a moat hierarchy: distribution > ecosystem > post-training > pre-training. Sharma is right within her layer; Spiegel is right across layers. The genuine disagreement: at the application company layer, would you bet defensibility on a proprietary post-training pipeline (Sharma-implied: yes) or on distribution (Spiegel: no, software isn't the moat)? Most evidence in the corpus tilts toward Spiegel for application companies and Sharma for model providers.

Open the interactive view →