Skip to main contentSkip to content

Playbook for content publishers

Your main question is usually: "How do I allow useful discovery without exposing the site naively?"

Read first

Your risk model

You care most about:

  • AI answer-generation usage;
  • model-training posture;
  • archive replay;
  • scraper pressure;
  • editorial reuse without clear boundaries.

Do

  • separate answer-generation from training in your reasoning;
  • decide what kind of machine access is acceptable;
  • choose a preset that reflects editorial goals, not just technical habit;
  • document the difference between openness and naivety.

Don’t

  • collapse all bots into one category;
  • assume robots.txt alone solves hostile scraping;
  • use vague language like "block AI" when the real policy is more granular.