markdown-content-parity should distinguish intentional audience segmentation from unintentional content drift #14

@devalog

Current behavior: The markdown-content-parity check assumes that any drift between markdown and HTML content means agents are getting outdated information and treats all divergence as a problem.

Problem: Content drift isn't always a deficiency. Some sites intentionally serve different content to different audiences — for example, using tags or other mechanisms to provide agent-optimized markdown alongside human-optimized HTML. In those cases, divergence is a feature, not a bug.

Desired behavior: The check should distinguish between two scenarios:

  • Unintentional drift — markdown and HTML have fallen out of sync (the current assumption, and a real problem).
  • Intentional audience segmentation — the site deliberately serves different content for agents vs. humans, e.g., through structured tags or metadata indicating the split is purposeful. For example, Fern specifically does this through <llms-only> and <llms-ignore> tags.

I don't have a specific implementation in mind, but roughly: the check should pass if the markdown and HTML content are equivalent, or if there's some evidence the content is intentionally segmented by audience.
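To make the "roughly" above a bit more concrete, here is a minimal sketch of the decision logic in Python. Everything in it is an assumption for illustration, not the check's actual code: the names `check_parity`, `ParityResult`, `normalize`, and `has_segmentation_markers` are hypothetical, the whitespace/case normalization is just one possible notion of "equal", and treating the mere presence of `<llms-only>` or `<llms-ignore>` in the page source as evidence of intent is a simplification. Whether those tags are even visible to the check (for example, in the unrendered source rather than the served output) would need to be confirmed per site.

```python
import re
from enum import Enum

# Tag names are taken from the issue text (Fern's <llms-only> / <llms-ignore>).
# Assumption: their presence in the page source counts as evidence of intent.
SEGMENTATION_TAG = re.compile(r"<\s*(llms-only|llms-ignore)\b", re.IGNORECASE)


class ParityResult(Enum):
    PASS_IN_SYNC = "pass: markdown matches HTML"
    PASS_SEGMENTED = "pass: divergence appears intentional"
    FAIL_DRIFT = "fail: unexplained drift"


def normalize(text: str) -> str:
    """Collapse whitespace and lowercase so trivial formatting differences
    are not counted as drift. (Hypothetical notion of equality.)"""
    return " ".join(text.split()).lower()


def has_segmentation_markers(page_source: str) -> bool:
    """Return True if the source contains an audience-segmentation tag."""
    return SEGMENTATION_TAG.search(page_source) is not None


def check_parity(markdown_text: str, html_text: str, page_source: str = "") -> ParityResult:
    """Pass if the two versions match, or if the divergence looks intentional."""
    if normalize(markdown_text) == normalize(html_text):
        return ParityResult.PASS_IN_SYNC
    if has_segmentation_markers(page_source):
        return ParityResult.PASS_SEGMENTED
    return ParityResult.FAIL_DRIFT
```

Returning three distinct results instead of a plain pass/fail would let the report say why a divergent page passed, which seems useful if the heuristic for "intentional" turns out to be too generous.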
