feat: Add HTML representation by katosh · Pull Request #2236 · scverse/anndata

katosh · 2025-11-29T20:01:57Z

Rich HTML representation for AnnData

Closes HTML Repr #675
Tests added
Release note not necessary because:

Summary

Implements rich HTML representation (_repr_html_) for AnnData objects in Jupyter notebooks. Builds on previous draft PRs (#784, #694, #521, #346) with a complete, production-ready implementation.

Live Demo | Reviewer's Guide (technical details, design decisions, extensibility examples)

Screenshot

Features

Interactive Display

Foldable sections with auto-collapse for large datasets
Search/filter with regex and case-sensitive toggles
Copy-to-clipboard for field names
Nested AnnData expansion with configurable depth
.raw section showing unprocessed data (Report n_vars of .raw in __repr__ #349)

Visual Indicators

Category colors from uns palettes (e.g., cell_type_colors)
Type badges for views, backed mode, sparse matrices, Dask arrays
Serialization warnings for data that won't write to H5AD/Zarr
Value previews for simple uns values
README support via modal (renders markdown from uns["README"])
Memory info in footer

Serialization Warnings

Proactively warns about data that won't serialize:

Level	Issue	Related
🔴 Error	datetime64/timedelta64 in obs/var	#455, #2238
🔴 Error	Non-string keys	#321
🔴 Error	Object columns with dicts/lists/custom objects	#1923, #567, #636
🔴 Error	Non-serializable types in uns
🟡 Warning	Keys with `/` (deprecated)	#1447, #2099
🟡 Warning	String→categorical auto-conversion	#534, #926

Compatibility

Dark mode auto-detection (Jupyter Lab/VS Code, Furo/sphinx-book-theme)
No-JS fallback with graceful degradation
JupyterLab safe - CSS scoped to .anndata-repr prevents style conflicts
Lazy-loading safe - configurable partial loading for read_lazy() (categories, colors)
Zero dependencies added

Extensibility

Three extension mechanisms for ecosystem packages (MuData, SpatialData, TreeData):

TypeFormatter - Custom visualization for value types
SectionFormatter - Add new sections (e.g., obst/vart, mod)
Building blocks - CSS/JS/helpers for packages needing full control

See the Reviewer's Guide for examples and API documentation.

Testing

601 unit tests organized by responsibility (core, sections, formatters, UI, warnings, registry, lazy, robustness, Jupyter compatibility)
108 escaping/robustness tests covering escaping coverage at every user-data insertion point, broken objects, size bombs, threading
HTMLValidator for structured HTML assertions (section-aware, no external dependencies)
26 visual test scenarios: python tests/visual_inspect_repr_html.py

Supersedes Draft for AnnData html repr #784, Initial draft of AnnData HTML repr #694, WIP: add _repr_html_() method to AnnData for nicer rendering in Jupyter #521, [draft] html repr #346 (previous drafts)
Compatible with feat: allow gpu io in sparse_dataset by removing scipy inheritance #1927 (sparse scipy changes), feat: array-api compatibility #2063 (Array-API)
Fully backward compatible

Acknowledgments

Thanks to @selmanozleyen (#784), @gtca (#694), @VolkerH (#521), @ivirshup (#346, #675), and @Zethson (#675) for prior work and discussions.

Technical Notes and Edits

Lazy Loading

Constants are in _repr_constants.py (outside _repr/) to prevent loading ~6K lines on import anndata. The full module loads only when _repr_html_() is called.

Config Changes

pyproject.toml: Added vart to codespell ignore list (TreeData section name).

Edit (Dec 27, 2024)

To simplify review and reduce the diff, I've merged settylab/anndata#3 into this PR. That PR was originally created as a follow-up to explore additional features based on the discussion with @Zethson about SpatialData/MuData extensibility.

What changed:

Exported building blocks - CSS, JavaScript, and rendering helpers for external packages to build custom reprs while reusing anndata's styling
.raw section - Expandable row showing unprocessed data (Report n_vars of .raw in __repr__ #349)
Enhanced serialization warnings - Extended to cover datetime64, non-string keys, slashes in keys, and all sections
Regex search - Case-sensitive and regex toggles for filtering
Robust error handling - Failed sections show visible error indicators instead of being silently hidden

Edit (Jan 4, 2025)

Moved detailed implementation documentation (architecture, design decisions, extensibility examples, configuration reference) to the Reviewer's Guide to keep this PR description focused on features.

Code refactoring:

Split html.py into focused modules for maintainability
UI components extracted to components.py (badges, buttons, icons)
Section renderers moved to sections.py (obs/var, mapping, uns, raw)
Shared rendering primitives extracted to core.py (avoids circular imports)
Preview utilities moved to utils.py
FormatterContext consolidates all 6 rendering settings (read once at entry, propagated via context)
Result: html.py reduced from ~2100 to ~740 lines, clean import hierarchy

New features:

"Lazy" badge for read_lazy() AnnData objects (experimental) - indicates when obs/var are xarray-backed
Visual test for lazy AnnData (9b) - demonstrates lazy loading with (lazy) indicator on columns

Bug fixes:

Consistent meta column styling - all meta column text now uses adata-text-muted class for uniform appearance
Bytes index decoding - properly decode bytes values in index previews

Related issue discovered:

read_lazy() returns index values as byte-representation strings (e.g., "b'cell_0'" instead of "cell_0") - see ISSUE_READ_LAZY_INDEX.md

Edit (Jan 6, 2025)

Smart partial loading for read_lazy() AnnData:

Previously, lazy AnnData showed no category previews to avoid disk I/O. Now we do minimal, configurable loading to get richer visualization cheaply: only the first N category labels and their colors are read from storage (not the full column data). New setting repr_html_max_lazy_categories (default: 100, set to 0 for metadata-only mode).

Visual tests reorganized: 8 (Dask), 8b (lazy categories), 8c (metadata-only), 9 (backed).

Edit (Jan 6, 2025 - continued)

FormattedOutput API and architecture:

Clean separation between formatters and renderers - formatters inspect data and produce complete FormattedOutput, renderers only receive FormattedOutput (never the original data).

The FormattedOutput dataclass fields were renamed to be self-documenting:

Old Field	New Field	Purpose
`meta_content`	`preview` (text) or `preview_html` (HTML)	Preview column content
`html_content` + `is_expandable=True`	`expanded_html`	Collapsible content below row
`html_content` + `is_expandable=False`	`preview_html`	Inline preview in preview column
`is_expandable`	Removed	Use `expanded_html is not None`
(new)	`type_html`	Custom HTML for type column (replaces `type_name` visually)

Naming convention: *_html suffix indicates raw HTML (caller responsible for escaping), plain text fields are auto-escaped.

UI/UX improvements:

Zebra striping for section rows (alternating background colors)
Expand buttons now use ▼/▲ arrows instead of ⋯/▲ for consistency
No borders between entries within sections (cleaner look)
Fixed button alignment - Expand and wrap buttons now align properly
Category list styling - explicit muted color ensures consistent appearance in nested contexts

Edit (Jan 7, 2025)

Test architecture overhaul:

Tests reorganized from a single file into 10 focused modules for maintainability and parallel execution:

File	Focus
`test_repr_core.py`	HTML validation, settings, badges
`test_repr_sections.py`	Section rendering (obs, var, uns, etc.)
`test_repr_formatters.py`	Type-specific formatters
`test_repr_ui.py`	Folding, colors, search, clipboard
`test_repr_warnings.py`	Serialization warnings
`test_repr_registry.py`	Plugin registry
`test_repr_lazy.py`	Lazy AnnData support
`test_html_validator.py`	HTMLValidator tests + Jupyter compatibility

HTMLValidator class (conftest.py) provides structured HTML assertions:

v = validate_html(html)
v.assert_section_exists("obs")
v.assert_section_contains_entry("obs", "batch")
v.assert_section_initially_collapsed("obs")  # or _not_initially_collapsed

Key features: regex-based (no dependencies), section-aware matching, exact attribute matching to avoid "obs" matching "obsm".

Optional strict validation when dependencies available:

validate_html5() - W3C HTML5 + ARIA (requires vnu)
validate_js() - JavaScript syntax (requires esprima)

Jupyter Notebook/Lab compatibility tests (13 new tests in TestJupyterNotebookCompatibility):

Validates CSS scoping, JavaScript isolation, unique IDs across multiple cells, and Jupyter dark mode support.

Bug fix: readme-modal-title ID is now unique per container to prevent ID collisions when multiple AnnData objects are displayed in the same notebook.

Edit (Jan 8, 2025)

Maintainability improvements:

Fix	Description
Entry rendering	Consolidated `_render_entry_row` and `render_formatted_entry` to eliminate duplication
Debug logging	Added `get_formatter_for()` and `list_formatters()` methods to FormatterRegistry
Import hierarchy	Documented module dependency tree at top of `__init__.py`
Static assets	Moved CSS (~1060 lines), JS (~380 lines), markdown parser (~150 lines) to `static/` directory
FormattedOutput docs	Enhanced field documentation with precedence rules and CSS class reference
HTMLValidator	Moved to separate `tests/repr/html_validator.py` module (conftest.py: 960→270 lines)
Magic strings	Extracted CSS classes and section names to `_repr_constants.py`
TypeCellConfig	Added dataclass to simplify `render_entry_type_cell()` signature
Lazy module	Consolidated lazy loading utilities to new `lazy.py` module
CSS colors	Moved 148 CSS color names to `static/css_colors.txt` for easy updates

File structure changes:

src/anndata/_repr/
├── static/                  # NEW: Static assets directory
│   ├── __init__.py
│   ├── repr.css             # CSS template (~1060 lines)
│   ├── repr.js              # JavaScript (~380 lines)
│   ├── markdown-parser.js   # Markdown parser (~150 lines)
│   └── css_colors.txt       # CSS named colors (148 colors)
├── lazy.py                  # NEW: Lazy loading utilities
└── ...

API simplifications:

render_entry_type_cell() now accepts TypeCellConfig dataclass instead of 10 individual parameters
Lazy utilities consolidated: is_lazy_adata(), is_lazy_column(), get_lazy_categories(), get_lazy_categorical_info()
Static assets loaded via importlib.resources.files() (Python 3.9+)

Edit (Jan 9, 2025)

Robustness & escaping coverage testing:

Added 108 tests in test_repr_robustness.py across 14 test classes:

Escaping coverage (12 tests): verifies html.escape() is called at every user-data insertion point using a <b>MARKER</b> probe
Unicode edge cases (emoji, CJK, RTL override, zero-width chars)
Broken objects (crashing __repr__, __len__, __sizeof__, properties)
Size handling (huge strings, many categories, deep nesting)
Color array robustness (too many/few, invalid formats, empty)
Thread safety (concurrent repr generation)

Escaping tests trust html.escape() (stdlib) and only verify it's called at every insertion point, rather than exercising the escaping mechanism itself with attack vectors.

Test cleanup:

Removed redundant and overly-specific tests to focus on meaningful coverage. Tests now verify behavior that matters (e.g., XSS escaped, errors visible, truncation applied) rather than testing identical code paths multiple times.

Visual inspection: Consolidated to 26 scenarios with single comprehensive "Evil AnnData" test combining all adversarial patterns.

Fixes:

Added repr_html_max_readme_size to _settings.pyi type stubs
Fixed strict warnings compatibility (pytest.warns for expected warnings)
Section error truncation now shows "..." indicator when message exceeds limit

Updated stats:

Metric	Value
Total tests	601
Robustness tests	108 (14 test classes)
Visual scenarios	26
Settings	11

Edit (Jan 16, 2025)

Error handling consolidation:

Refactored error handling to use a single error field in FormattedOutput instead of separate is_hard_error parameters scattered across the codebase.

Key changes:

Component	Change
`FormattedOutput`	Added `error: str \| None` field with documented precedence over `preview`/`preview_html`
`FallbackFormatter`	Made bulletproof - wraps every attribute access in try/except, checks serializability and includes reason in warnings
`FormatterRegistry.format_value()`	Accumulates failed formatters instead of stopping at first failure
`render_formatted_entry()`	Removed `is_hard_error` param, now detects via `output.error`
`_validate_key_and_collect_warnings()`	Returns `(key_warnings, is_key_not_serializable)` - key issues mark as not serializable, preserving preview

Error vs Warning separation:

output.error: Hard rendering failure - row highlighted red, error message replaces preview
output.is_serializable=False: Serialization warning - red background, but preview preserved
Tooltip format: "Not serializable to H5AD/Zarr: {reason}" uses ":" to connect to reason, ";" separates independent warnings

New behavior when formatters fail:

Registry tries all matching formatters in priority order
Failed formatters are accumulated (full message for warnings, type-only for HTML)
If a later formatter succeeds: warnings emitted about earlier failures
If all fail: accumulated errors passed to fallback formatter

This prevents long error messages from appearing in HTML while preserving full details in warnings for debugging. Serialization issues (like non-string keys, lambdas, custom objects) preserve the value preview while showing the reason in the tooltip.

Updated stats:

Metric	Value
Total tests	601
Robustness tests	108 (14 test classes)
Source lines	~6,500 Python + ~2,130 static assets
Test lines	~10,450 (13 files)

Edit (Jan 26, 2025)

Review response changes (addressing @flying-sheep's review):

Typing: Any → object

Replaced all ~95 uses of Any across 7 files. Formatter method signatures now use obj: object since AnnData's uns accepts genuinely arbitrary objects and formatters handle AnnData-like objects (e.g., MuData) via duck typing. dict[str, Any] with known structure replaced with precise union types.

CSS: Native nesting + dark mode + variable dedup

Full conversion of repr.css to native CSS nesting (&). Selector repetitions of .anndata-repr reduced from 173 to 13. File length unchanged (~1164 lines) because the feature surface is genuinely large (~68 component blocks, 14 dtype colors, copy button, README styling, state variants), not because of repetition.
Added Sphinx theme dark mode selectors ([data-theme="dark"] for Furo/sphinx-book-theme) alongside existing Jupyter/VS Code detection.
Dark mode variables (~35 declarations) deduplicated: defined once in Python and substituted into both the @media (prefers-color-scheme: dark) block and theme-selector block.
Limitation: BEM modifiers (&--variant) produce invalid CSS at nesting depth 2+ (browser treats & as :is(parent child), so &--view becomes :is(.anndata-repr .anndata-badge)--view). 7 modifier rules flattened to sibling selectors.

Security tests simplified

Replaced ~34 attack-vector-heavy tests with 12 focused escaping-coverage tests. Each test puts a <b>MARKER</b> probe at one user-data insertion point and verifies it appears escaped. Removed TestCSSAttacks, TestEncodingAttacks; trimmed TestBadColorArrays, TestEvilReadme; consolidated TestUltimateEvilAnnData to 1 test. Total: 108 tests (14 classes), down from 123 (16 classes).

Other:

FormatterContext.column_name renamed to FormatterContext.key
Key validation moved into FormatterRegistry.format_value()
HTML validator tests updated for native CSS nesting (vnu doesn't support nesting syntax yet, so CSS parse errors are filtered)

Future-Proofing: Related PRs and Issues

This PR includes explicit handling and/or code references to track compatibility with several in-progress or future changes. The following PRs/issues may trigger updates to the _repr module:

Already Handled

PR/Issue	Description	Status in `_repr`	Code Locations
#1927	Removes scipy sparse inheritance	✅ `SparseMatrixFormatter` uses duck typing fallback	`formatters.py:242,260,307`
#2063	Array-API compatibility	✅ `ArrayAPIFormatter` via duck typing	`formatters.py:771,1135`
#2071	Array-API backends (JAX, Cubed)	✅ Covered by `ArrayAPIFormatter`	(same as #2063)

May Require Updates When Merged

PR/Issue	Description	Current Handling	Code Locations
#2288	`LazyCategoricalDtype` API	Accesses private `CategoricalArray` internals	`lazy.py` (all functions)
#1923	List data types in `obs`	Marked not serializable	`formatters.py:159`

Internal API Usage Inventory

Current patterns accessing internal/private APIs that may be replaceable:

Location	Current Pattern	Replacement Opportunity
`lazy.py:_get_categorical_array()`	Navigates xarray internals: `col.variable._data.array`	Post-#2288: Check `isinstance(dtype, LazyCategoricalDtype)`
`lazy.py:get_lazy_category_count()`	Accesses private `CategoricalArray._categories["values"].shape[0]`	Post-#2288: Use `dtype.n_categories`
`lazy.py:get_lazy_categorical_info()`	Accesses private `._categories`, `._ordered`	Post-#2288: Use `dtype.n_categories`, `dtype.ordered`
`lazy.py:get_lazy_categories()`	Uses `read_elem_partial()` on private `._categories`	Post-#2288: Use `dtype.head_categories(n)`
`lazy.py:is_lazy_adata()`	String check: `obs.__class__.__name__ == "Dataset2D"`	Consider proper type import if stable
`SparseMatrixFormatter.can_format()`	Duck typing: checks `nnz`, `tocsr`, `tocsc`	Post-#1927: Use anndata's sparse utilities if provided
`ArrayAPIFormatter.can_format()`	Duck typing: checks `shape`, `dtype`, `ndim`	Keep — follows Array API standard
`BackedSparseDatasetFormatter.can_format()`	Checks module name + `format` attr	Verify post-#1927

ilan-gold · 2026-02-23T18:16:13Z

or leave out the extensibility for now, as we’d do 1. before committing to a public API

I tend to think this is the way to go. Let's not bite off more than we need to here. I genuinely don't have a good grasp on what the use-case is here in strong terms - MuData already has its own renderer for example.

Not really, ideally there wouldn’t be much Python left, the idea was that the AnnData object would be turned into a simple render-ready JSON-like data structure (TypedDicts), which would then directly be rendered by a tree of templates.

Right, and this could build off of the work in #2290 and extend the JSON schema there. I would also go for a less-feature complete but more robust version of a JSON schema. For example, I know that categories can get big, but I think we should not worry about that. That is a v2 feature.

katosh · 2026-02-27T00:10:25Z

Thanks for the detailed proposal in your latest comments. I've mapped each feature onto the TypedDict + Jinja architecture to understand what transfers and what doesn't. To help navigate, here's where I address each of your points:

@flying-sheep's TypedDict + Jinja proposal → What maps cleanly, What a Jinja migration must reimplement, Maintenance cost at the boundary, What TypedDicts structurally prevent
@flying-sheep's Jinja security argument → On Jinja and security
@ilan-gold's "less feature-complete but more robust" → On robustness and scope
@ilan-gold's "what's the use-case for extensibility" → Why extensibility matters
JSON schema / feat: accessors #2290 → On JSON export (design questions from my earlier comment)

I've linked to relevant earlier comments throughout. Some points below build on arguments from earlier in the thread — I'd find it most productive if we can engage with those discussions rather than revisiting them from scratch.

Before diving in: the instinct behind TypedDict + Jinja is architecturally sound in the general case — separating data from presentation, defaulting to auto-escaping, enabling JSON-serializable intermediates. If the rendering layer were the complex part of this system, I'd agree templates are the right tool.

But in this system, the complexity lives in the Python formatting layer — type dispatch, error recovery, context-dependent decisions — which survives a Jinja migration unchanged. Jinja replaces the rendering layer, which is the simpler part. I want to walk through that concretely rather than assert it.

The core tradeoff

@flying-sheep outlined two options: (1) TypedDict + Jinja with extensibility built around it, or (2) leave out extensibility for now. @ilan-gold favors option 2:

less feature-complete but more robust

I understand the core concerns here are maintainability and security — you'll be maintaining this code long-term and are responsible for ensuring it's safe. I share those goals. But as I'll show below, TypedDict + Jinja does not deliver the improvements in robustness and maintainability it appears to promise, and introduces a new maintenance cost at the Python/Jinja boundary.

The key thing to surface is that this isn't just about deferring extensibility — TypedDict + Jinja is architecturally at odds with it. The features that require extensibility (ecosystem custom HTML, per-type dispatch) can't be expressed in a fixed TypedDict schema without falling back to |safe, which undermines the security rationale for adopting Jinja. This isn't deferring extensibility — it's adopting an architecture that structurally prevents it.

So the real question is: do we want extensibility? But first, since the proposal seems to assume there's no structured intermediate representation, let me recap the architecture so we're working from the same mental model.

How the current architecture works

The PR doesn't go from object to HTML in one step. There are three layers:

Type dispatch (FormatterRegistry, registry.py:699-914): when the repr encounters a value, it matches the value's Python type against registered formatters — pd.Categorical → CategoricalFormatter, np.ndarray → ArrayFormatter, etc. — with priority ordering and fallback chains. The base class for formatters (TypeFormatter, registry.py:249-348) defines the dispatch interface.
Structured intermediate representation (FormattedOutput, registry.py:72-168): each formatter returns a dataclass with explicit typed fields — type_name, css_class, tooltip, warnings, preview, preview_html, expanded_html, is_serializable, error (registry.py:72-168). This is the separation of "what to show" from "how to show it." It's not a JSON dict, but it is a structured, inspectable data object with the same role.
Rendering (components.py, core.py — ~1,000 lines): FormattedOutput fields are assembled into HTML. This layer contains no type dispatch, no try/except for data access, and no data introspection — it takes structured data and produces HTML strings.

We already have separation of concerns. The question is whether the rendering half should be written in Python or in Jinja — not whether separation exists. TypedDict + Jinja would replace layers 2 and 3, but the complexity doesn't live there. It lives in the formatters (~2,200 lines across registry.py and formatters.py), which use Python features that Jinja templates can't provide: type introspection for dispatch (isinstance checks, priority ordering), try/except for defensive error recovery, cross-references to adata.uns for color lookups, and FormatterContext for section- and key-dependent decisions. This logic would need to remain in Python as a "crawl phase." What TypedDict + Jinja actually replaces is the rendering layer — ~1,000 lines of HTML assembly — less than half the size of the formatting logic it leaves untouched.

What maps cleanly to TypedDict + Jinja

These features are fully compatible with a JSON-serializable intermediate representation — roughly 60-70% of the visual output:

Basic layout — section headers, entry rows, column widths
Badges (view, backed, lazy, extension) — pre-computed booleans
Dark mode — CSS light-dark(), independent of rendering backend
Foldable <details> — a boolean is_folded field per section
Memory info, shape metadata — scalar fields (n_obs, n_vars, nbytes)
Serialization warnings — pre-computed booleans
Search/filter, copy-to-clipboard — JS reads data-* attributes, unaffected by template engine

What a Jinja migration must reimplement

The remaining features CAN be expressed as TypedDict fields — but the formatting logic that populates those fields requires Python features that can't move into Jinja templates. With TypedDict + Jinja, this logic must be reimplemented as a Python "crawl phase" that does the same work as the current formatters.

Category colors (formatters.py:548-577): cross-references adata.uns["{key}_colors"], respects max_lazy_categories for lazy AnnData, truncates to visible categories. Requires context.adata_ref access, conditional logic for lazy vs. eager, and try/except around color lookup.
Context-dependent formatting (registry.py:181-246): FormatterContext carries adata_ref, section, and key. CategoricalFormatter checks context.section to decide whether to show category previews.
Defensive error recovery (registry.py:460-662): FallbackFormatter wraps every attribute access individually (.shape, .dtype, len(), repr(), str()) in its own try/except and assembles partial results — ~200 lines of Python.
Recursive nested AnnData (formatters.py:876-910): AnnDataFormatter calls generate_repr_html() recursively with depth tracking.

None of this logic goes away — it's reimplemented targeting TypedDict output instead of FormattedOutput.

The maintenance cost at the boundary

In addition, TypedDict + Jinja introduces a new maintenance cost that the current system doesn't have: a dual-contract boundary between the crawl phase and the template.

In the current system, FormattedOutput is a resolved contract. The formatter handles all ambiguity — which attributes worked, what to show when something failed, how to truncate — and produces fixed fields (type_name, error, tooltip). The renderer reads those fields. It's a dumb pipe. A change to what the formatter produces is visible in the dataclass definition and caught by type checking.

With TypedDict + Jinja, this changes. The TypedDict carries unresolved data — nullable fields for each attribute, raw category lists, error sentinels. The template must handle every combination with its own conditionals:

{% if entry.shape is not none %}({{ entry.shape|join(', ') }}){% endif %}
{% if entry.dtype %} {{ entry.dtype }}{% endif %}
{% if entry.error %}<span class="warning">⚠ {{ entry.error }}</span>{% endif %}
{% if entry.colors %} {# render swatches #} {% endif %}

That's two layers that must agree on: what fields are nullable, what null means, how partial results compose. A change to what the crawl phase produces can silently break the template, with no compile-time check across the Python/Jinja boundary. Mypy checks the TypedDict definition in Python; it cannot check that the template handles every nullable combination correctly. This is in tension with the typing rigor we've established elsewhere in this PR — the strict typing that motivated removing Any from formatter interfaces stops at the template boundary.

This compounds with JSON export. Adding a JSON consumer to the same TypedDict creates a third site that must handle the same combinatorial space of nullable fields — crawl, template, JSON serializer — all implementing their own conditional logic for the same partial-failure scenarios, all kept in sync manually.

	Writers	Consumers	Contract
Current system	Formatter resolves all ambiguity	Renderer reads fixed fields	Single, checked by `dataclass` + `mypy`
TypedDict + Jinja	Crawl produces unresolved data	Template + JSON serializer each handle nullable combinations	Dual/triple, unchecked across language boundary

Concretely, when FallbackFormatter encounters an object where .shape raises but .dtype works:

Current: formatter resolves → FormattedOutput(type_name="Broken", tooltip="dtype=float32", error="shape failed"). Renderer shows it. One decision point.
TypedDict + Jinja: crawl → {"type_name": "Broken", "shape": null, "dtype": "float32", "error": "shape failed"}. Template: conditional branches for each nullable field. JSON serializer: same conditionals, different output format. Two (or three) decision points that must agree.

This is the opposite of reduced maintenance burden. The current system resolves ambiguity once, in the formatter. TypedDict + Jinja defers it to every consumer.

What TypedDicts structurally prevent

Unlike the items above, these features are genuinely incompatible with a fixed TypedDict schema — not because of implementation effort, but because of structural limitations in how Jinja2 extensibility works.

Ecosystem custom HTML. A TypedDict has a fixed set of fields. There's no field for "SVG bar chart of category distribution" or "ontology badge" or "tree visualization." An ecosystem package that wants to show a custom preview needs to return HTML — but in @flying-sheep's vision, the entire pipeline avoids |safe/Markup and ecosystem packages ship Jinja templates instead.

I investigated what that would look like in practice. Jinja's extensibility mechanisms — template inheritance, macros, extensions — operate on a linear chain model: a child template extends a parent. When multiple independent packages (TreeData, SpatialData, bionty) each want to add type renderers, they'd each need to extend the same base template. But Jinja inheritance is single-parent: treedata.html {% extends "anndata.html" %} and spatialdata.html {% extends "anndata.html" %} can't both be active simultaneously without one extending the other, creating artificial dependencies between unrelated packages. Using {% include %} with a Python-managed template list doesn't resolve this — it still requires a Python registry to decide which template to include for which type, putting the dispatch logic back in Python with Jinja as syntactic sugar. Compare with the current Python registry, where each package independently calls register_formatter() — no package needs to know about any other.

Every project I surveyed that needs open type-dispatched rendering (Django Admin, Flask-Admin, WTForms, Sphinx, nbconvert) uses Python for dispatch and templates only for structural layout. None uses Jinja template inheritance for open-ended type registration.
Per-type dispatch within sections. When the repr encounters a value in obs, var, or uns, it looks at the value's Python type and picks the matching formatter (registry.py:699-914). Ecosystem packages can register formatters for their own types, restrict them to specific sections, and priority ordering resolves conflicts. This isn't just an extensibility concern — the internal templates would also need type-checking logic. With Jinja, the options are:
- {% if %}/{% elif %} chains in the template — hardcodes the set of types, requires template modification to add new ones
- Calling a Python dispatch function from the template ({{ dispatch(entry) }}) — but the return value is HTML, requiring |safe, and at that point the Jinja template is a thin wrapper calling Python
- A Jinja extension that delegates to a Python registry — which works, but means the dispatch logic is 100% Python with Jinja as syntactic sugar
@flying-sheep's template inheritance example ({% block attribute %}{% if attr_name != "tem" %}{{ super() }}{% else %}{% include "tem.html" %}{% endif %}{% endblock %}) works for section-level customization (e.g., adding an obst section for TreeData). It does not address per-type rendering within a section — there's no way to say "render this pd.Categorical differently from that np.ndarray" within the same obs section using blocks alone.

These aren't features that could be added later on top of TypedDict + Jinja. The only escape hatch is |safe/Markup(), which undermines both the security rationale and the goal of keeping HTML out of Python code — ecosystem packages would be back to generating HTML strings in Python and passing them through, which is exactly what Jinja was supposed to eliminate.

Why extensibility matters

@ilan-gold, you mentioned:

not having a good grasp on what the use-case is here in strong terms

Here are the concrete cases:

Discoverability of analysis results. Ecosystem tools store results across multiple AnnData slots, but there's no way for a user to see what was computed or which tool put it there — they just see generic arrays and columns. Our package kompot writes DE results to var, layers, and uns, but users have to know that the helper kompot.RunInfo(adata) exists to make sense of them. Extensibility solves this: kompot registers a TypeFormatter so that adata alone shows which analyses were run, their status, and which fields belong to which run — no separate helper needed.

Reusable components for MuData and SpatialData. Early in this PR, @Zethson asked for exactly this:

a canonical design and components that we could reuse for both MuData and SpatialData for an ideally consistent experience

The TypeFormatter/SectionFormatter API is the answer to that request. This is the same pattern bionty/lamindb would use for ontology annotations.

README rendering for collaborators. When sharing AnnData files between lab members, it's common to store a description in uns["README"]. Rendering it as formatted text means collaborators immediately understand what they're looking at. I've already made compromises here based on @flying-sheep's feedback, but it remains a motivating feature.

The extensibility API has been in this PR for months and is covered by 607 tests (108 adversarial). If there are specific maintainability or correctness concerns, I'd like to understand them so I can address them concretely. If you're concerned about API lock-in, Option B below keeps the API internal while preserving the architecture that makes it possible.

On Jinja and security

I understand this is framed primarily as a security question, and I want to engage with that directly. I evaluated template-based architectures early on and explained this reasoning in detail. Let me revisit it in light of the specific proposal.

The security argument for Jinja is: auto-escaping by default means a contributor can't accidentally forget to escape user data, preventing XSS from maliciously crafted AnnData files. That's a real concern, and I take it seriously. But let's be precise about the threat model and what Jinja actually changes.

The threat is narrow. The attack surface is: an attacker crafts an AnnData file with malicious strings (e.g., <script> in a column name), and the repr renders them as raw HTML in a Jupyter notebook. This produces XSS — not arbitrary code execution (the attacker already has that if they can get you to run their Python code). The risk is specifically that a contributor forgets to escape a string at one of the HTML insertion points, and that this gap isn't caught by CI.

Jinja's advantage is real but bounded. The failure mode asymmetry is genuine: forgetting html.escape() fails silently, forgetting |safe fails visibly (double-escaping). In a strings-only approach where all HTML is generated in templates, auto-escaping applies to every internal insertion point — that's a real improvement in default safety. But it doesn't eliminate the need for adversarial tests. With Jinja, we'd still need tests to verify that |safe/Markup() isn't used on untrusted data and that ecosystem extensions don't bypass escaping. With f-strings, we need tests to verify that every insertion point calls html.escape(). Either way, the safety guarantee comes from the test suite, not from the architecture. The 108 adversarial tests in test_repr_robustness.py already cover this systematically.

The cost is disproportionate to the security gain. This improvement in default escaping for internal rendering comes at the cost of: a new dependency, a cross-language boundary with unchecked contracts (see Maintenance cost at the boundary above), and structural barriers to extensibility (see What TypedDicts structurally prevent above). And for ecosystem extensions that produce custom visualizations, the escaping responsibility moves to third-party code — Jinja provides no safety improvement there.

Ecosystem extensions reintroduce the risk. If extensibility is supported, ecosystem packages would supply their own templates or generate HTML for custom visualizations. The escaping responsibility shifts to code outside anndata's control. But more fundamentally, ecosystem packages already run arbitrary Python in the user's process — a malicious or buggy package can execute code, access the filesystem, or exfiltrate data, none of which is constrained by HTML escaping. XSS in a formatter is a strictly lesser risk than what ecosystem code can already do. Jinja's auto-escaping on anndata's side doesn't change this threat model.

CSS injection isn't addressed by Jinja either. Category color values from adata.uns are inserted into CSS (style attributes), not HTML content — Jinja's auto-escaping doesn't cover this. A strings-only Jinja approach would have to either drop color features entirely or still rely on Python-side sanitization (sanitize_css_color()). This is arguably the trickier security surface in this PR, and it requires Python-level validation regardless of the rendering architecture.

On robustness and scope

@ilan-gold proposed:

a less-feature complete but more robust version

and

let's not bite off more than we need to here

I want to address both the scope concern and the robustness expectation.

On review burden: The PR is large, and I understand that reviewing +22K lines is daunting. As I broke down earlier, 41% of those lines are tests, 15% is the visual test harness, and 8% is static assets (CSS/JS). The actual source code is ~29% (~6.4K lines). Dropping extensibility (TypeFormatter/SectionFormatter and their tests) would genuinely reduce that — this is Option B below. I'm also open to splitting the PR or other changes that make the review tractable.

On robustness: The expectation of improved robustness from TypedDict + Jinja is misleading. The logic where robustness matters must be reimplemented in a crawl phase regardless (see above), and the boundary between crawl and template replaces a single-contract system with a dual-contract system — adding a maintenance surface, not removing one.

I agree that dropping the extensibility API reduces scope — that's Option B below, and I'm happy to go that route. But the robustness question remains: is TypedDict + Jinja more robust than f-strings for the code that stays? The internal formatting logic (type dispatch, FallbackFormatter, context-dependent decisions) is needed regardless of whether extensibility is public. And as argued above, moving the rendering layer to Jinja adds a dual-contract boundary rather than removing complexity.

For context on the rendering approach: xarray's repr uses f-strings and Jinja was never considered in that project's design discussion. Dask did migrate to Jinja (dask#8019), but for a different use case — Dask renders one known type per repr call (one template per type: array.html.j2, dataframe.html.j2), while anndata's repr discovers and renders many unknown types within a single section. That's structurally closer to xarray's challenge. @ilan-gold, given your experience with xarray's repr — do you see something in anndata's case that changes the calculus?

On JSON export

JSON export is a valuable goal and I'm in favor of it. But rather than motivating a Jinja migration, JSON export highlights the cost of TypedDict + Jinja.

As discussed in Maintenance cost at the boundary, TypedDict + Jinja creates a dual-contract system where the crawl phase produces unresolved data and the template handles nullable combinations. Adding a JSON consumer to the same TypedDict creates a triple-contract system — three sites implementing conditional logic for the same partial-failure scenarios, kept in sync manually. With the current system, adding JSON export means adding a serialization method to FormattedOutput that reads already-resolved fields. One contract, two output formats.

There's also a schema mismatch. The HTML path truncates and summarizes: max_items limits entries shown, max_categories limits categories expanded, max_lazy_categories controls what's loaded from backed AnnData. A JSON representation for structural comparison (#671) might want all keys without truncation. These are different contracts — the JSON TypedDict and the HTML TypedDict would diverge, giving you two schemas to maintain rather than one FormattedOutput with multiple output methods.

I raised several design questions in my earlier detailed response that I'd like to resolve before designing the schema:

What's the primary use case? Full structure comparison (Visualise/compare Anndata object structure #671) vs. truncated rendering view — these have fundamentally different contracts.
What about the *_html fields? FormattedOutput has fields like preview_html that carry type-specific pre-rendered HTML. A pure JSON schema would need to either drop these or include them as opaque strings.
How does this relate to PR feat: accessors #2290? @ilan-gold suggested building off the accessor JSON schema in feat: accessors #2290. The accessor schema describes AnnData's structural paths; a repr schema would need additional fields (formatting metadata, truncation, previews, warnings). Are these the same schema or complementary ones?

I'd appreciate engagement on these questions — they need to be resolved regardless of which rendering architecture we choose.

Path forward

I think there are three reasonable options for the rendering architecture, plus JSON export as a separate follow-up:

Option A: Merge with extensibility API. The TypeFormatter/SectionFormatter API ships as a public (or provisional) extension point. Ecosystem packages can register custom formatters from day one.

Option B: Merge without extensibility API. I remove register_formatter() and mark TypeFormatter/SectionFormatter as private. The internal architecture is unchanged — the same patterns are needed for anndata's own type dispatch — but no public contract is offered. Ecosystem extensibility can be added later by promoting the internal API; nothing about the architecture prevents it.

Option C: Adopt TypedDict + Jinja, strip extensibility. Replace FormattedOutput with TypedDicts and the rendering layer with Jinja templates. Drop the TypeFormatter/SectionFormatter API. The formatting logic (type dispatch, error recovery, context-dependent decisions, color lookups) stays in Python as a crawl phase. The tradeoffs are significant: the formatting logic doesn't shrink, a new cross-language maintenance boundary is created (see above), and extensibility becomes structurally harder to add later (see above).

JSON export can be added as a follow-up to any option above, once the design questions above are resolved.

My recommendation is A (or B as a compromise on review scope). I believe the current architecture provides the foundation for both extensibility and JSON export without the costs of a Jinja migration.

I've created a visual side-by-side comparison (gist source) showing what each approach can express for the features discussed above — basic layout, category colors, error recovery, ecosystem custom HTML, and the maintenance cost of adding JSON export.

I want to make sure we're making this decision on a shared understanding of the implementation. If there are specific parts of the code that feel hard to maintain or that raise security concerns, I'd welcome those pointers — they'd help me improve the implementation regardless of which direction we go.

JupyterLab strips <style> tags from untrusted notebooks (e.g. executed via nbconvert or transferred between users), leaving the HTML repr unstyled. Add a fallback div that is visible by default and hidden by CSS, showing a summary line and instructions to run `jupyter trust`.

repr(adata) can crash when aligned mappings contain objects with broken .shape properties, because _gen_repr accesses all mappings which triggers validation. This caused _repr_html_ to return None for adversarial AnnData objects (visible as "None" in test 24). Wrap the repr() call in a try/except with a simple shape-based fallback string.

SectionFormatter can now define render_html(obj, context) to produce custom HTML directly, bypassing the standard foldable <details> section. If render_html fails, falls back to get_entries gracefully. This enables compact inline representations (e.g., TreeData's label/alignment/allow_overlap as a single line like the X entry) alongside the standard entry-grid sections. Includes: - render_html support in _render_custom_section with fallback - TreeData visual test example showing both patterns - Unit tests for render_html, escaping, and crash fallback

Null bytes in user data (e.g., column name "null\x00byte") leaked through html.escape into the HTML output as literal \x00 bytes, breaking HTML parsers and causing truncated rendering in browsers. Replace null bytes with U+FFFD (Unicode replacement character) before escaping, per the HTML spec.

…notebooks) Use the xarray-style dual-representation pattern: emit a text <pre> fallback (visible by default) alongside the rich HTML (hidden via inline display:none). When CSS loads it flips visibility; when CSS is stripped the text repr shows.

Replace div cells with span cells so entries stay on one line without CSS. Add inline min-width via CSS custom variables for column alignment, monospace font fallback, comma-separated categories (hidden by CSS), and contextual hints for no-CSS and no-JS environments. - Entry cells: <div> → <span> with inline-block + min-width fallback - Category items: comma separators (hidden by CSS which uses margin) - Wrap buttons: hidden inline, shown only by JS overflow detection - Nested content: inline margin-left for indentation without CSS - No-CSS hint: visible by default, hidden by CSS - No-JS hint: hidden by default, shown by CSS :not(.anndata-repr--js), hidden again by JS init - CSS resets inline fallback styles on grid children (!important) - Drop text <pre> fallback — rich HTML now degrades well enough

Use anndata.utils.iter_outer from scverse#2372 as the canonical source for standard section iteration in the HTML repr. Drop the redundant SECTION_ORDER tuple; display order now follows iter_outer. - _render_all_sections iterates (name, elem) pairs from iter_outer and passes elem to downstream renderers, avoiding a second getattr (which would trigger another file open/close cycle on backed AnnData). - _render_dataframe_section, _render_mapping_section, _render_uns_section, _render_raw_section now take the elem directly. - _detect_unknown_sections and _get_custom_sections_by_position use a local STANDARD_SECTIONS frozenset for name-only membership checks so they don't pay iter_outer's per-yield I/O just to get names. - Drop unused adata param from _render_uns_entry. Display order changes to X, obs, var, obsm, varm, obsp, varp, layers, uns, raw (uns moves from position 4 to position 9).

Drop the STANDARD_SECTIONS frozenset (which mirrored iter_outer's internal name list). Instead, materialize iter_outer once at the top of _render_all_sections and reuse the collected names for the membership checks in _get_custom_sections_by_position and _detect_unknown_sections. Same in _collect_all_field_names: the names come from the iter_outer loop that was already running for column/key collection. Removes the maintenance burden of keeping a separate constant in sync with iter_outer's internal list.

katosh · 2026-04-15T19:38:59Z

Thanks for landing this @ilan-gold! Just adopted iter_outer as the section-iteration backbone. Replaced our hardcoded SECTION_ORDER tuple, the unknown-section detection, and the custom-section filter. Happy to have a single canonical source of truth for what counts as an outer element.

One small friction point worth sharing. The repr has a couple of name-only checks (filtering registered custom-section formatters, detecting unknown attributes that aren't standard sections) where we just need the set of canonical section names, not their values. We currently collect the names while iterating values for rendering, then thread that set through to the membership-check sites. It works, but it means we're either reconstructing the set per render, or carrying it around as a parameter.

It would be nicer if the canonical section names were exposed as a public constant alongside iter_outer, something like STANDARD_SECTIONS (or whatever shape fits). Benefits we'd get:

Membership checks ("is this attr a standard section?") become a one-liner without depending on having iterated iter_outer first.
Avoids the small cost of iter_outer's per-yield file-state restore on backed AnnData when we don't actually need the values.
Other consumers (write paths, ecosystem packages introspecting AnnData layout) get the same canonical reference without each maintaining their own copy.

Not a blocker, happy with the current state. Just flagging in case it fits with the follow-ups you have in mind.

flying-sheep · 2026-04-20T08:54:17Z

Hi! Responding to your Jinja comparison gist, please just read the basic documentation: https://jinja.palletsprojects.com/en/stable/templates/

Few of the things that you say are “Structurally prevented” are actually prevented in any way, e.g.

4. Ecosystem Custom HTML Structurally impossible without |safe.

I think it’s still not clear to you what MarkupSafe does. Or you know it but your LLM picks up obsolete text and you accept its output unquestioningly. In any case, custom HTML is very much possible, please stop going back to pretending it isn’t.

5. Per-Type Dispatch […] Jinja template blocks override entire sections […] Ecosystem packages would then need to extend{% elif %} chains, reimplementing the dispatch logic in Jinja syntax.

Not at all, we’re very free to design blocks however we like, e.g. very trivially you could extend a chain at the start (there are also other solution such as a nested block in the else branch):

parent.j2

{% block thing %}
{% if thing == "foo" %}
  foo
{% elif thing == "bar" %}
  bar
{% else %}
  baz
{% endif %}
{% endblock %}

child.j2

{% extends "parent.j2" %}
{% block thing %}
{% if thing == "spam" %}
  spam!
{% else %}
  {{ super() }}
{% endif %}
{% endblock %}

something else: why do you paste in the javascript in multiple places? Shouldn’t it be loaded once and reused?

katosh · 2026-04-20T22:04:18Z

Hi @flying-sheep, appreciate the engagement.

On MarkupSafe: Markup is the primary MarkupSafe mechanism; the gist was loose in calling |safe "the only escape hatch." That's a fair precision correction and the substantive argument below doesn't depend on it.

On custom HTML: my concern covers both bespoke visualizations and, more importantly, pre-existing HTML representations. Third-party scientific packages that define custom types typically already have _repr_html_ implementations (pandas, ete3, plotly, pygments, matplotlib, bokeh, bionty term viewers, TreeData tree renderers, and domain-specific packages written by individual scientists). The realistic extension pattern is Markup(obj._repr_html_()) at the boundary — reimplementing these visualizations as Jinja templates is a big ask for maintainers whose primary goal is their own scientific package, not integration with a specific template engine. Under that pattern, Jinja's safety benefit for third-party formatters reduces to auditable Markup assertions at integration boundaries rather than autoescape-by-default, and I haven't yet seen a counter-argument that doesn't reduce to the structural-impossibility framing we've already revised past.

To make the middle-ground concrete, I put up a small POC on our fork: settylab/anndata#8. It routes the top-level repr through one autoescape-enabled Jinja template and wraps existing formatter-produced fragments in markupsafe.Markup at the boundary.

Stepping back, the architectural choice hinges on requirements we haven't explicitly agreed on:

Safety against malicious .h5ad files at every user-data insertion point.
Extensibility for third-party packages with minimal authoring burden, ideally reusing existing _repr_html_.
Maintainability without cross-language contracts that can drift.
Dependency cost justified by concrete gains?
Contributor onboarding for Python developers without prior template-engine experience.

On JavaScript composition: you're right that repr.js is currently inlined per top-level repr in javascript.py, so N reprs on a page means N copies of the definition. The current design makes each cell self-sufficient because in Jupyter I can't rely on cell execution order, cells can be deleted or rerun, and on notebook reopen each cell's <script> re-runs independently — so any "load once, init many" pattern risks initialisation failures for reprs that are displayed before whichever cell happens to carry the definition. I considered alternatives (idempotency guard, data-URI <script src>, IPython.display.Javascript, build-time minification) but each regresses on at least one axis — file size, runtime cost, cross-context portability (static HTML export, untrusted notebooks, kernel restart), or added complexity — so I landed on the inlined approach. Is there a pattern you have in mind that avoids those trade-offs?

The Jinja question is forward-looking (reducing future XSS risk through default-safe idioms, type-level trust discipline) rather than correcting a present defect, so the trade-off depends on which requirements above weigh heaviest. Open to iterating on the POC or the current design.

flying-sheep · 2026-04-21T11:47:30Z

[…] calling |safe "the only escape hatch." That's a fair precision correction and the substantive argument below doesn't depend on it.

I disagree. I might have been unclear in why I brought up jinja but using it won’t and can’t make anything “structurally impossible”. To clarify: I simply consider jinja a better replacement for functions that are desigened to turn structured data into HTML, compared to simple functions that use Python string manipulation APIs. The approach “mark pieces of data as markup safe before passing it into a rendering black box” works better than having a bunch of parameters to a function, some of which contain valid markup while others contain data-derived raw strings that need to be escaped in order to not accidentally breaking page layout because there’s a stray “<”. Safety against malice isn’t the primary concern, it’s more about robustness, where safety is achieved along the way.

The realistic extension pattern is Markup(obj._repr_html_()) at the boundary

That’s a realistic pattern, but I think we’re not one one page about what “extending” means in the context of this PR. I mean by it that there are other packages building on anndata, which might want to both hook into the existing structure (e.g. add an attribute or so next to the others, but otherwise reuse the rest as-is) and also use the existing render machinery for stylistically integrated rendering of the new parts.

You’re talking about integrating 3rd party HTML representations while extending. _repr_html_/_repr_mimebundle_ isn’t built to be extensible at all, but many packages export only that one as public API to access a HTML representation, so using it while writing an extension for anndata’s HTML repr will probably be a common pattern. One example of prior art to make a public HTML repr api customizable is pandas’ “styler” object.

so N reprs on a page means N copies of the definition

Ah I saw it being called in multiple places and assumed it’d be included multiple times per repr. I think we can’t do better than one inclusion per repr. We can however improve runtime behavior – it could create a global API and reuse it if it finds it instead of creating a copy of the whole API.

Two related robustness fixes to the inlined repr JS: 1. `repr.js`: replace the last remaining `innerHTML = \`<h3 id="${modalTitleId}">…\`` in the README modal with plain DOM construction (`createElement`/`textContent`/`appendChild`). The literal `id="${modalTitleId}"` substring inside the inlined JS source was being regex-matched by the Jupyter-compatibility tests as a duplicate HTML ID attribute across cells. Using DOM APIs removes the problematic substring entirely and matches the surrounding code style. 2. `javascript.py`: wrap the per-container initialisation body in an `install-once` guard. Every cell still ships the full source so any cell stays self-sufficient across deletion, reorder, or notebook reopen, but only the first to execute actually installs `window.anndataRepr`; subsequent cells reuse the installed `init(container)`. Addresses the runtime-redundancy concern without giving up per-cell portability.

…hover Replace iter_outer as the section-iteration source in the repr, and derive the canonical section list from get_literal_members(AnnDataElem). Why not iter_outer: iter_outer yields (name, getattr(adata, name)) pairs, and propagates the first exception it hits — so a single broken section (corrupt aligned mapping, subclass with a crashing property, etc.) terminates the generator mid-iteration. _repr_html_'s top-level except catches that, the repr returns None, and the whole cell output disappears. The adversarial "Evil AnnData" case hit this. Iterating get_literal_members(AnnDataElem) ourselves and doing the getattr inside each section's try/except isolates the failure: a broken section renders as an error placeholder and the remaining sections still appear. iter_outer stays for callers that want strict semantics (AnnData.__str__, to_memory, _reduce, I/O). Also: use the same Literal as the single source of truth wherever the repr previously derived a set of section names from iter_outer (_detect_unknown_sections, _get_custom_sections_by_position). Those helpers now compute the set locally and no longer need the caller to thread it through. CSS fix: .anndata-entry:hover .anndata-entry__copy propagated the :hover state to every ancestor entry, revealing every ancestor's copy button when a deeply-nested row was hovered. Scope the trigger to the entry's own row: the entry itself for plain div rows (which never contain nested entries) and the <summary> for expandable rows (nested children live in .anndata-entry__nested-content, outside the summary).

katosh · 2026-04-21T22:55:47Z

On JS composition: taken. Just pushed an install-once guard.

On extensibility and the Styler analogy: useful pointer, and it maps cleanly onto the case you described. Packages that hook into anndata's structure (adding an attribute or section next to the others) and want stylistically integrated rendering of the new parts. The current design is already shaped similarly: TypeFormatter[T] plays Styler.format's role per-value, SectionFormatter is the table-level override, and FormattedOutput's facet fields (preview, summary, readme, type badge) stand in for apply()'s CSS-returning callables. Two intentional departures from Styler: registration is package-scoped via @register_formatter at import time (so bionty, treedata, spatialdata register once and every AnnData holding their types renders consistently) rather than instance-scoped df.style; and formatters return structured facets rather than CSS strings, closer to an nbconvert mime-bundle, so an extension opts into specific facets without owning cell layout.

On Jinja / the POC: the reframe is useful. Landing on robustness (one trust boundary + autoescape vs. threading mixed markup/data through ad-hoc escape_html()), with safety as a side effect, narrows the question. POC is up: settylab/anndata#9 · rendered preview. Minimal user-facing change. What building it surfaced: ~+500 lines net, templates split from the Python that produces their context, composition via Markup("\n").join(...) / macros through get_macros(), two new runtime deps. On the benefit side, autoescape-by-default is a structural guarantee where escape_html is reviewer discipline; with TestEscapingCoverage already asserting escaping at every user-data insertion point, the margin this adds sits narrow against the maintenance surface. The call is yours; the POC is there if you'd rather move forward with it.

katosh and others added 30 commits November 28, 2025 11:55

implement html representation

30a1e71

vizual inspection testing

5ce0afb

fix dark mode and nesting of htlm rep

9da45fe

handle disabled script in htlm rep

28292b9

more compact html rep

774c942

show categories in html rep

42ec6e6

dark mode and stability

73f0c5d

make max_cats configurable in html rep

292b4fc

test many cat and no JS for html rep

181b4d4

cnter folding icon in html rep

1dd4f18

max rows for counting n-unique in html rep

3db23cd

header coloring in html rep

d5974f6

max 20 categories in html rep

5cd1dd5

udpate many cats viz test of html rep

ef178c5

robust html rep for ad blocker

11949af

more tetsing of html rep

139f94d

future proof html rep

8a14312

htlm rep documentation

a64de45

show backed path inline in html rep

e7461f8

add custom uns rendering for html rep

966bb54

customizable section html rep

27b83f6

fix som html rep previews

bfe8221

better multi line categories in html rep

065eb2c

increase html rep testing

9505b63

formatt and style of html rep

8983df3

reduce complexity of html rep

bfe31eb

add "vart" to codespell's ignore-words-list

4b37eca

failed formatter wrnings in html

07632cf

explicit cleanup in html rep test

4c7ab7c

html rep aesthetics and formatting

c65952c

ilan-gold mentioned this pull request Mar 19, 2026

feat: AnnData.unwriteable based on AnnData._reduce + iter_outer + refactorings of other relevant functions #2372

Merged

5 tasks

katosh added 12 commits March 20, 2026 12:24

Merge remote-tracking branch 'origin/main' into html_rep

9f1f491

style: ruff format

5c9d47d

style: fix ruff EM101 string literal in exception

ecbb009

Merge remote-tracking branch 'origin/main' into html_rep

0a0c09a

scverse deleted a comment from azure-pipelines Bot Apr 20, 2026

style: some basic style

e141f4b

katosh mentioned this pull request Apr 20, 2026

POC: Jinja + Markup middle-ground for outer repr template settylab/anndata#8

Open

katosh mentioned this pull request Apr 21, 2026

Expose canonical section-name tuple alongside iter_outer #2401

Open

katosh mentioned this pull request Apr 21, 2026

Jinja + MarkupSafe adoption for AnnData._repr_html_ settylab/anndata#9

Open

katosh mentioned this pull request Apr 21, 2026

ci: broaden zarr UnstableSpecificationWarning filter to match Struct(…) repr #2403

Merged

2 tasks

Merge branch 'main' into html_rep

e486c3c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add HTML representation#2236

feat: Add HTML representation#2236
katosh wants to merge 211 commits intoscverse:mainfrom
settylab:html_rep

katosh commented Nov 29, 2025 •

edited by flying-sheep

Loading

Uh oh!

ilan-gold commented Feb 23, 2026

Uh oh!

katosh commented Feb 27, 2026 •

edited

Loading

Uh oh!

katosh commented Apr 15, 2026 •

edited

Loading

Uh oh!

flying-sheep commented Apr 20, 2026 •

edited

Loading

Uh oh!

katosh commented Apr 20, 2026

Uh oh!

flying-sheep commented Apr 21, 2026 •

edited

Loading

Uh oh!

katosh commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

katosh commented Nov 29, 2025 • edited by flying-sheep Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rich HTML representation for AnnData

Summary

Screenshot

Features

Interactive Display

Visual Indicators

Serialization Warnings

Compatibility

Extensibility

Testing

Related

Acknowledgments

Lazy Loading

Config Changes

Edit (Dec 27, 2024)

Edit (Jan 4, 2025)

Edit (Jan 6, 2025)

Edit (Jan 6, 2025 - continued)

Edit (Jan 7, 2025)

Edit (Jan 8, 2025)

Edit (Jan 9, 2025)

Edit (Jan 16, 2025)

Edit (Jan 26, 2025)

Already Handled

May Require Updates When Merged

Recommended Post-Merge Actions

Internal API Usage Inventory

Uh oh!

ilan-gold commented Feb 23, 2026

Uh oh!

katosh commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

The core tradeoff

How the current architecture works

What maps cleanly to TypedDict + Jinja

What a Jinja migration must reimplement

The maintenance cost at the boundary

What TypedDicts structurally prevent

Why extensibility matters

On Jinja and security

On robustness and scope

On JSON export

Path forward

Uh oh!

katosh commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

flying-sheep commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

katosh commented Apr 20, 2026

Uh oh!

flying-sheep commented Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

katosh commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

katosh commented Nov 29, 2025 •

edited by flying-sheep

Loading

katosh commented Feb 27, 2026 •

edited

Loading

katosh commented Apr 15, 2026 •

edited

Loading

flying-sheep commented Apr 20, 2026 •

edited

Loading

flying-sheep commented Apr 21, 2026 •

edited

Loading