HTML API: Roadmap #60397

dmsnell · 2024-04-02T22:29:22Z

⚠️ Note: This issue was created from the HTML API: Roadmap discussion.

See where this work fits in with Dennis' broad list of interesting things in #62437.

Proposed HTML Specification Changes

Untriaged plans.

Tasks

Bug fixes and quality

We need to defer applying enqueued edits as much as we can. When we removed that optimization in order to make the code simpler, we overlooked that on documents with many edits, this could lead to cataclysmic runtime overhead both in processing and memory. The workaround is to track edits externally and apply them all at once. The mechanism is that when applying an edit we copy the entire document, so if we have 50 edits we copy the entire document 50 times. With deferred updates, we only copy the entire document once when applying all of them in one go. [HTML API: Defer applying updates until necessary. wordpress-develop#6120]

Waiting review and merge

Future releases

Provide new filtering pipelines for final rendered HTML. Core-43258

WordPress 6.6

Safe get/set inner HTML
HTML Templating for safe HTML generation.
New render-pipeline filter replacing Core functionality:
- smilies/emojify
- capital_P_dangit
Core refactors for HTML processing
- wp_strip_all_tags()
- force_balance_tags() rewritten as "serialize this HTML"

WordPress 6.5

Support in the HTML Processor for most common tags IN BODY.
Scan all tokens in the Tag Processor to enable modifying HTML structure.
~~HTML Templating for safe HTML generation.~~
Establish test suite of real posts and websites against which to run the HTML Processor and report progress.
Run the html5lib tests against the HTML Processor in WordPress CI suite.

WordPress 6.4

Merged and bound for 6.4

In progress for WordPress 6.4

Support for elements in IN BODY mode.
HTML API: Only pass a single class name to add_class() wordpress-develop#5325
- this probably won't merge, but parts of it might go out. we want to remove code that's passing a plurality of things to a singularity of thing. the issue might seem pedantic, but this can cause defects in class handling and we need to find the right way to resolve it.

Plans for post-6.4 merges

Need to refactor the Tag Processor to think more about "tokens" than "tags" internally so that we can stop on comments and other non-tag tokens. This will not only support "funky comments" but is also necessary for work like the wp_strip_tags() and truncate_html() functionality, which needs to read plaintext content of markup (which needs to ignore comment and other meta content).
Focus on adding HTML templating so that the HTML API becomes useful for safe HTML generation in all the places we're currently forgetting to escape attributes and the like.

PRs to revisit

Search block: refactor to use HTML Tag Processor #51273 - see if we can unwind the unsafe concatenation of HTML inside the constructor
behaviors/lightbox - see if we can remove using multiple instances

Areas of active exploration

Expose "original raw tag" for backwards compatibility with filters that expect spans of the HTML document from functions such as wp_kses_hair(). [HTML API: Expose raw tag markup to support existing filters wordpress-develop#5143]
- This is probably best left for an internal Core class, which is also needed for several of Core's cleanup tasks, tasks that need to examine raw markup and avoid making needless changes.
Add set_raw_inner_markup() and get_raw_inner_markup() (or not, if it's not the right interface). [HTML API: add get/set inner/outer markup wordpress-develop#4956]
A new wp_strip_tags() function/approach that only parses as much HTML as is necessary. [WIP: HTML API: Extract previous text and HTML chunks while processing. wordpress-develop#5208]
Allow extending the input document for more strict streaming work. [WIP: HTML API: Allow extending input document for chunked processing. wordpress-develop#5050]

HTML Templating

Provide a means to generate HTML conveniently with placeholders. The placeholders should be "funky comments" that mirror array values passed in to the rendering function. This will/should form the basis for raw HTML templating, replacing inner contents, powering Bits so that we can apply heuristics to the replacement markup, and more.

The text was updated successfully, but these errors were encountered:

This was referenced Apr 2, 2024

HTML API: Plans for 6.7 #60396

Open

HTML API: Plans for WP 6.6 #60324

Closed

dmsnell added [Feature] Block API API that allows to express the block paradigm. [Type] Tracking Issue Tactical breakdown of efforts across the codebase and/or tied to Overview issues. [Feature] HTML API An API for updating HTML attributes in markup labels Apr 2, 2024

dmsnell mentioned this issue May 27, 2024

Ensure valid HTML is properly processed by refining regex handling WordPress/wordpress-develop#5697

Closed

dmsnell mentioned this issue Jun 10, 2024

Dennis' list of broad and interesting things. #62437

Open

dmsnell mentioned this issue Jul 1, 2024

HTML API: Plans for 6.8 #63037

Open

11 tasks

dmsnell mentioned this issue Aug 26, 2024

XML API: Roadmap #64808

Open

gziolo mentioned this issue Oct 10, 2024

Block API #41236

Open

67 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HTML API: Roadmap #60397

HTML API: Roadmap #60397

dmsnell commented Apr 2, 2024 •

edited

Loading

Merged and bound for 6.4

In progress for WordPress 6.4

Plans for post-6.4 merges

HTML API: Roadmap #60397

HTML API: Roadmap #60397

Comments

dmsnell commented Apr 2, 2024 • edited Loading

Proposed HTML Specification Changes

Related

Untriaged plans.

Tasks

Bug fixes and quality

Waiting review and merge

Merged and bound for 6.4

In progress for WordPress 6.4

Plans for post-6.4 merges

PRs to revisit

Areas of active exploration

HTML Templating

dmsnell commented Apr 2, 2024 •

edited

Loading