HTML Editing APIs

Equivalent values: Either both strings are valid CSS colors and have the same red, green, blue, and alpha components, or neither string is a valid CSS color.

The hiliteColor command

IE 9 RC doesn't support this. It uses backColor instead, but Gecko and Opera treat that differently, while all non-IE browsers treat hiliteColor the same, so I'm standardizing hiliteColor as the way to highlight text.

This is slightly tricky, because background-color does different things on block and inline elements. Given the name ("hiliteColor"), we really only want to apply it to inline elements. This is how everyone but Gecko behaves, but Gecko sometimes applies it to blocks too. WebKit doesn't set it on non-inline elements, but does clear it and push it down from them.

The spec doesn't do any of these: background-color on non-inline elements is not touched by hiliteColor, neither created nor removed. If users want to remove the style, they need to use removeFormat. Adding it usually makes no sense; see the comment for backColor.

For color parsing, see the comment for foreColor.

See bug 13829.

For indeterminacy, this follows no one. Firefox 6.0a2 and Chrome 14 dev both always return false. However, the spec makes sense, since it's consistent with other commands.

Equivalent values: Either both strings are valid CSS colors and have the same red, green, blue, and alpha components, or neither string is a valid CSS color.

The italic command

The removeFormat command

See bug, and also research by Ryosuke for WebKit.

Tested in IE 9, Firefox 4.0, Chrome 12 dev, Opera 11.00.

Tags stripped by everyone: b big cite code dfn em font i ins kbd s samp small strike strong sub sup tt u var
Tags left alone by everyone: br hr img
Unrecognized elements: stripped by Firefox and Opera, left alone by IE and Chrome.
blink: stripped only by IE
abbr: stripped only by Firefox
a, wbr: stripped only by Opera
nobr: left alone only by Firefox
acronym, bdo, q: left alone only by Opera
bdi, del, mark, span, svg: treated the same as unknown elements

All elements whose default rendering is display: block are left untouched by all browsers (although IE seems to throw an exception on for some reason).
It's not clear to me why we should leave alone, but everyone but Opera does. In OpenOffice.org 3.2.1, doing "Default Formatting (Ctrl+M)" doesn't remove links. In Microsoft Word 2007, doing "Clear Formatting" also doesn't remove links. Verdict: don't remove links. Apparently they don't logically qualify as "formatting".
Conclusion: IE/WebKit is a solid majority by market share and they're closely interoperable, since WebKit copied IE here. Also, it makes more sense to assume that unrecognized elements don't represent any kind of inline formatting, i.e., have a blacklist of elements to remove instead of a whitelist to keep. Thus I remove more or less the same things as IE/WebKit.
I remove blink because IE does it and it makes sense, although Chrome doesn't; I remove abbr although only Firefox does, for consistency with acronym; and I remove bdi and mark because they're evidently left alone only because they're unrecognized. Finally, I remove span because otherwise, something like will be left intact, which isn't expected and matches no browser except IE. (Chrome doesn't remove spans in general, but it does remove spans with style attributes, or something like that.)
Browsers will split up all these inline elements if the selection is contained within them. Opera does strip unrecognized elements with display: block if they're within the selection, but doesn't split them up if they contain the selection.
Chrome 14 dev removes style attributes from every element in the range, but IE10PP2, Firefox 7.0a2, and Opera 11.50 do not, so I go with them. As noted above, this means I need to remove spans. I could conceivably change to remove only spans with style attributes, but it doesn't seem worth it: I'll just match Gecko.
TODO: This has to be kept in sync when new HTML elements are added. I need to figure out some way of coordinating this.

A removeFormat candidate is an editable HTML element with local name "abbr", "acronym", "b", "bdi", "bdo", "big", "blink", "cite", "code", "dfn", "em", "font", "i", "ins", "kbd", "mark", "nobr", "q", "s", "samp", "small", "span", "strike", "strong", "sub", "sup", "tt", "u", or "var".
Action:

Let elements to remove be a list of every removeFormat candidate effectively contained in the active range.
For each element in elements to remove:

While element has children, insert the first child of element into the parent of element immediately before element, preserving ranges.
Remove element from its parent.

The last sentence just prettifies the resulting range a bit.
If the active range's start node is an editable Text node, and its start offset is neither zero nor its start node's length, call splitText() on the active range's start node, with argument equal to the active range's start offset. Then set the active range's start node to the result, and its start offset to zero.
If the active range's end node is an editable Text node, and its end offset is neither zero nor its end node's length, call splitText() on the active range's end node, with argument equal to the active range's end offset.
Let node list consist of all editable nodes effectively contained in the active range.

TODO: Splitting the parent is really a block algorithm. It's not clear whether it's desirable to use for inline nodes. Perhaps it's okay, but it makes me a little uneasy.
For each node in node list, while node's parent is a removeFormat candidate in the same editing host as node, split the parent of the one-node list consisting of node.

This step is for cases like
foo[bar]baz
, where splitting/removing tags won't help. We don't need to run superscript, since subscript does the same thing here. We run subscript first so _{/^{won't upset
fontSize.

For each of the entries in the following list, in the given order,
set the selection's value to null, with command as
given.

subscript

bold

fontName

fontSize

foreColor

hiliteColor

italic

strikethrough

underline}}
Return true.

The strikethrough command

TODO: See underline TODO.
Action: If queryCommandState("strikethrough") returns true, set the selection's value to null. Otherwise set the selection's value to "line-through". Either way, return true.
Inline command activated values: "line-through"
The subscript command

Action:

Call queryCommandState("subscript"), and let state be the result.
Set the selection's value to null.
If state is false, set the selection's value to "subscript".
Return true.

Indeterminate: True if either among formattable nodes that are effectively contained in the active range, there is at least one with effective command value "subscript" and at least one with some other effective command value; or if there is some formattable node effectively contained in the active range with effective command value "mixed". Otherwise false.
For ^_foo, Firefox 6.0a2 and Opera 11.11 say the state is true for both superscript and subscript, and indeterminate is false; Chrome 14 dev says it's true for subscript but not superscript, and indeterminate is false. We follow neither of these behaviors: we return false for both states, and say indeterminate is true. The reason is because we want to return true for a state if we'll do nothing, false if we'll do something; and if we have nesting like this, we'll always do something, namely get rid of all those ancestors and replace them with a single tag. This matches what happens in other indeterminate situations, so it's fair to consider it indeterminate.
Inline command activated values: "subscript"
The superscript command

Action:

Call queryCommandState("superscript"), and let state be the result.
Set the selection's value to null.
If state is false, set the selection's value to "superscript".
Return true.

Indeterminate: True if either among formattable nodes that are effectively contained in the active range, there is at least one with effective command value "superscript" and at least one with some other effective command value; or if there is some formattable node effectively contained in the active range with effective command value "mixed". Otherwise false.
Inline command activated values: "superscript"
The underline command

TODO: There are a lot of problems with underline color and thickness, because text-decoration in CSS is horrible. These aren't prohibitive for normal use and existing browsers don't handle them either, so fixing these problems or working around them can be put off for now.

Pushing down underlines can change their color, since the color of an underline follows the color of the element where it's declared instead of the text it's drawn under. This could be fixed by adding a special case for this condition and inserting extra color rules, such as by setting a color on the underlining element and then having another element inside it that resets the color. Horrible, but that's text-decoration for you. Alternatively, the new text-decoration-color property in the CSS 3 Text draft could come in handy here, in which case we'd degrade pretty gracefully in legacy UAs.
Underline thickness depends on font-size in all rendering engines but WebKit, so pushing them down creates thickness problems as well as color problems. Working around this is a similar story to the previous, except we have no text-decoration-width property yet (see www-style post).
The preceding two points can't be avoided, because the only way to remove underlines in CSS is to push down styles (unlike most other things where you could override it). Recent (February 2011) CSS 3 Text drafts have added support for a "text-decoration-line: cancel-underline" property, but we can only use that if there's no other possibility, since it won't work in legacy browsers. (Although we should use it once there's no other possibility.)
More generally, from a user's perspective, color and thickness of underlines is going to be more or less random if they're applying them to text with varying size or color. If they underline a bunch of text all at once, it will all get the same color/thickness, probably. But if they underline letter-by-letter, it probably will vary. But sometimes when they underline a bunch of text at once it will also vary, if the algorithm decides to create multiple elements for whatever reason (like an intervening unwrappable node). This is unlikely to match user expectations. There's not much we can do about this without entirely revamping text-decoration, so we'll have to live with it.
Currently we don't treat non-underline text-decorations properly, because we have no way to set (or cancel) underlines independently of other text-decorations from within CSS. I've sent feedback to www-style.

Action: If queryCommandState("underline") returns true, set the selection's value to null. Otherwise set the selection's value to "underline". Either way, return true.
Inline command activated values: "underline"
The unlink command

IE 9 RC unlinks the whole link you're pointing at, while others only unlink the current text. The latter behavior seems less expected, as with createLink, although I can't articulate precisely why. Word 2007 and OpenOffice.org 3.2.1 (Ubuntu) seem to give an option to remove the whole link or none of it, which backs the spec's requirement. See also #whatwg logs starting at 2011-05-13 at 16:53 EDT (UTC-0400).
See comment for the createLink command about indeterm/state/value.

Action:

Let hyperlinks be a list of every a element that has an href attribute and is contained in the active range or is an ancestor of one of its boundary points.
Clear the value of each member of hyperlinks.
Return true.

Block formatting commands

Block formatting command definitions

An indentation element is either a blockquote, or a div that has a style attribute that sets "margin" or some subproperty of it.
We need to allow stuff that sets border/padding because WebKit (Chrome 12 dev) sets "border: none; padding: 0px" when indenting. We need to allow stuff that sets classes because WebKit sets class="webkit-indent-blockquote". We need to allow stuff that sets dir because IE9 does. The criteria could probably be tightened up a bit to reduce false positives, but it'll do for now.
A simple indentation element is an indentation element that has no attributes except possibly

a style attribute that sets no properties other than "margin", "border", "padding", or subproperties of those; and/or
a dir attribute.

The notions of indentation element and simple indentation element parallel those of modifiable element and simple modifiable element.
listing and xmp are included because otherwise insertParagraph inside them won't work, since paragraphs aren't an allowed child.
A non-list single-line container is an HTML element with local name "address", "div", "h1", "h2", "h3", "h4", "h5", "h6", "listing", "p", "pre", or "xmp".
A single-line container is either a non-list single-line container, or an HTML element with local name "li", "dt", or "dd".
The block node of a node node is either a block node or null, as returned by the following algorithm:

While node is an inline node, set node to its parent.
Return node.

Bug 14062. See also Mozilla bug 590640, specifically comments 48 and on.
If a command preserves overrides, then before taking its action, the user agent must record current overrides. After taking the action, if the active range is collapsed, it must restore states and values from the recorded list.
All block commands preserve overrides except the insertText command, which treats overrides specially.
Assorted block formatting command algorithms

TODO: When breaking a non-inline element out of an inline element, like p in b or whatever, it would make sense to re-wrap the contents in the inline tag.
To fix disallowed ancestors of node:
We often run this algorithm after we move a node someplace, just in case it wound up somewhere it's not supposed to be. This avoids things like unserializable DOMs, blocks nested inside inlines, etc.

If node is not editable, abort these steps.

This case is really intended to handle stuff like list items or table cells that wander outside their proper place. We generally convert them into ps.
If node is not an allowed child of any of its ancestors in the same editing host:

If node is a dd or dt, wrap the one-node list consisting of node, with sibling criteria returning true for any dl with no attributes and false otherwise, and new parent instructions returning the result of calling createElement("dl") on the context object. Then abort these steps.

There's no reason to change the node to a paragraph if that won't make it an allowed child anyway.
If "p" is not an allowed child of the editing host of node, abort these steps.
If node is not a prohibited paragraph child, abort these steps.
Set the tag name of node to the default single-line container name, and let node be the result.

Because maybe it somehow wound up as the child of a p, like via insertHTML.
Fix disallowed ancestors of node.
Let children be node's children.
For each child in children, if child is a prohibited paragraph child:

Record the values of the one-node list consisting of child, and let values be the result.
Split the parent of the one-node list consisting of child.
Restore the values from values.

Abort these steps.

Record the values of the one-node list consisting of node, and let values be the result.
While node is not an allowed child of its parent, split the parent of the one-node list consisting of node.
Restore the values from values.

This algorithm implies that we don't support a sublist in the middle of an item, only at the end. For instance,
foo
...
bar

gets transformed to
foo
...
bar

which in particular creates an extra list marker for "bar". This is okay; we don't need to expose all of HTML's markup abilities through execCommand(). Similarly, the superscript and subscript commands don't allow nesting. I didn't see any way to get a sublist in the middle of an item in Word 2007 or in OpenOffice.org 3.2.1 Ubuntu package, nor in any browser using just execCommand(), so it should be no big problem if we require that such nesting not occur. (Existing browsers behave weirdly and inconsistently when confronted with this kind of nesting.)
The reason we need this is that otherwise it gets very confusing to figure out what happens in cases like trying to outdent
[foo
bar]
baz

If we first normalize, then the natural answer is something like
[foo
bar]
baz

but if we don't, we'd have to special-case in the toggle lists and outdent algorithms. This might be worthwhile, but it's not at all clear, and what I have works okay, so I'll stick with it for now.
TODO: Investigate fixing this.

To normalize sublists in a node item:

If item is not an li or it is not editable or its parent is not editable, abort these steps.
Let new item be null.
While item has an ol or ul child:

Let child be the last child of item.
If child is an ol or ul, or new item is null and child is a Text node whose data consists of zero of more space characters:

Set new item to null.
Insert child into the parent of item immediately following item, preserving ranges.

Otherwise:

If new item is null, let new item be the result of calling createElement("li") on the ownerDocument of item, then insert new item into the parent of item immediately after item.
Insert child into new item as its first child, preserving ranges.

The selection's list state is returned by the following algorithm:

This is just a helper to tell the state and indeterminacy of the insertOrderedList command and the insertUnorderedList command:

ol indeterm ol state ul indeterm ul state
ol false true false false
ul false false false true
mixed true false true false
mixed ol true false false false
mixed ul false false true false
none false false false false

If the active range is null, return "none".
Block-extend the active range, and let new range be the result.
Let node list be a list of nodes, initially empty.
For each node contained in new range, append node to node list if the last member of node list (if any) is not an ancestor of node; node is editable; node is not an indentation element; and node is either an ol or ul, or the child of an ol or ul, or an allowed child of "li".
If node list is empty, return "none".

The child-of-child case is necessary right now because of the following:
[foo
bar]
baz

With the current (July 2011) block-extend algorithm, this will become:
{
foo
bar
}baz

because of the magical li handling in block-extend. We want this to register as ol, because after normalizing sublists it will become
{
foo
bar
}
baz

But the text node "foo" will wind up in node list, and is not the child of an ol. This is all very messy and has to do with questionable decisions about how to handle nested lists.

If every member of node list is either an ol or the child of an ol or the child of an li child of an ol, and none is a ul or an ancestor of a ul, return "ol".

This condition and the last are mutually exclusive, so the order is actually irrelevant. Clearly they could only both hold if no member of node list is an ol or ul, so if they both held, every member would have to be either the child of an ol and of a ul, or of an ol and an li, or a ul and an li, or of an li that's the child of both an ol and a ul. This is impossible unless the list is empty, in which case we already aborted.
If every member of node list is either a ul or the child of a ul or the child of an li child of a ul, and none is an ol or an ancestor of an ol, return "ul".
If some member of node list is either an ol or the child or ancestor of an ol or the child of an li child of an ol, and some member of node list is either a ul or the child or ancestor of a ul or the child of an li child of a ul, return "mixed".
If some member of node list is either an ol or the child or ancestor of an ol or the child of an li child of an ol, return "mixed ol".
If some member of node list is either a ul or the child or ancestor of a ul or the child of an li child of a ul, return "mixed ul".
Return "none".

When querying the value of justify*, IE9 seems to return boolean false across the board when it doesn't throw exceptions, which it usually does in my tests. Chrome 14 dev returns the string "true" or "false" depending on state, as in other cases, which is useless. Opera 11.11 returns "" across the board. Firefox 6.0a2 behaves like with other command values: it returns "center"/"justify"/"left"/"right" depending on the active range's start node. Since this is the only behavior that's possibly useful, it's what I specced. Firefox ties the value closely to the state, returning true for the state if and only if the value matches the desired value, but this seems less useful than what I've specced for the state.
This API is based on the four-state text-align of CSS 2.1. We do some crude mapping to make it not break too badly with CSS3 values, but it's not going to work well given the design of the API.

The alignment value of a node node is returned by the following algorithm:
This is basically like the resolved value of text-align, but with two key differences. First, it only ever evaluates to center/justify/left/right, since that's the model that the justify commands work with. Second, it ignores inline elements, because text-align has no effect on them and their alignment is actually governed by their nearest block ancestor (if any).

While node is neither null nor an Element, or it is an Element but its "display" property has resolved value "inline" or "none", set node to its parent.

This means there's no applicable style rule, so probably it will wind up left-aligned. Of course this ignores the fact that the alignment will really be "start", so this is wrong for RTL, but it's a pretty marginal corner case anyway. (It will only happen if, e.g., everything up to and including the html and body elements have display: inline or none.)
If node is not an Element, return "left".
If node's "text-align" property has resolved value "start", return "left" if the directionality of node is "ltr", "right" if it is "rtl".
If node's "text-align" property has resolved value "end", return "right" if the directionality of node is "ltr", "left" if it is "rtl".
If node's "text-align" property has resolved value "center", "justify", "left", or "right", return that value.
Return "left".

Sometimes one location corresponds to multiple distinct boundary points. For instance, in the DOM
Hello
, a boundary point might lie at the beginning of the text node or the beginning of the element node, but these don't logically differ much and will appear the same to the user, so we often want to treat them the same. The algorithms here allow navigating through such equivalent boundary points, for when we want to make the selection as inclusive or exclusive as possible. For deletion, we want to delete as few nodes as possible, so we move the start node forward and the end node backward. In other cases we might do the reverse, expanding the selection. In still other cases we might want to move forward or backward to try getting to a text node.
Given a boundary point (node, offset), the next equivalent point is either a boundary point or null, as returned by the following algorithm:

If node's length is zero, return null.
We don't want to move into or out of zero-length nodes, because that would move us straight through them. For instance, if {} were equivalent to {}, it would also be equivalent to {}. This produces very unexpected results for nodes like .
If offset is node's length, and node's parent is not null, and node is an inline node, return (node's parent, 1 + node's index).
For instance, foo[] is equivalent to foo{}, which is equivalent to foo{}. However, foo{} is not equivalent to foo {} – the cursor might look like it's in a visibly different position.
If node has a child with index offset, and that child's length is not zero, and that child is an inline node, return (that child, 0).
For instance, {}foo is equivalent to {}foo, which is equivalent to []foo. As noted before, though, we don't descend into empty nodes. And again, {}foo is different from {}foo.
Return null.

Given a boundary point (node, offset), the previous equivalent point is either a boundary point or null, as returned by the following algorithm:

If node's length is zero, return null.
If offset is 0, and node's parent is not null, and node is an inline node, return (node's parent, node's index).
If node has a child with index offset − 1, and that child's length is not zero, and that child is an inline node, return (that child, that child's length).
Return null.

The first equivalent point of a boundary point (node, offset) is returned by the following algorithm:

While (node, offset)'s previous equivalent point is not null, set (node, offset) to its previous equivalent point.
Return (node, offset).

The last equivalent point of a boundary point (node, offset) is returned by the following algorithm:

While (node, offset)'s next equivalent point is not null, set (node, offset) to its next equivalent point.
Return (node, offset).

Block-extending a range

A boundary point (node, offset) is a block start point if either node's parent is null and offset is zero; or node has a child with index offset − 1, and that child is either a visible block node or a visible br.
A boundary point (node, offset) is a block end point if either node's parent is null and offset is node's length; or node has a child with index offset, and that child is a visible block node.
A boundary point is a block boundary point if it is either a block start point or a block end point.
When a user agent is to block-extend a range range, it must run the following steps:

Generally, block commands work on any block that contains part of the selection, even if the selection doesn't include the whole block. This algorithm takes an input range, copies it, stretches out the copy to contain entire blocks, and returns the result. Then the caller will normally use it instead of the range it started with. For instance, if the cursor is collapsed in a text node inside a paragraph, this will generally return a range that includes the whole paragraph.
Two bits of magic worth noting. First, counts as a block delimiter here, since it looks the same as a block boundary (assuming no margin etc.) and this is a visual API. We include the as part of the line that precedes it. Second, if the selection is inside an
, this will extend it to include the whole
. This latter point is weird, and I should re-examine it sometime, but it seems to work.

Let start node, start offset, end node, and end offset be the start and end nodes and offsets of range.
If some inclusive ancestor of start node is an li, set start offset to the index of the last such li in tree order, and set start node to that li's parent.
If (start node, start offset) is not a block start point, repeat the following steps:

If start offset is zero, set it to start node's index, then set start node to its parent.
Otherwise, subtract one from start offset.
If (start node, start offset) is a block boundary point, break from this loop.

This just changes something like
{foo]
to {foo].
While start offset is zero and start node's parent is not null, set start offset to start node's index, then set start node to its parent.
If some inclusive ancestor of end node is an li, set end offset to one plus the index of the last such li in tree order, and set end node to that li's parent.
If (end node, end offset) is not a block end point, repeat the following steps:

If end offset is end node's length, set it to one plus end node's index, then set end node to its parent.
Otherwise, add one to end offset.
If (end node, end offset) is a block boundary point, break from this loop.

While end offset is end node's length and end node's parent is not null, set end offset to one plus end node's index, then set end node to its parent.
Let new range be a new range whose start and end nodes and offsets are start node, start offset, end node, and end offset.
Return new range.

A node node follows a line break if the following algorithm returns true:

Let offset be zero.
While (node, offset) is not a block boundary point:

If node has a visible child with index offset minus one, return false.
If offset is zero or node has no children, set offset to node's index, then set node to its parent.
Otherwise, set node to its child with index offset minus one, then set offset to node's length.

Return true.

A node node precedes a line break if the following algorithm returns true:

Let offset be node's length.
While (node, offset) is not a block boundary point:

If node has a visible child with index offset, return false.
If offset is node's length or node has no children, set offset to one plus node's index, then set node to its parent.
Otherwise, set node to its child with index offset and set offset to zero.

Return true.

Recording and restoring overrides

To record current overrides:

Let overrides be a list of (string, string or boolean) ordered pairs, initially empty.

When restoring, some commands can interfere with others. Specifically, we want to restore createLink before foreColor and underline, and subscript and superscript before fontSize. TODO: This approach only works for default styles (although I'm not sure offhand how we could handle non-default styles in principle).
Firefox 7.0a2 and Opera 11.50 don't honor createLink with collapsed selections. If you insert text, it's not linked. The spec follows Chrome 14 dev. IE9 also ignores createLink with collapsed selections, but its behavior in other cases for collapsed selections is totally different from all other browsers, so it's not a fair comparison.

If there is a value override for "createLink", add ("createLink", value override for "createLink") to overrides.

Firefox 7.0a2 and Opera 11.50 will honor repeated subscript/superscript commands on a collapsed selection, allowing you to nest them. The spec follows the general philosophy that we don't allow users to nest subscript/superscript, so the last one wins. Chrome 14 dev is similar to the spec.
For each command in the list "bold", "italic", "strikethrough", "subscript", "superscript", "underline", in order: if there is a state override for command, add (command, command's state override) to overrides.
For each command in the list "fontName", "fontSize", "foreColor", "hiliteColor", in order: if there is a value override for command, add (command, command's value override) to overrides.
Return overrides.

To record current states and values:

Let overrides be a list of (string, string or boolean) ordered pairs, initially empty.
Let node be the first formattable node effectively contained in the active range, or null if there is none.
If node is null, return overrides.
Add ("createLink", node's effective command value for "createLink") to overrides.

Thus we will set state overrides based on the first formattable node, to match values. This means that if you have
foo[barbaz]
and hit backspace and hit A, you'll get fooa[], although bold was previously indeterminate. This is needed to match the behavior of hitting A straight away, since innerText doesn't strip wrappers when it invokes "delete the contents".
For each command in the list "bold", "italic", "strikethrough", "subscript", "superscript", "underline", in order: if node's effective command value for command is one of its inline command activated values, add (command, true) to overrides, and otherwise add (command, false) to overrides.
For each command in the list "fontName", "foreColor", "hiliteColor", in order: add (command, command's value) to overrides.

Special case for fontSize, because its values are weird.
Add ("fontSize", node's effective command value for "fontSize") to overrides.
This is wrong: it will convert non-pixel sizes to pixel sizes. But I don't see any way to avoid it. Hopefully it won't come up too often. font-size is a real problem, because the mapping from specified value to computed value is lossy and not fully defined (e.g., how many px is "small"?).
Return overrides.

To restore states and values specified by a list overrides returned by the record current overrides or record current states and values algorithm:

Let node be the first formattable node effectively contained in the active range, or null if there is none.
If node is not null, then for each (command, override) pair in overrides, in order:

If override is a boolean, and queryCommandState(command) returns something different from override, take the action for command, with value equal to the empty string.
Otherwise, if override is a string, and command is neither "createLink" nor "fontSize", and queryCommandValue(command) returns something not equivalent to override, take the action for command, with value equal to override.

This special case is needed because createLink has no value.
Otherwise, if override is a string; and command is "createLink"; and either there is a value override for "createLink" that is not equal to override, or there is no value override for "createLink" and node's effective command value for "createLink" is not equal to override: take the action for "createLink", with value equal to override.

The override will be some CSS value, so we have to convert it to a legacy font size.
Otherwise, if override is a string; and command is "fontSize"; and either there is a value override for "fontSize" that is not equal to override, or there is no value override for "fontSize" and node's effective command value for "fontSize" is not loosely equivalent to override:

Convert override to an integer number of pixels, and set override to the legacy font size for the result.
Take the action for "fontSize", with value equal to override.

Otherwise, continue this loop from the beginning.

If we took the action for a command, we need to reset node, because it might have changed. For instance, if the selection was foo[bar]baz, the text node could have been split so that the first part is now outside the active range.
Set node to the first formattable node effectively contained in the active range, if there is one.

Otherwise, for each (command, override) pair in overrides, in order:

If override is a boolean, set the state override for command to override.
If override is a string, set the value override for command to override.

Deleting the selection

TODO: Consider what should happen for block merging in corner cases like display: inline-table.
To delete the selection, given a block merging flag that defaults to true, a strip wrappers flag that defaults to true, and a string direction that defaults to "forward":

The idea behind this algorithm is self-explanatory, but the details wind up being remarkably complicated.
First, any editable nodes inside the selection will be deleted, and the selection will be collapsed. By way of contrast, effectively contained tries to expand the range to include as much as possible, so
[foo]
contains the
. What we do here is contract the range to include as little as possible, so {
foo
} contains only foo and doesn't delete the paragraph.
After that, if the selection originally started and ended in different blocks, and the block merging flag is true, the end block will get merged into the start block. This is needed so if the user selects text on several lines and deletes it, the text immediately that was before the selection winds up on the same line as the text immediately after it. For example,
fo[o
b]ar
becomes fo[]ar. This procedure winds up being tricky, and takes up a large chunk of the logic.
Tables are a notable special case. If an entire table is contained in the range, it will be deleted. If it's anything less, only the contents of the cells will be deleted and the table structure will be left intact.
The strip wrappers flag controls what happens if the deletion removes all the contents of an inline element. If wrappers are being stripped, the empty inline element will be removed: this is usually what you want, because the user can't position the selection inside it. But callers like the insertText command that intend to immediately insert new contents want to leave the wrappers, so the new contents are wrapped by the same thing as the old.
Even if strip wrappers is true, the algorithm will set a state override and value override for any styles it winds up removing. This way, if the user deletes a wrapper that adds a style (or link for that matter), then types something, the new text will get the style from the old text.

If the active range is null, abort these steps and do nothing.
Canonicalize whitespace at the active range's start.
Canonicalize whitespace at the active range's end.
Let (start node, start offset) be the last equivalent point for the active range's start.
Let (end node, end offset) be the first equivalent point for the active range's end.
If (end node, end offset) is not after (start node, start offset):
This is a selection like foo[]bar, where the boundary points are equivalent but not identical. We just collapse it and abort, since there's nothing to delete.

If direction is "forward", call collapseToStart() on the context object's selection.
Otherwise, call collapseToEnd() on the context object's selection.
Abort these steps.

If start node is a Text node and start offset is 0, set start offset to the index of start node, then set start node to its parent.
If end node is a Text node and end offset is its length, set end offset to one plus the index of end node, then set end node to its parent.
The previous two steps are so that we won't leave empty text nodes anywhere.
Call collapse(start node, start offset) on the context object's selection.
Call extend(end node, end offset) on the context object's selection.

When we delete a selection that spans multiple blocks, we merge the end block's contents into the start block, like
fo[o
b]ar
->
fo[]ar
.

We figure out what the start and end blocks are before we start deleting anything.
Let start block be the active range's start node.
While start block's parent is in the same editing host and start block is an inline node, set start block to its parent.

We only merge to or from block nodes or editing hosts. (This is just in case someone makes a span into an editing host and sticks paragraphs inside it or something . . . we could probably drop that proviso.) Firefox 7.0a2 ignores the display property when merging, so it doesn't merge but does merge
. This is undesirable, because it's visually wrong. IE10PP2 and Chrome 14 dev behave more like the spec, and Opera 11.50 seems to be unable to make up its mind.
If span isn't an allowed child, it's probably something unpleasant like a table row or a list or such. We don't want to merge to or from something like that, because we'd most likely wind up with the wrong type of child somewhere. It should be pretty hard for this to happen given the normalization we do on the selection; I'm not actually sure how it could happen at all, actually, unless you start out with a DOM that has non-allowed children someplace. So it's basically a sanity check.
We don't let either start block or end block be a td or th. This means we'll never merge to or from a td or th. This matches Firefox 5.0a2, and reportedly Word as well. Chrome 13 dev and Opera 11.11 allow merging from a non-table cell end block to a table cell start block, but not vice versa. In IE9 the delete key just does nothing.

If start block is neither a block node nor an editing host, or "span" is not an allowed child of start block, or start block is a td or th, set start block to null.
Let end block be the active range's end node.
While end block's parent is in the same editing host and end block is an inline node, set end block to its parent.
If end block is neither a block node nor an editing host, or "span" is not an allowed child of end block, or end block is a td or th, set end block to null.

Later on we'll restore overrides. This ensures that if we delete inline formatting elements and the user then types something, the typed text will have the same style as before.

As far as I can tell, IE9 and Opera 11.50 don't do this at all. If you delete a selection and then start typing, the new text doesn't take on the styles of the old text.
Firefox 7.0a2 seems to do it for some styles but not others. Strikethrough, superscript, subscript, and links seem to be lost, at a minimum.
The spec goes with something like Chrome 14 dev, which tries to preserve lots of stuff.

Record current states and values, and let overrides be the result.

Now we actually begin deleting things.

This whole piece of the algorithm is based on deleteContents() in DOM Range, copy-pasted and then adjusted to fit.
If start node and end node are the same, and start node is an editable Text node:

Call deleteData(start offset, end offset − start offset) on start node.
Canonicalize whitespace at (start node, start offset), with fix collapsed space false.
If direction is "forward", call collapseToStart() on the context object's selection.
Otherwise, call collapseToEnd() on the context object's selection.

This is needed to restore any overrides that would otherwise be lost. TODO: In this and similar cases, we could optimize by saving only overrides, not the full state/value.
Restore states and values from overrides.
Abort these steps.

If start node is an editable Text node, call deleteData() on it, with start offset as the first argument and (length of start node − start offset) as the second argument.
Let node list be a list of nodes, initially empty.

IE9 doesn't seem to let you do any intercell deletions: the delete key does nothing if you select across multiple cells. Firefox 5.0a2 and Opera 11.11 behave as the spec says, not removing any table things. Chrome 13 dev will remove entire rows if selected. Note that IE, Firefox, Word 2007, and OpenOffice.org 3.2.1 Ubuntu all switch to a magic cell-selection mode when you try to select between cells, at least in some cases, instead of selecting letter-by-letter.
For each node contained in the active range, append node to node list if the last member of node list (if any) is not an ancestor of node; node is editable; and node is not a thead, tbody, tfoot, tr, th, or td.
For each node in node list:

Let parent be the parent of node.
Remove node from parent.

Do this before stripping wrappers: see bug 13831.
If the block node of parent has no visible children, and parent is editable or an editing host, call createElement("br") on the context object and append the result as the last child of parent.

Taking insertText to test the case where strip wrappers is false, with value a:
[foobar]baz becomes
a[]baz per spec, in IE9, and in Chrome 14 dev. Firefox 7.0a2 and Opera 11.50 make it
a[]baz, with a useless wrapper.
foo[barbaz] becomes
fooa[] per spec and in IE9 and Firefox 7.0a2 and Opera 11.50; in Chrome 14 dev apparently it initially becomes
fooa[], but then the style is recreated. This is detectable if you do something weird like instead of : it comes or such. I follow IE9 in all cases, because it makes the most sense.
If strip wrappers is true or parent is not an inclusive ancestor of start node, while parent is an editable inline node with length 0, let grandparent be the parent of parent, then remove parent from grandparent, then set parent to grandparent.
Even if strip wrappers is false, we still want to strip wrappers that aren't inclusive ancestors of start node. The idea of not stripping wrappers is that we're going to insert new content right afterward, like text or an image, but that new content will be inserted at the start node. Wrappers in other places still need to be removed, because they would otherwise remain empty.

If end node is an editable Text node, call deleteData(0, end offset) on it.
Canonicalize whitespace at the active range's start, with fix collapsed space false.
Canonicalize whitespace at the active range's end, with fix collapsed space false.

Now we need to merge blocks. The simplest case is something like
fo[o
bar
b]az
->
fo
{}
az
->
fo{}az

where neither block descends from the other. More complicated is something like
foo[
]bar
-> foo[]bar

or
foo[
]bar ->
foo[]bar

where one descends from the other.
If block merging is false, or start block or end block is null, or start block is not in the same editing host as end block, or start block and end block are the same:

If direction is "forward", call collapseToStart() on the context object's selection.
Otherwise, call collapseToEnd() on the context object's selection.
Restore states and values from overrides.
Abort these steps.

We might have added a br to the start/end block in an earlier step. Now we're about to merge the blocks, and we don't want the br's to get in the way. The end block is being destroyed no matter what. If the start block winds up empty after merging, we'll add a new br child at the end so it doesn't collapse.
If start block has one child, which is a collapsed block prop, remove its child from it.

Just repeatedly blow up the end block in this case.
If start block is an ancestor of end block:

Let reference node be end block.
While reference node is not a child of start block, set reference node to its parent.
Call collapse() on the context object's selection, with first argument start block and second argument the index of reference node.
If end block has no children:

While end block is editable and is the only child of its parent and is not a child of start block, let parent equal end block, then remove end block from parent, then set end block to parent.
If end block is editable and is not an inline node, and its previousSibling and nextSibling are both inline nodes, call createElement("br") on the context object and insert it into end block's parent immediately after end block.
If end block is editable, remove it from its parent.
Restore states and values from overrides.
Abort these steps.

If end block's firstChild is not an inline node, restore states and values from record, then abort these steps.
Let children be a list of nodes, initially empty.
Append the first child of end block to children.
While children's last member is not a br, and children's last member's nextSibling is an inline node, append children's last member's nextSibling to children.
Record the values of children, and let values be the result.
While children's first member's parent is not start block, split the parent of children.
If children's first member's previousSibling is an editable br, remove that br from its parent.

In this case, pull in everything that comes after start block, until we hit a br or block node.
Otherwise, if start block is a descendant of end block:

Call collapse() on the context object's selection, with first argument start block and second argument start block's length.
Let reference node be start block.
While reference node is not a child of end block, set reference node to its parent.
If reference node's nextSibling is an inline node and start block's lastChild is a br, remove start block's lastChild from it.
Let nodes to move be a list of nodes, initially empty.
If reference node's nextSibling is neither null nor a block node, append it to nodes to move.
While nodes to move is nonempty and its last member isn't a br and its last member's nextSibling is neither null nor a block node, append its last member's nextSibling to nodes to move.
Record the values of nodes to move, and let values be the result.
For each node in nodes to move, append node as the last child of start block, preserving ranges.

In the last case, just move all the children of the end block to the start block, and then get rid of any elements we emptied that way.
Otherwise:

Call collapse() on the context object's selection, with first argument start block and second argument start block's length.
If end block's firstChild is an inline node and start block's lastChild is a br, remove start block's lastChild from it.
Record the values of end block's children, and let values be the result.
While end block has children, append the first child of end block to start block, preserving ranges.
While end block has no children, let parent be the parent of end block, then remove end block from parent, then set end block to parent.

We might have deleted the contents between two lists, in which case we should merge them. See bug 13976.
Let ancestor be start block.
While ancestor has an inclusive ancestor ol in the same editing host whose nextSibling is also an ol in the same editing host, or an inclusive ancestor ul in the same editing host whose nextSibling is also a ul in the same editing host:

While ancestor and its nextSibling are not both ols in the same editing host, and are also not both uls in the same editing host, set ancestor to its parent.
While ancestor's nextSibling has children, append ancestor's nextSibling's firstChild as the last child of ancestor, preserving ranges.
Remove ancestor's nextSibling from its parent.

Restore the values from values.
If start block has no children, call createElement("br") on the context object and append the result as the last child of start block.
Remove extraneous line breaks at the end of start block.
Restore states and values from overrides.

Splitting a node list's parent

To split the parent of a list node list of consecutive sibling nodes:

This algorithm breaks up the parent of node list. If they're the only children of their parent, the parent is removed entirely. If there are preceding or following siblings, the original parent is left intact as the parent of those siblings. If there are both preceding and following siblings, the original parent is left as the parent of the following siblings and a clone is used for the parent of the preceding siblings.
We make sure not to disrupt the appearance any more than necessary. Obviously margins or such on the parent will be lost, but the children will not wind up on the same line as anything they weren't already on the same line as. E.g., if we split the parent of "bar" in foo
bar
, we get foo bar, not foobar. (This is amazingly complicated and error-prone.) We don't preserve inline styles: callers that want to do that should call record the values and restore the values themselves.
All this is useful in a lot of situations, like for outdenting. For inline formatting commands, we almost always rely on pushing down values instead, since that often leads to tidier markup.

Let original parent be the parent of the first member of node list.
If original parent is not editable or its parent is null, do nothing and abort these steps.
If the first child of original parent is in node list, remove extraneous line breaks before original parent.
If the first child of original parent is in node list, and original parent follows a line break, set follows line break to true. Otherwise, set follows line break to false.
If the last child of original parent is in node list, and original parent precedes a line break, set precedes line break to true. Otherwise, set precedes line break to false.

TODO: We insert things after the parent. This is bad, because it will cause them to become part of any ranges that immediately follow. For instance, if we're hitting "bar" in
foo
bar
{
baz}

it becomes
foo
{
bar
baz}

instead of
foo
bar{
baz}

because of how range mutation rules work. This doesn't happen if we insert before. This may or may not be important enough to bother working around.

If the first child of original parent is not in node list, but its last child is:

For each node in node list, in reverse order, insert node into the parent of original parent immediately after original parent, preserving ranges.
If precedes line break is true, and the last member of node list does not precede a line break, call createElement("br") on the context object and insert the result immediately after the last member of node list.
Remove extraneous line breaks at the end of original parent.
Abort these steps.

If the first child of original parent is not in node list:

Let cloned parent be the result of calling cloneNode(false) on original parent.
If original parent has an id attribute, unset it.
Insert cloned parent into the parent of original parent immediately before original parent.
While the previousSibling of the first member of node list is not null, append the first child of original parent as the last child of cloned parent, preserving ranges.

Notice that a boundary point that was immediately before the element will now be immediately before its children, just because of the regular range mutation rules, without needing to worry about preserving ranges. Likewise for boundary points immediately after the element, if we wind up removing the element in the final step. Preserving ranges is only necessary for the sake of boundary points in the element or its descendants.
For each node in node list, insert node into the parent of original parent immediately before original parent, preserving ranges.
If follows line break is true, and the first member of node list does not follow a line break, call createElement("br") on the context object and insert the result immediately before the first member of node list.
If the last member of node list is an inline node other than a br, and the first child of original parent is a br, and original parent is not an inline node, remove the first child of original parent from original parent.
If original parent has no children:

Remove original parent from its parent.
If precedes line break is true, and the last member of node list does not precede a line break, call createElement("br") on the context object and insert the result immediately after the last member of node list.

Otherwise, remove extraneous line breaks before original parent.

The parent might be null if it's a br that we removed in the last step, in which case this step isn't necessary.
If node list's last member's nextSibling is null, but its parent is not null, remove extraneous line breaks at the end of node list's last member's parent.

To remove a node node while preserving its descendants, split the parent of node's children if it has any. If it has no children, instead remove it from its parent.
Canonical space sequences

Whitespace in HTML normally collapses. However, if the user hits the space bar twice in an HTML editor, they expect to see two spaces, not one. Even if they hit the space bar once at the beginning or end of a line, it would collapse without special handling. The only good solution here is for the author to set white-space: pre-wrap on the editable area, and on everywhere the content is reproduced. But if they don't, we have to painfully hack around the problem.
This is a basically intractable problem because of the unfortunate confluence of three factors. One, our characters are Unicode, and Unicode doesn't know about whitespace collapsing, so it provides no special characters to control it. Two, HTML itself provides no features that control whitespace collapsing without undesired side effects (like inhibiting line breaks or not being allowed inside
). Three, we need to support user agents that don't reliably support CSS, since that includes many popular mail clients.
The upshot is we have no good way to control whitespace collapse, so we rely on the least bad way available: . This doesn't collapse with adjacent whitespace in browsers, which is good. But it also doesn't allow a line break opportunity, which is bad. In any run of whitespace that we don't want to collapse, any two regular spaces must be separated by an so they don't collapse together, but we need to carefully limit runs of consecutive s to minimize the damage to line-breaking behavior.
The result is an elaborate and meticulously-crafted hodgepodge of bad compromises that frankly isn't worth the effort to explain here. The saving grace is that it all gets disabled if white-space is set to pre-wrap as it should be, so authors can opt out of the insanity. Interested readers will find detailed rationale for the exact sequences required in the comments.

See long comment before insertText.
The canonical space sequence of length n, with boolean flags non-breaking start and non-breaking end, is returned by the following algorithm:

If n is zero, return the empty string.
If n is one and both non-breaking start and non-breaking end are false, return a single space (U+0020).
If n is one, return a single non-breaking space (U+00A0).
Let buffer be the empty string.
If non-breaking start is true, let repeated pair be U+00A0 U+0020. Otherwise, let it be U+0020 U+00A0.
While n is greater than three, append repeated pair to buffer and subtract two from n.
If n is three, append a three-code unit string to buffer depending on non-breaking start and non-breaking end:

non-breaking start and non-breaking end false
U+0020 U+00A0 U+0020
non-breaking start true, non-breaking end false
U+00A0 U+00A0 U+0020
non-breaking start false, non-breaking end true
U+0020 U+00A0 U+00A0
non-breaking start and non-breaking end both true
U+00A0 U+0020 U+00A0

Otherwise, append a two-code unit string to buffer depending on non-breaking start and non-breaking end:

non-breaking start and non-breaking end false
non-breaking start true, non-breaking end false
U+00A0 U+0020
non-breaking start false, non-breaking end true
U+0020 U+00A0
non-breaking start and non-breaking end both true
U+00A0 U+00A0

Return buffer.

To canonicalize whitespace at (node, offset), given an optional boolean argument fix collapsed space that defaults to true:

If node is neither editable nor an editing host, abort these steps.
Let start node equal node and let start offset equal offset.

First we go to the beginning of the current whitespace run.
Repeat the following steps:

If start node has a child in the same editing host with index start offset minus one, set start node to that child, then set start offset to start node's length.

TODO: Following a line break is unlikely to be the right criterion.
Otherwise, if start offset is zero and start node does not follow a line break and start node's parent is in the same editing host, set start offset to start node's index, then set start node to its parent.
Otherwise, if start node is a Text node and its parent's resolved value for "white-space" is neither "pre" nor "pre-wrap" and start offset is not zero and the (start offset − 1)st code unit of start node's data is a space (0x0020) or non-breaking space (0x00A0), subtract one from start offset.
Otherwise, break from this loop.

Now we collapse any consecutive spaces, if fix collapsed space is true.
Let end node equal start node and end offset equal start offset.
Let length equal zero.

This tries to delete spaces at the beginning of a line (bug 14119).
Let collapse spaces be true if start offset is zero and start node follows a line break, otherwise false.
Repeat the following steps:

If end node has a child in the same editing host with index end offset, set end node to that child, then set end offset to zero.

TODO: Preceding a line break is unlikely to be the right criterion.
Otherwise, if end offset is end node's length and end node does not precede a line break and end node's parent is in the same editing host, set end offset to one plus end node's index, then set end node to its parent.
Otherwise, if end node is a Text node and its parent's resolved value for "white-space" is neither "pre" nor "pre-wrap" and end offset is not end node's length and the end offsetth code unit of end node's data is a space (0x0020) or non-breaking space (0x00A0):

If fix collapsed space is true, and collapse spaces is true, and the end offsetth code unit of end node's data is a space (0x0020): call deleteData(end offset, 1) on end node, then continue this loop from the beginning.
Set collapse spaces to true if the end offsetth code unit of end node's data is a space (0x0020), false otherwise.
Add one to end offset.
Add one to length.

Otherwise, break from this loop.

We've already stripped leading whitespace, and collapsed consecutive spaces. Now we try to strip any collapsed trailing whitespace (bug 14119 again).
If fix collapsed space is true, then while (start node, start offset) is before (end node, end offset):

If end node has a child in the same editing host with index end offset − 1, set end node to that child, then set end offset to end node's length.
Otherwise, if end offset is zero and end node's parent is in the same editing host, set end offset to end node's index, then set end node to its parent.
Otherwise, if end node is a Text node and its parent's resolved value for "white-space" is neither "pre" nor "pre-wrap" and end offset is end node's length and the last code unit of end node's data is a space (0x0020) and end node precedes a line break:

Subtract one from end offset.
Subtract one from length.
Call deleteData(end offset, 1) on end node.

Otherwise, break from this loop.

Finally we replace with the canonical sequence.
Let replacement whitespace be the canonical space sequence of length length. non-breaking start is true if start offset is zero and start node follows a line break, and false otherwise. non-breaking end is true if end offset is end node's length and end node precedes a line break, and false otherwise.
While (start node, start offset) is before (end node, end offset):

If start node has a child with index start offset, set start node to that child, then set start offset to zero.
Otherwise, if start node is not a Text node or if start offset is start node's length, set start offset to one plus start node's index, then set start node to its parent.
Otherwise:

Remove the first code unit from replacement whitespace, and let element be that code unit.
If element is not the same as the start offsetth code unit of start node's data:

We need to insert then delete, so that we don't change range boundary points. TODO: switch to using "replace data" now that DOM Core has defined that.
Call insertData(start offset, element) on start node.
Call deleteData(start offset + 1, 1) on start node.

Add one to start offset.

Indenting and outdenting

There are two basically different types of indent/outdent: lists, and everything else. For lists we'll wrap the item in a nested list to indent, and split its parent to outdent. For everything else we'll wrap in a
to indent, and try breaking it out of an ancestor indentation element to outdent.
Indenting winds up being pretty simple: just add an appropriate wrapper. There's not really anything to think about here except which wrapper we want (
or
or
), and establishing that is not rocket science.
Outdenting is considerably more complicated. The basic idea we follow is to first find the nearest editable ancestor that's a list or indentation element. If we succeed, and the node we're trying to outdent is the only descendant of the ancestor, of course we can just remove the ancestor and that's that. Otherwise, what we do is remove the ancestor and then indent all its other descendants, much like pushing down values.
But of course, there are complications. We don't always actually want to remove the closest ancestor that's causing indentation. For one thing, we prefer ancestors that we can remove completely, i.e., simple indentation elements. When outdenting
foo
, removing the inner tag would result in foo, since we don't want to lose the id. Thus we prefer to remove the outer tag and wind up with foo.
Also, if the node we're outdenting is itself a list, we prefer to remove an ancestor indentation element rather than the list. Otherwise, if the user selected some text, indented it, then added a list, there would be no way to remove the indentation without removing the list first. This way, the user could remove the list with the appropriate list-toggling command or remove the indentation with the outdent command.

We have to handle entire lists of siblings at once, or else we'd wind up doing something like
{
foo

bar
}
->

foo

bar

->

foo

bar

since by the time we got to doing the
that originally contained "bar", we won't remember that we aren't supposed to indent "foo" a second time.

To indent a list node list of consecutive sibling nodes:

If node list is empty, do nothing and abort these steps.
Let first node be the first member of node list.
If first node's parent is an ol or ul:

Let tag be the local name of the parent of first node.

This matches IE9, Firefox 4.0, and Chrome 12 dev. If there's a preceding
, Opera 11.10 instead adds the new parent to the end of that
, so it's not the child of another list, which is invalid. But the other browsers' way of doing things makes things simpler. E.g., if we want to indent an
and it has
/
children, we have to distinguish between the case where we want to indent the whole
or only the first part. It also allows things like
foo
bar
baz

in which case it's unclear what we should do if the user selects "foo" and indents. I've filed a bug on HTML5.

Wrap node list, with sibling criteria returning true for an HTML element with local name tag and false otherwise, and new parent instructions returning the result of calling createElement(tag) on the ownerDocument of first node.
Abort these steps.

Firefox 4.0 respects the CSS styling flag for indent, but Chrome 12 dev does not. I always produce blockquotes, even if CSS styling is on, for two reasons. One, IE9 handles inline margin attributes badly: when outdenting, it propagates the margin to the parent, which doesn't actually remove it. Two, in CSS mode I'd want to use
to match non-CSS mode, but authors are very likely to want to remove the top/bottom margin, which they can't do if it's not a special tag. Authors who really want divs for indentation could always convert the blockquotes to divs themselves. But if people really want it, I could respect CSS styling mode here too.
The top/bottom margins might be undesirable here, but no more so than for
/
/
/etc. Here as there, authors can remove them with CSS if they want.
blockquote indents on both sides, so we don't have to worry about directionality. In theory it would be better if we indented only on the start side, but that requires care to get right in mixed-direction cases. Even once browsers start to support margin-start and so on, we can't use them because a) we have to work okay in legacy browsers and b) it doesn't help if a descendant block has different direction (so should be indented the other way). So let's not worry about it: most browsers don't, and the ones that do get it wrong. Just indent on both sides.

Wrap node list, with sibling criteria returning true for a simple indentation element and false otherwise, and new parent instructions returning the result of calling createElement("blockquote") on the ownerDocument of first node. Let new parent be the result.
Fix disallowed ancestors of new parent.

Things that are produced for indentation that we need to consider removing:

Plain
(produced by spec, Firefox 4.0 non-CSS, Opera 11.00)
and
(IE9)
(Chrome 12 dev)
and
(Firefox 4.0 CSS if no other element available)
Other random things with display: block whose left or right margin was increased by 40px (Firefox 4.0 CSS)

For discussion on the list-related stuff, see the comment for insertOrderedList.
Gecko in CSS mode just adds margin properties to random elements that are lying around. We don't attempt to remove those, because 1) the amount and position of the margin can vary (it increases the margin if there's a preexisting one), so it's potentially complicated, and 2) no browser removes such margins on outdent, including Gecko, except for Gecko in CSS mode. TODO: Consider removing it anyway.

To outdent a node node:

If node is not editable, abort these steps.

The easy case is when the whole element is indented. In this case we remove the whole thing indiscriminately. In the case of blockquotes created by IE, this might change the direction of some children, but then their direction was probably changed incorrectly in the first place, so no harm.
If node is a simple indentation element, remove node, preserving its descendants. Then abort these steps.

This might be a simple indentation element that had style added to it by Firefox in CSS mode, for instance (color, font-family, etc.).
If node is an indentation element:

Unset the dir attribute of node, if any.
Unset the margin, padding, and border CSS properties of node.
Set the tag name of node to "div".
Abort these steps.

Approximate algorithms when an ancestor is causing the indentation appear to be:

IE9
Go to the innermost element causing indentation. If the stuff to be outdented includes all the contents of that element, get rid of it, but if it has any attributes, change it to a
with those same attributes. This is an excellent idea in general, but unfortunately it preserves explicitly-specified margins in style attributes, which isn't great. In other cases, it moves the stuff to be outdented outside. Not clear on all the details, seems to be pretty confusing. Also does a bunch of seemingly arbitrary normalization like removing divs and some attributes from some things . . .
Firefox 4.0
Go to the innermost element causing indentation. If the stuff to be outdented includes all the contents of that element, get rid of it, even if it has arbitrary attributes. Otherwise, move the stuff to be outdented outside the indenting element. If there are any intervening elements that include stuff not to be outdented, wrap the outdented stuff in copies (which can duplicate id's, etc.).
Chrome 12 dev
Go to the outermost element causing indentation (even if the current element is itself causing indentation). Move the text to be outdented outside that outermost element, without regard to any intervening elements. Then recreate the original styles on the moved text, in some fashion. Something like that; it confuses me and doesn't seem to be reasonable.
Opera 11.00
Like Firefox, except it goes to the outermost element, not the innermost. Also seems to special-case to avoid duplicate id's, and has a few other quirks.

Overall, all flawed, so I'll make up my own, patterned after pushing down styles. First we search ancestors for a simple indentation element, which we stand a chance of completely removing. Failing that, we look for an indentation element that's not simple, so we can't completely remove it.

Let current ancestor be node's parent.
Let ancestor list be a list of nodes, initially empty.
While current ancestor is an editable Element that is neither a simple indentation element nor an ol nor a ul, append current ancestor to ancestor list and then set current ancestor to its parent.
If current ancestor is not an editable simple indentation element:

Let current ancestor be node's parent.
Let ancestor list be the empty list.
While current ancestor is an editable Element that is neither an indentation element nor an ol nor a ul, append current ancestor to ancestor list and then set current ancestor to its parent.

When asked to outdent a list wrapped in a simple indentation element, Chrome 12 dev removes the list instead of the simple indentation element. Opera 11.10 seems to remove both. IE9 and Firefox 4.0 remove the simple indentation element, as does the spec.
If node is an ol or ul and current ancestor is not an editable indentation element:

Unset the reversed, start, and type attributes of node, if any are set.
Let children be the children of node.

We can't turn it into a div if it's the child of an ol or ul, because that's not allowed: there's no way to group li's (see HTML bug 13128).
If node has attributes, and its parent is not an ol or ul, set the tag name of node to "div".
Otherwise:

Record the values of node's children, and let values be the result.
Remove node, preserving its descendants.
Restore the values from values.

Fix disallowed ancestors of each member of children.
Abort these steps.

If current ancestor is not an editable indentation element, abort these steps.

If we get to this point, we have an ancestor to split up.
Append current ancestor to ancestor list.

We can't outdent it yet, because we need its children to remain intact for the loop.
Let original ancestor be current ancestor.
While ancestor list is not empty:

Let current ancestor be the last member of ancestor list.
Remove the last member from ancestor list.
Let target be the child of current ancestor that is equal to either node or the last member of ancestor list.
If target is an inline node that is not a br, and its nextSibling is a br, remove target's nextSibling from its parent.
Let preceding siblings be the precedings siblings of target, and let following siblings be the followings siblings of target.
Indent preceding siblings.
Indent following siblings.

Outdent original ancestor.

Toggling lists

This is the action for the insertOrderedList command and the insertUnorderedList command, which behave identically except for which list type they target. It does several things that vary contextually.
If everything in the selection is contained in the target list type already, this more or less just outdents everything one step. This is relatively simple.
Otherwise, it's slightly more complicated:
First, any lists of the opposite list type (other tag name) get converted to the target list type (tag name). They get merged into a sibling if appropriate, otherwise we set the tag name.
Then we go through all the affected nodes, handling each run of consecutive siblings separately. Any line that's not already wrapped in an
gets wrapped. If the parent at this point isn't a list at all, the run gets wrapped in a list. If it's the wrong type of list, we split the parent and rewrap it in the right type of list. That's basically it, except that we have to exercise the usual care to try merging with siblings and so forth.

Research for insertOrderedList/insertUnorderedList: tested the following command sequences in IE9, Firefox 4.0, Chrome 12 dev, Opera 11.10, OpenOffice.org 3.2.1 Ubuntu package, Microsoft Office Word 2007. The commands "ol", "ul", "indent", "outdent" correspond in browsers to "insertOrderedList", "insertUnorderedList", "indent", and "outdent"; in OO.org to "Numbering On/Off", "Bullets On/Off", "Increase Indent", "Decrease Indent"; and in Word to "Numbering", "Bullets", "Increase Indent", "Decrease Indent".
Note: OO has a bunch of extra options, like "Promote One Level", "Demote One Level", "Promote One Level With Subpoints", "Demote One Level With Subpoints", "Insert Unnumbered Entry", "Restart Numbering". The regular "Increase/Decrease Indent" commands work oddly, and I assume they're not really meant to be used inside lists. Thus I also tested with "Promote One Level" and "Demote One Level". These are denoted by OO' instead of OO.
Assume that there are style rules in effect like
ol ol { list-style-type: lower-alpha } ol ol ol { list-style-type: lower-roman }

This is the default appearance in Word, and I set OO to something similar with Bullets and Numbering → Outline in the list editing toolbox. I'm ignoring bullet style throughout, for no particular reason.

In an existing ordered list equivalent to
foo
bar
baz
quz:

Select "bar", do "ol":

Word/OO
Remove indent and number "2", change "3" to "2".
Browsers
Remove indent and number "2", change "3" to "1".
Spec
Same as browsers.

Select "bar", do "ul":

Word
Leave indent the same, change "2" to a bullet, change "3" to "2".
OO
Increase indent, change "2" to a bullet, change "3" to "2".
IE
Change all numbers to bullets.
Firefox/Chrome/Opera
Leave indent the same, change "2" to a bullet, change "3" to "1".
Spec
Same as Firefox/Chrome/Opera.

Select "bar", do "indent":

Word/OO'/Browsers
Increase indent, change "2" to "a", change "3" to "2".
OO
Increase indent, do not change any numbers.
Spec
Same as Word/OO'/Browsers.

Select "bar", do "outdent":

Word
Do nothing.
OO
Leave indent the same, de-indent "2" so it goes past the left margin (?!), do not change any numbers.
OO'
Option grayed out.
Browsers
Remove indent and the number "2", change "3" to "1".
Spec
Same as browsers.

Select "quz", do "ol":

Word/OO/IE/Chrome
Add as fourth item to existing list, numbered "4".
Firefox/Opera
Create new list, number the item "1".
Spec
Same as OO/Word/IE/Chrome.

In an existing ordered list equivalent to
foo
bar
baz
:

Select "foo", do "ol":

Word/OO/IE/Chrome/Opera
Remove indent from both "foo" and "bar", change "2" -> "1".
Firefox
Increase indent for "foo" only, add additional "a" marker after "1" and before "foo".
Spec
Same as Word/OO/IE/Chrome/Opera.

Select "foo", do "ul":

Word/Opera
Change "1" -> bullet, "2" -> "1".
OO
Increase indent for both "foo" and "bar", change "1" -> bullet, "2" -> "1".
IE
Change all numbers to bullets.
Firefox
Increase indent for "foo" only, add additional bullet marker after "1" and before "foo".
Chrome
Remove indent from "bar", change "1" -> bullet, "2" -> "1".
Spec
Same as Word/Opera.

Select "foo", do "indent":

Word
Increase indent for whole list.
OO
Increase indent for both "foo" and "bar".
OO'
Increase indent for "foo", change "1" -> "a".
IE/Firefox non-CSS/Opera
Increase indent for both "foo" and "bar", change "1" -> "a", "2" -> "1".
Firefox CSS
Increase indent for "foo" only (
).
Chrome
Increase indent for "foo" only, add "a" before "foo", move "1" to be before "bar".
Spec
Same as IE/Firefox non-CSS/Opera.

Select "foo", do "outdent":

Word
Decrease indent for whole list, so it goes past the left margin.
OO
Decrease indent for "bar" and "1." (so "1." goes past the left margin), but not "foo".
OO'
Option grayed out.
IE/Chrome/Opera
Remove indent from both "foo" and "bar", remove "1", change "2" -> "1".
Firefox
Do nothing.
Spec
Same as IE/Chrome/Opera.

Select "bar", do "ol":

Word/OO/IE/Chrome/Opera
Remove indent from both "foo" and "bar", change "2" -> "1".
Firefox
Increase indent for "bar" only, add "a" marker before it.
Spec
Same as Word/OO/IE/Chrome/Opera.

Select "bar", do "ul":

Word/Opera
Change "1" -> bullet, "2" -> "1".
OO
Increase indent for both "foo" and "bar", change "1" -> bullet, "2" -> "1".
IE
Change all numbers to bullets.
Firefox
Increase indent for "bar" only, add bullet marker before it.
Chrome
Remove indent from "foo", change "1" -> bullet and move it before "bar", change "2" -> "1".
Spec
Same as Word/Opera.

Select "bar", do "indent":

Word
Increase indent for whole list.
OO
Increase indent for both "foo" and "bar".
OO'
Increase indent for "foo", change "1" -> "a".
IE/Firefox non-CSS/Opera
Increase indent for both "foo" and "bar", change "1" -> "a", "2" -> "1".
Firefox CSS
Increase indent for "bar" only (
).
Chrome
Increase indent for "bar" only, add "a" before "bar", move "bar" above "foo" (?!).
Spec
Same as IE/Firefox non-CSS/Opera.

Select "bar", do "outdent":

Word
Decrease indent for whole list, so it goes past the left margin.
OO
Decrease indent for "bar" and "1." (so "1." goes past the left margin), but not "foo".
OO'
Option grayed out.
IE/Chrome/Opera
Remove indent from both "foo" and "bar", remove "1", change "2" -> "1".
Firefox
Do nothing.
Spec
Same as IE/Chrome/Opera.

In an existing nested ordered list equivalent to
foo
bar
baz
quz
:

Select "bar", do "ol":

Word/IE/Firefox
Decrease indent, remove "a" ("bar" is aligned with "foo" with no marker of its own), change "b" -> "a".
OO
Remove all indent, change "b" -> "a".
Chrome
Decrease indent, change "a" -> "2", "b" -> "a", "2" -> "3".
Opera
Decrease indent, change "a" -> "2", "b" -> "a", "2" -> "4", insert extra "3" list marker before new "a".
Spec
Same as Chrome.

Select "bar", do "ul":

Word/Firefox/Chrome
Change "a" -> bullet, "b" -> "a".
OO
Increase indent, change "a" -> bullet, "b" -> "a".
IE
Change "a" and "b" to bullets.
Opera
Change "a" -> bullet, "b" -> "a", "2" -> "4", insert extra list markers "2" and "3" before new bullet and "a".
Spec
Same as Word/Firefox/Chrome.

Select "bar", do "indent":

Word/OO'/IE
Increase indent, change "a" -> "i", leave "b" alone.
OO
Increase indent, do not change numbers.
Firefox/Chrome/Opera
Increase indent, change "a" -> "i", "b" -> "a".
Spec
Same as Firefox/Chrome/Opera.

Select "bar", do "outdent":

Word/OO'/IE/Chrome
Decrease indent, change "a" -> "2", "b" -> "a", "2" -> "3".
OO
Leave indent the same, de-indent "a" so it goes past the left margin (?!).
Firefox
Decrease indent, remove "a" ("bar" is aligned with "foo" with no marker of its own), change "b" -> "a".
Opera
Decrease indent, change "a" -> "2", "b" -> "a", "2" -> "4", insert extra list marker "3" before new "a".
Spec
Same as Word/OO'/IE/Chrome.

In existing nested lists equivalent to
foo
bar
baz
quz
:

Select "bar", do "ol":

Word
Change all bullets to numbers. (Not letters, even though indented!)
OO
Decrease indent, change first bullet -> "2", "2" -> "3".
IE
Change all bullets to letters.
Firefox/Chrome
Change first bullet to "a".
Opera
Change first bullet -> "a", "2" -> "4", insert extra list markers "2" and "3" before new "a" and bullet.
Spec
Same as Firefox/Chrome.

Select "bar", do "ul":

Word/IE/Firefox
Decrease indent, remove first bullet ("bar" is aligned with "foo" with no marker of its own).
OO
Remove all indent, remove first bullet, leave all else the same.
Chrome
Decrease indent, change first bullet -> "2", "2" -> "3".
Opera
Decrease indent, change first bullet -> "2", "2" -> "4", insert extra list marker "3" before old bullet.
Spec
Same as Chrome.

Select "bar", do "indent":

Word
Increase indent, change first bullet to "i" (?!).
OO/OO'/Firefox/Chrome/Opera
Increase indent.
IE
Increase indent, change "2" -> "3" (?!?!). (I don't see from the markup why the 2 actually changes to a 3. The markup seems to be as other browsers.)
Spec
Same as OO/OO'/Firefox/Chrome/Opera.

Select "bar", do "outdent":

Word/IE/Chrome
Decrease indent, change first bullet -> "2", "2" -> "3".
OO
Usual crazy stuff, move bullet left but leave text alone.
OO'
Option grayed out. (Interesting.)
Firefox
Decrease indent, remove first bullet ("bar" is aligned with "foo" with no marker of its own).
Opera
Decrease indent, change first bullet -> "2", "2" -> "4", insert extra list marker "3" before old bullet.
Spec
Same as Word/IE/Chrome.

In an existing nested ordered list equivalent to
foo
bar
baz
quz
:

Select "bar", do "ol":

Word/OO
Remove indent and "2", change "3" -> "2".
IE/Chrome/Opera
Remove indent and "2", decrease indent of "baz", change "2" and "3" -> "1".
Firefox
Increase indent, add extra "a" marker between "2" and "bar".
Spec
Different from all of them: remove indent and "2", change "3" -> "1".

Select "bar", do "ul":

Word
Change "2" -> bullet.
OO
Increase indent, change "2" -> bullet, "3" -> "2".
IE
Change "1", "2", "3" -> bullets (and "a" to "1").
Firefox
Increase indent, add extra bullet marker between "2" and "bar".
Chrome
Decrease indent of "baz", change "2" -> bullet, "a" and "3" -> "1".
Opera
Change "2" -> bullet, "a" and "3" -> "1".
Spec
Different from all of them: change "2" -> bullet, "3" -> "1".

Select "bar", do "indent":

Word/OO'
Increase indent, change "2" -> "a", "a" -> "b", "3" -> "2".
OO
Increase indent (double amount, past "baz").
Firefox non-CSS/Opera
Increase indent of both "bar" and "baz", change "2" -> "a", "a" -> "i", "3" -> "2".
Firefox CSS
Increase indent.
Chrome
Increase indent, add "a" marker before "bar", move "2" marker to before the "a" marker of "baz".
Spec
Same as Word/OO'.

Select "bar", do "outdent":

Word/Firefox
Do nothing.
OO
Decrease indent on "2", leave "bar" alone.
OO'
Option grayed out.
IE
Decrease indent of "baz", change "2" and "3" -> "1", "a" -> "2".
Chrome/Opera
Decrease indent of "bar" and "baz", remove "2", change "a" and "3" -> "1".
Spec
Different from all of them: remove indent and "2", change "3" -> "1".

In an existing nested ordered list equivalent to
foo
bar
baz
quz
qoz
:

Does not appear to be possible in Word or OO.
Also might be impossible to actually make such a list using execCommand() in browsers.
Suffice it to say that there's a lot of variation.

In an existing indented region equivalent to foo
bar
baz:

Select "bar", do "ol":

Word/OO/Firefox/Chrome
Increase indent, add "1".
IE
Increase indent, add "a".
Opera
Add "1" (but do not increase indent).

Select "foobar", do "ol":

Word/IE
Increase indent of both, add "1" before "foo" and "a" before "bar".
OO
Increase indent of "bar" one step, increase indent of "foo" two steps so it's aligned with "bar", add "1" before "foo" and "2" before "bar".
Firefox
Increase indent of both, add "1" before foo", add "2" before "bar" aligned with the "1" of "foo" (so large gap between "2" and "bar").
Chrome
Increase indent of "foo", add "1" before "foo" and "2" before "bar".
Opera
Mash everything together on one line. But if you do
foo
bar
baz
instead, same as Chrome.

Select "foo" and do "ol", then select "bar" and do "ol":

Word/OO/Firefox/Opera
Different than doing both at once (often in exciting ways).
IE/Chrome
Same as doing both at once.

foo
bar
baz

Select "foobar" and do "ol":

Word
One-item list with sublist.
OO/Firefox/Chrome/Opera
One two-item list, unindented.
IE9
Two one-item lists.

Select "foo", do "ol", then select "bar" and do "ol":

Word/OO/Chrome
One two-item list, unindented.
IE9/Firefox
Two one-item lists.
Opera
Two one-item lists, both unindented.

Desired behavior: One-item list with sublist in both cases.

In an existing multi-line indented region equivalent to
foo
bar
baz
:

Select "bar", do "ol":

Word/OO/Firefox/Chrome
Increase indent, add "1".
IE
Increase indent of everything, add "a" before "foo". If you do
foo
bar
baz
, same as Word/OO/Firefox/Chrome.
Opera
Don't increase indent of anything, add "1" before "bar".

In an existing multi-line indented region equivalent to
foo
bar
baz:

Select "barbaz", do "ol":

Word
Indent both, add "a" before "bar" and "2" before "baz".
OO
Indent "baz", add "1" before "bar" and "2" before "baz".
IE
Indent everything, add "a" before "foo" and "1" before "baz". If you do
foo
bar
baz, indent "bar" and "baz" and put "1" before each.
Firefox
Indent "bar" and put "1" before it, put "baz" after "bar" on the same line. If you do
foo
bar
baz, same as Chrome.
Chrome
Indent "bar" once and "baz" twice, put "1" before "bar" and "2" before "baz".
Opera
Put a "1" before "bar" and move "baz" to the same line. If you do
foo
bar
baz, indent "baz", put a "1" before "bar" and a "2" before "baz".

Select "bar", do "ol", then select "baz" and do "ol":

Word/OO/Opera
Different from if you do both together.
IE
Different with
, same with
.
Firefox
Three behaviors, depending on whether you do it in one step with
, one step with
, or two steps with either (same behavior regardless with two steps).
Chrome
Same behavior in all four cases.

foo
bar
baz:

Select "baz", do "ol":

Word/OO/Chrome
Add "baz" as a new item to existing list.
IE/Firefox/Opera
Make "baz" its own new list.

foo
bar
baz:

Select "baz", do "ol":

IE/Firefox/Chrome/Opera
Separate list.

Ignoring the conceptual model of HTML, which users won't understand, here's the conceptual model I've developed for lists: text is divided up into blocks. Each block has an indentation level and a list marker type. The list marker type can be either nothing, ordered, or unordered. A list block cannot have indentation level less than one. Any given piece of text is part of only one block. A block may be visually non-contiguous, such as if a single list block is interrupted by a further-indented block.
To find the right number (or letter) for an ordered-list block, look at the immediately preceding block, but skip over any blocks of higher indentation level. If there is no immediately preceding block, or it's not an ordered-list block, or it has a lower indentation level, the number is 1 (or a, i, etc.). Otherwise, it's the number of the preceding block plus one.
ol/ul commands change the selected block to that list marker type, or remove the list marker type if it's already the chosen type. If the block has indentation level zero, it increases to one.
indent/outdent commands change the selected block's indentation level. If a list block's indentation level is reduced to zero, it's converted to a regular block.
What this means from an HTML perspective, roughly:

A list block is the entire contents of an
element, ignoring any nested list elements or indentation elements. A non-list block is a line box.
Indentation level is equal to the number of ancestor elements that are either
s or indentation elements (blockquotes or indenting divs).
To find the list marker type, go to the first ancestor that's either an
or indentation element.
Correct numbering should automatically follow from the way
works in HTML (which is one of the reasons I use this model).
An ol command in an ordered-list block removes the surrounding
, migrating its contents into the parent of the
. This splits up the
if it's not the first or last child, and wraps the contents in a new
if necessary. If there's another list or indentation element nested in the
we're removing, it will get re-wrapped in a new
, outside the newly-created
, so that it maintains its indentation. This might cause the new
to wind up in multiple pieces, if the original block was not contiguous, which means the non-contiguous block is split into several blocks (with different numbers).
An ol command in an unordered-list block breaks up the parent
and puts a new
in between the two pieces, moving the parent
into it. If the
was the first or last child, we merge with an existing adjacent
if possible. All children stay as they are.
An ol command in a non-list block with indentation zero wraps it in a new
, or merges with an adjacent
if possible.
An ol command in a non-list block with nonzero indentation converts the parent to an
, breaking it up if necessary.
The ul command works similarly to ol.
indent in a non-list block wraps in an indentation element. In a list block, it wraps the
in an extra
or
as appropriate. With merging. Whatever. Let me just write the spec.
outdent in a non-list block strips an indentation element, if one is present. In a list block, it breaks apart the parent
or
and makes the affected block a sibling in between the newly-split list elements. Will create new
s, etc. etc.

Sheesh, lists are complicated.

To toggle lists, given a string tag name (either "ol" or "ul"):

Let mode be "disable" if the selection's list state is tag name, and "enable" otherwise.
Let other tag name be "ol" if tag name is "ul", and "ul" if tag name is "ol".
Let items be a list of all lis that are inclusive ancestors of the active range's start and/or end node.

TODO: This overnormalizes, but it seems like the simplest solution for now.
For each item in items, normalize sublists of item.
Block-extend the active range, and let new range be the result.
If mode is "enable", then let lists to convert consist of every editable HTML element with local name other tag name that is contained in new range, and for every list in lists to convert:

Convert it to the right name. If possible, we want to merge with a neighboring list of the correct type. Failing that, we set the tag name.
If list's previousSibling or nextSibling is an editable HTML element with local name tag name:

Let children be list's children.
Record the values of children, and let values be the result.
Split the parent of children.
Wrap children, with sibling criteria returning true for an HTML element with local name tag name and false otherwise.
Restore the values from values.

Otherwise, set the tag name of list to tag name.

Let node list be a list of nodes, initially empty.

We exclude indentation elements so that selecting some random text and doing indent followed by insertOrderedList will have the same result as the reverse. Specifically,
[foo]
->
[foo]

per spec and Firefox 4.0 and (more or less) Chrome 12 dev. Opera 11.10 instead does
foo
, so the indentation vanishes. IE9 does
foo
, but that doesn't make semantic sense and is different from how it would work if you reversed the commands. OpenOffice.org 3.2.1 (Ubuntu) and Word 2007 both agree with the spec in this case.

For each node node contained in new range, if node is editable; the last member of node list (if any) is not an ancestor of node; node is not an indentation element; and either node is an ol or ul, or its parent is an ol or ul, or it is an allowed child of "li"; then append node to node list.

We don't want to touch these. E.g., assuming tag name is "ol",
[foo
bar
baz] ->
[foo
bar
baz]
not
[foo
bar
baz]
.

But
foo
[bar
baz
quz]
->
foo
[bar
baz
quz]
not
foo
[bar
baz
quz]

If mode is "enable", remove from node list any ol or ul whose parent is not also an ol or ul.
If mode is "disable", then while node list is not empty:

Let sublist be an empty list of nodes.
Remove the first member from node list and append it to sublist.
If the first member of sublist is an HTML element with local name tag name, outdent it and continue this loop from the beginning.
While node list is not empty, and the first member of node list is the nextSibling of the last member of sublist and is not an HTML element with local name tag name, remove the first member from node list and append it to sublist.
Record the values of sublist, and let values be the result.
Split the parent of sublist.
Fix disallowed ancestors of each member of sublist.
Restore the values from values.

Otherwise, while node list is not empty:

Let sublist be an empty list of nodes.

Accumulate consecutive sibling nodes in sublist, first converting them all to li's (except if they're already lists).
While either sublist is empty, or node list is not empty and its first member is the nextSibling of sublist's last member:

Thus
foo
becomes
foo
instead of
foo
, and likewise for div, but other things will be put inside the
.
If node list's first member is a p or div, set the tag name of node list's first member to "li", and append the result to sublist. Remove the first member from node list.
Otherwise, if the first member of node list is an li or ol or ul, remove it from node list and append it to sublist.
Otherwise:

Let nodes to wrap be a list of nodes, initially empty.
While nodes to wrap is empty, or node list is not empty and its first member is the nextSibling of nodes to wrap's last member and the first member of node list is an inline node and the last member of nodes to wrap is an inline node other than a br, remove the first member from node list and append it to nodes to wrap.
Wrap nodes to wrap, with new parent instructions returning the result of calling createElement("li") on the context object. Append the result to sublist.

In this case it's already wrapped properly, nothing more to do.
If sublist's first member's parent is an HTML element with local name tag name, or if every member of sublist is an ol or ul, continue this loop from the beginning.
If sublist's first member's parent is an HTML element with local name other tag name:

Record the values of sublist, and let values be the result.
Split the parent of sublist.
Wrap sublist, with sibling criteria returning true for an HTML element with local name tag name and false otherwise, and new parent instructions returning the result of calling createElement(tag name) on the context object.
Restore the values from values.
Continue this loop from the beginning.

Wrap sublist, with sibling criteria returning true for an HTML element with local name tag name and false otherwise, and new parent instructions being the following:

Special case: something like
foo
[bar]

becomes
foo
[bar]

instead of
foo
[bar]
.

We handle the special case in new parent instructions instead of outside because we'd prefer to wind up in a sibling if there is one. We handle only previousSibling, not nextSibling, because we really mean for this to cover something like
[foo
bar]

which we'll handle node-by-node. TODO: Maybe we should do this differently, like just special-case simple indentation elements in an earlier part of the algorithm? This way's a bit weird.

If sublist's first member's parent is not an editable simple indentation element, or sublist's first member's parent's previousSibling is not an editable HTML element with local name tag name, call createElement(tag name) on the context object and return the result.
Let list be sublist's first member's parent's previousSibling.
Normalize sublists of list's lastChild.
If list's lastChild is not an editable HTML element with local name tag name, call createElement(tag name) on the context object, and append the result as the last child of list.
Return the last child of list.

Fix disallowed ancestors of the previous step's result.

Justifying the selection

This is the action for the four justify* commands. It's pretty straightforward, with no notable gotchas or special cases. It works more or less like a stripped-down version of set the selection's value, except it gets to be much simpler because it's much less general. (It's not similar enough to just invoke that algorithm: too many things differ between block and inline elements.)

There are two basic ways this works in browsers: using the align attribute, and using CSS text-align. IE9 and Opera 11.11 use only the align attribute, Chrome 13 dev uses only text-align, and Firefox 5.0a2 varies based on styleWithCSS. The two ways produce entirely different results, which is a real problem, so I don't think Firefox's approach is tenable. text-align is more valid, and for typical contenteditable cases it works the same. But for cases where you have fixed-width blocks, like tables or just divs with a width set, it behaves differently, and in those cases the align attribute is more useful.
TODO: text-align doesn't behave as expected if there are descendant blocks with non-100% width, like tables. The align attribute behaves a lot more nicely in such cases, but it's not valid. Not clear what to do. For now I've stuck with text-align, just because the cases where it misbehaves can't be created by any sequence of stock execCommand()s that I know of, but this needs more careful consideration. Gecko in CSS mode seems to special-case tables, adding auto margins to the table element to get it to align correctly.
TODO: We could do something along the lines of pushing down values here, although no browser does. In fact, it's very likely this can be rewritten in terms of the inline formatting command primitives, but it's not clear if it would be worth the added complexity.

To justify the selection to a string alignment (either "center", "justify", "left", or "right"):

Block-extend the active range, and let new range be the result.

No browser actually removes center, but it makes sense to do so.
Let element list be a list of all editable Elements contained in new range that either has an attribute in the HTML namespace whose local name is "align", or has a style attribute that sets "text-align", or is a center.
For each element in element list:

If element has an attribute in the HTML namespace whose local name is "align", remove that attribute.
Unset the CSS property "text-align" on element, if it's set by a style attribute.
If element is a div or span or center with no attributes, remove it, preserving its descendants.
If element is a center with one or more attributes, set the tag name of element to "div".

This could theoretically be necessary, like if it converted "
foo
bar" to "foo
bar". Now we need to select "foo
", nor just "foo".
Block-extend the active range, and let new range be the result.
Let node list be a list of nodes, initially empty.

Of tested browsers, only Chrome 13 dev seems to not apply the alignment to nodes that are already aligned. Even then, it does apply it if the alignment is just inherited from the root.
For each node node contained in new range, append node to node list if the last member of node list (if any) is not an ancestor of node; node is editable; node is an allowed child of "div"; and node's alignment value is not alignment.
While node list is not empty:

Let sublist be a list of nodes, initially empty.
Remove the first member of node list and append it to sublist.
While node list is not empty, and the first member of node list is the nextSibling of the last member of sublist, remove the first member of node list and append it to sublist.
Wrap sublist. Sibling criteria returns true for any div that has one or both of the following two attributes and no other attributes, and false otherwise:

An align attribute whose value is an ASCII case-insensitive match for alignment.
A style attribute which sets exactly one CSS property (including unrecognized or invalid attributes), which is "text-align", which is set to alignment.

As with inline formatting, I only ever create new elements, and don't ever modify existing ones. This doesn't match how any browser behaves in this case, but for inline formatting it matches everyone but Gecko's CSS mode, so I'm just being consistent.
New parent instructions are to call createElement("div") on the context object, then set its CSS property "text-align" to alignment and return the result.

Automatic linking

Bug 13807.
When the user inserts whitespace immediately following something that looks like a URL or e-mail address, we automatically run the createLink command on it.
An autolinkable URL is a string of the following form:

IE9 and LibreOffice 3.3.4 both have a whitelist of URL schemes. That would be complicated and involve political decisions, so instead, we'll just accept anything that looks like a hierarchical URL scheme. Google Docs is similar (as of November 9, 2011), but it's too lax, and allows characters in the scheme that can't be in a scheme. For non-hierarchical schemes, we just whitelist mailto:, since it's the only common one that makes sense to autolink.
Either a string matching the scheme pattern from RFC 3986 section 3.1 followed by the literal string ://, or the literal string mailto:; followed by

We don't try to enforce that the URL is anything resembling valid per spec. Too complicated for not enough gain.
Zero or more characters other than space characters; followed by

If the user types a URL followed by some punctuation, we still want to autolink, but we don't want to include the punctuation if it's probably not meant as part of the URL.
IE9 excludes !#&()*+,-.:;<=?@[]^_`{|}~ as trailing characters from both URLs and e-mails. A trailing " or > will prevent autolinking, and a trailing $%'/\ is included in the link.
LibreOffice 3.3.4 excludes trailing !”#'()*+,.:;<=>?[\]^_`{|}~, and prevents autolinking on $%&-@. It includes a trailing / in URLs, but it inhibits linking for e-mails.
Google Docs (as of November 9, 2011) is complicated. Trailing ”’,-. always prevents autolinking of a URL, and trailing #%/?_ is always included in a URL. Trailing !&=$()*+:;<>@[\]^`{|}~ prevent autolinking if there's no ? before them, but are included in the URL if there is a ?. For e-mails, _ is included, and everything else prevents autolinking.
None of these behaviors makes maximal sense. We should exclude characters if they're more likely as delimiters than actual trailing characters; include them if they're more likely as actual trailing characters; and prevent autolinking if their presence suggests that we're not actually looking at a link or e-mail address. The lists I made up for URLs are: exclude trailing !"'(),-.:;<>[]`{}, include anything else. For e-mail, exclude anything at all.

A character that is not one of the ASCII characters !"'(),-.:;<>[]`{}.

To autolink (node, end offset):

While (node, end offset)'s previous equivalent point is not null, set it to its previous equivalent point.
If node is not a Text node, or has an a ancestor, do nothing and abort these steps.
Let search be the largest substring of node's data whose end is end offset and that contains no space characters.
If some substring of search is an autolinkable URL:

While there is no substring of node's data ending at end offset that is an autolinkable URL, decrement end offset.
Let start offset be the start index of the longest substring of node's data that is an autolinkable URL ending at end offset.
Let href be the substring of node's data starting at start offset and ending at end offset.

Otherwise, if some substring of search is a valid e-mail address:

While there is no substring of node's data ending at end offset that is a valid e-mail address, decrement end offset.
Let start offset be the start index of the longest substring of node's data that is a valid e-mail address ending at end offset.
Let href be "mailto:" concatenated with the substring of node's data starting at start offset and ending at end offset.

Otherwise, do nothing and abort these steps.
Let original range be the active range.
Create a new range with start (node, start offset) and end (node, end offset), and set the context object's selection's range to it.
Take the action for "createLink", with value equal to href.
Set the context object's selection's range to original range.

The delete command

This is the same as hitting backspace (see Additional requirements). The easy part is if the selection isn't collapsed: just delete the selection. But it turns out rich-text editors have a lot of special behaviors for hitting backspace with a collapsed selection. Most obviously, if there's a text node right before the cursor (maybe wrapped in some inline elements), we delete its last character. But some of the special cases we run into are:

Invisible nodes are removed before anything else happens.
An gets removed if you backspace while the cursor is right after it, so the link disappears.
A or or gets deleted.
Backspacing at the start of most blocks merges with the previous block. (Visually, this is a matter of deleting a line break.)
Backspacing at the start of an indentation element, or an
or
or
that's at the beginning of a list, outdents the current block (rather than merging with the previous block).
Backspacing at the start of a table cell does nothing.
Backspacing immediately after a table selects the table, so a second backspace deletes it.
Backspacing at the start of a list item that's not at the beginning of a list merges with the previous list item, but keeps the contents on a separate line, so you have to hit backspace a second time to get them on the same line.

Preserves overrides
For all the deletions here, Firefox 7.0a2 will remove wrapper elements like only if they're selected, like {foo}. IE9, Chrome 14 dev, and Opera 11.50 will all remove them even if only their contents are selected, like [foo]. Gecko's behavior in the latter case leaves things like {} in the DOM, which is unhelpful, so I don't.
Action:

If the active range is not collapsed, delete the selection and return true.

Needed so that if there are multiple consecutive spaces we backspace over all at once.
Canonicalize whitespace at the active range's start.
Let node and offset be the active range's start node and offset.

First go up as high as possible within the current block, then drill down to the lowest possible level, in the hopes that we'll wind up at the end of a text node, or maybe in a br or hr.
Repeat the following steps:

If there's an invisible node somewhere, Firefox 5.0a2 removes that node and then stops, so each backspace removes one invisible node. All others remove the invisible node and then continue on looking for something visible to remove. The spec follows the latter behavior, since it makes more sense to the user. Of course, the definition of "invisible node" is not necessarily anything like the spec's.
If offset is zero and node's previousSibling is an editable invisible node, remove node's previousSibling from its parent.
Otherwise, if node has a child with index offset − 1 and that child is an editable invisible node, remove that child from node, then subtract one from offset.
Otherwise, if offset is zero and node is an inline node, or if node is an invisible node, set offset to the index of node, then set node to its parent.

When backspacing a link, Firefox 7.0a2, Chrome 14 dev, Opera 11.50, and OpenOffice.org 3.2.1 Ubuntu have no special behavior. IE9 and Word 2007 remove the link instead of deleting its last character. The latter behavior seems more useful and intuitive.
Otherwise, if node has a child with index offset − 1 and that child is an editable a, remove that child from node, preserving its descendants. Then return true.
Otherwise, if node has a child with index offset − 1 and that child is not a block node or a br or an img, set node to that child, then set offset to the length of node.
Otherwise, break from this loop.

At this point, node cannot be an invisible node. There are three cases:

offset is zero and node is a block node. Then we'll usually merge with the previous block if one exists.
offset is not zero, node is not a block node, and node does not have a child with index offset − 1. The only way this is possible is if node has a length greater than zero but no children, which implies it's a text or comment or PI. Comments and PIs are invisible nodes, so it must be a text node. We delete the previous character.
offset is not zero, and the child of node with index offset − 1 is a block node or a br or an img. Then we'll usually merge the offsetth child of node with the last descendant of the offset − 1st.

Unlike forwardDelete, there's no special case for diacritics. This means backspacing will just delete the last combining diacritic typed, or the whole character if it's precomposed. This matches everything I tested (IE9, Firefox 7.0a2, Chrome 14 dev, etc.).

If node is a Text node and offset is not zero, or if node is a block node that has a child with index offset − 1 and that child is a br or hr or img:

Call collapse(node, offset) on the context object's selection.
Call extend(node, offset − 1) on the context object's selection.
Delete the selection.
Return true.

At the time of this writing, this should be impossible. Just being safe.
If node is an inline node, return true.

If we're at the beginning of a list, we want to outdent the first list item. This doesn't actually match anyone or anything. Word 2007 and OpenOffice.org 3.2.1 Ubuntu just remove the list marker, which is weird and doesn't map well to HTML. Browsers tend to just merge with the preceding block, which isn't expected.
If node is an li or dt or dd and is the first child of its parent, and offset is zero:

Let items be a list of all lis that are ancestors of node.
Normalize sublists of each item in items.
Record the values of the one-node list consisting of node, and let values be the result.
Split the parent of the one-node list consisting of node.
Restore the values from values.

Annoying hack to prevent the dl from being re-added when fixing disallowed ancestors. In most cases we want a wrapper dl added, but in two cases (delete and insertParagraph) we're actually trying to outdent the list item. TODO: there might be a better way to do this.
If node is a dd or dt, and it is not an allowed child of any of its ancestors in the same editing host, set the tag name of node to the default single-line container name and let node be the result.
Fix disallowed ancestors of node.
Return true.

By this point, we're almost certainly going to merge something, and the only question is what.
Let start node equal node and let start offset equal offset.
Repeat the following steps:

If start offset is zero, set start offset to the index of start node and then set start node to its parent.
Otherwise, if start node has an editable invisible child with index start offset minus one, remove it from start node and subtract one from start offset.
Otherwise, break from this loop.

At the beginning of an indented block, outdent it, similar to a list item. Browsers don't do this, word processors do. Note: this copy-pastes from the outdent command action.
If offset is zero, and node has an editable inclusive ancestor in the same editing host that's an indentation element:

Block-extend the range whose start and end are both (node, 0), and let new range be the result.
Let node list be a list of nodes, initially empty.
For each node current node contained in new range, append current node to node list if the last member of node list (if any) is not an ancestor of current node, and current node is editable but has no editable descendants.
Outdent each node in node list.
Return true.

This is to avoid stripping a line break from
foo

[]bar

and similarly for
. We should just do nothing here.

If the child of start node with index start offset is a table, return true.

If you try backspacing into a table, select it. This doesn't match any browser; it matches the recommendation of the "behavior when typing in contentEditable elements" document. The idea is that then you can delete it with a second backspace.
If start node has a child with index start offset − 1, and that child is a table:

Call collapse(start node, start offset − 1) on the context object's selection.
Call extend(start node, start offset) on the context object's selection.
Return true.

Special case:
foo

[]bar
->
foo
[]bar

and likewise for
. But with we merge like in other cases:
foo
[]bar
->
foo
[]bar.

Browsers don't do this consistently. Firefox 5.0a2 doesn't seem to do it at all.

If offset is zero; and either the child of start node with index start offset minus one is an hr, or the child is a br whose previousSibling is either a br or not an inline node:

Call collapse(start node, start offset − 1) on the context object's selection.
Call extend(start node, start offset) on the context object's selection.
Delete the selection.
Call collapse(node, offset) on the selection.
Return true.

If you try backspacing out of a list item, merge it with the previous item, but add a line break. Then you have to backspace again if you really want them to be on the same line. This matches Word 2007 and OpenOffice.org 3.2.1 Ubuntu, and also matches "behavior when typing in contentEditable elements", but does not match any browser.
Note that this behavior is quite different from what happens if you actually select the linebreak in between the two lines. In that case, the blocks are merged as normal.
Also note that hitting backspace twice will merge with the previous item. This matches OO.org, but Word will outdent the item on subsequent backspaces. Word's behavior doesn't fit well with the way lists work in HTML, and we probably don't want it.

If the child of start node with index start offset is an li or dt or dd, and that child's firstChild is an inline node, and start offset is not zero:

Let previous item be the child of start node with index start offset minus one.

If the last child is already a br, we only need to append one extra br. Otherwise we need to append two, since the first will do nothing.
If previous item's lastChild is an inline node other than a br, call createElement("br") on the context object and append the result as the last child of previous item.
If previous item's lastChild is an inline node, call createElement("br") on the context object and append the result as the last child of previous item.

When merging adjacent list items, make sure we only merge the items themselves, not any block children. We want
foo
[]bar to become
foo
[]bar, not
foo []bar or
foo[]bar. To do the deletion, we need to wipe out the current selection, so we save it as a range. Saving it as a node/offset pair isn't enough, because it might be invalid after we do the deletion. A range will update according to the range mutation rules.
If start node's child with index start offset is an li or dt or dd, and that child's previousSibling is also an li or dt or dd:

Call cloneRange() on the active range, and let original range be the result.
Set start node to its child with index start offset − 1.
Set start offset to start node's length.
Set node to start node's nextSibling.
Call collapse(start node, start offset) on the context object's selection.
Call extend(node, 0) on the context object's selection.
Delete the selection.
Call removeAllRanges() on the context object's selection.
Call addRange(original range) on the context object's selection.
Return true.

General block-merging case.
While start node has a child with index start offset minus one:

If start node's child with index start offset minus one is editable and invisible, remove it from start node, then subtract one from start offset.
Otherwise, set start node to its child with index start offset minus one, then set start offset to the length of start node.

Call collapse(start node, start offset) on the context object's selection.
Call extend(node, offset) on the context object's selection.
Delete the selection, with direction "backward".
Return true.

The formatBlock command

This command lets you change what block element particular lines are wrapped in. It will convert an existing wrapper if one exists, and otherwise will create a new one.
A formattable block name is "address", "dd", "div", "dt", "h1", "h2", "h3", "h4", "h5", "h6", "p", or "pre".

Tested browser versions: IE9, Firefox 4.0, Chrome 13 dev, Opera 11.10.
Firefox and Chrome will replace a
by a
or other given tag. IE and Opera will nest the
inside instead. The latter makes more sense, given that a) we don't support formatBlock with
and b)
s are logically different, since they can contain many lines.
Firefox will not convert other tags like
to
, it will only wrap unwrapped lines in a
. Firefox also won't replace
by things like
, it will nest the
inside. The spec follows other browsers.
If you try to convert a
to a
or
or such, Firefox breaks out of the
entirely, leaving ...

. Chrome will convert a
or
to the given element, leaving a
or
or such as the child of a
. I follow IE/Opera, which only affect the contents of
/
(Firefox behaves this way for
as well, just not
). This means you can get invalid DOMs like
foo
, but they can be serialized as text/html, so I'm not too fussy.
When it comes to
, IE/Opera behave like with
/
, which is how I behave too. Firefox apparently refuses to do anything. Chrome tries to wrap the parent list element, breaking it up if only some of the children are selected; this produces unserializable DOMs if you're wrapping with
.
When you're converting multiple blocks at once, Chrome replaces them all by one block with
stuck in, like
foo
bar
->
foo
bar
. It wipes out intervening block containers too in some cases. This might make sense for
//
, but other browsers don't do it.

Preserves overrides
Action:

IE9 requires the brackets. If they're not provided, it does nothing.
If value begins with a "<" character and ends with a ">" character, remove the first and last characters from it.
Let value be converted to ASCII lowercase.

Opera 11.10 throws NOT_SUPPORTED_ERR for bad elements, all other tested browsers ignore the input. Testing in IE9, Firefox 4.0, Chrome 13 dev, and Opera 11.10, supported elements seem to be:

Everyone
address, div, h*, p, pre
Everyone but IE
blockquote
Everyone but Opera
dd, dt
IE only
dir, menu, ol, ul
Firefox and Chrome only
dl
Chrome only
article, aside, footer, header, hgroup, nav, section

HTML5 as of May 2011 supports: address, article, aside, blockquote, div, footer, h*, header, hgroup, nav, p, pre, section, which exactly matches Chrome except minus dd/dt/dl.
See mailing list discussion on the subject.

If value is not a formattable block name, return false.
Block-extend the active range, and let new range be the result.
Let node list be an empty list of nodes.
For each node node contained in new range, append node to node list if it is editable, the last member of original node list (if any) is not an ancestor of node, node is either a non-list single-line container or an allowed child of "p" or a dd or dt, and node is not the ancestor of a prohibited paragraph child.
Record the values of node list, and let values be the result.

This tries to avoid misnesting if only some lines of an element are selected, so
[foo]
bar
becomes
[foo]
bar
instead of
[foo]

bar
or such. It tries to heuristically distinguish between divs used as line-breakers and divs used as actual wrappers by checking if they have prohibited paragraph children as descendants. It works for address too, in case there are paragraphs nested inside. Thus
[foo]
bar
becomes
[foo]
bar
, but
[foo]
bar
becomes
[foo]
bar
. Likewise, we don't break things out of lists or tables or such if they happen to be nested in a
.
For each node in node list, while node is the descendant of an editable HTML element in the same editing host, whose local name is a formattable block name, and which is not the ancestor of a prohibited paragraph child, split the parent of the one-node list consisting of node.
Restore the values from values.

We have two different behaviors, one for div and p and one for everything else. The basic difference is that for div and p, we assume that it should be one line per element, while for other elements, we put in multiple lines separated by
. So if you do formatBlock to p on
foo
bar

or
foo
bar

you get
foo
bar

but formatBlock to h1 will get you
foo
bar
.

IE9 will just change the elements as they are, so it gives
foo
bar
and
foo
bar
for
foo
bar
, but
foo
bar
and
foo
bar
for foo
bar. This is unreasonable, because the two possible inputs here look identical to the user and might have been produced by identical user input.
Firefox 5.0a2 will give results like
foo
bar
or
foo
bar
no matter what (modulo oddities in its handling of divs). Opera 11.10 is similar, except it leaves a trailing
in the first element.
Chrome 13 dev will give results like
foo
bar
or
foo
bar
no matter what.
The specced behavior is a compromise between the existing behaviors, predicated on the fact that
foo
bar
almost never makes sense, and
foo
bar
isn't usually what's wanted either.

While node list is not empty:

If the first member of node list is a single-line container:

If you try to format a single-line container with no children, IE10PP2 inserts an nbsp before formatting. (It uses nbsp instead of
to make blocks not collapse, so the equivalent for us would be to insert a
.) Firefox 7.0a2 and Opera 11.50 make the element disappear. Chrome 14 dev leaves it alone and doesn't format it. I follow Firefox/Opera just because it's the simplest given how I happen to have written the spec, and it's a corner case, so exact behavior isn't important.
For blocks that contain only a collapsed whitespace node, IE10PP2 and Firefox 7.0a2 convert them like normal. Chrome 14 dev and Opera 11.50 leave it alone and don't format it. I go with the majority, which is again simpler to spec.

Let sublist be the children of the first member of node list.
Record the values of sublist, and let values be the result.
Remove the first member of node list from its parent, preserving its descendants.
Restore the values from values.
Remove the first member from node list.

Otherwise:

Let sublist be an empty list of nodes.
Remove the first member of node list and append it to sublist.
While node list is not empty, and the first member of node list is the nextSibling of the last member of sublist, and the first member of node list is not a single-line container, and the last member of sublist is not a br, remove the first member of node list and append it to sublist.

Wrap sublist. If value is "div" or "p", sibling criteria returns false; otherwise it returns true for an HTML element with local name value and no attributes, and false otherwise. New parent instructions return the result of running createElement(value) on the context object. Then fix disallowed ancestors of the result.

Return true.

Firefox 6.0a2 throws, Chrome 14 dev always returns false, Opera 11.11 doesn't support indeterm to start with, IE9 was uncooperative in testing so I'm not sure what it does. I'm speccing it just because it makes sense.
Indeterminate:

If the active range is null, return the empty string.
Block-extend the active range, and let new range be the result.
Let node list be all visible editable nodes that are contained in new range and have no children.
If node list is empty, return false.
Let type be null.
For each node in node list:

While node's parent is editable and in the same editing host as node, and node is not an HTML element whose local name is a formattable block name, set node to its parent.
Let current type be the empty string.
If node is an editable HTML element whose local name is a formattable block name, and node is not the ancestor of a prohibited paragraph child, set current type to node's local name.
If type is null, set type to current type.
Otherwise, if type does not equal current type, return true.

Return false.

IE9 returns human-readable strings like "Normal" (p/div/etc.), "Formatted" (pre), "Heading 1" (h1), etc. Firefox 6.0a2 and Chrome 14 dev both return the appropriate tag name in lowercase, or the empty string if there is no appropriate tag. Opera 11.11 behaves the same, but with uppercase.
IE9 looks like it recognizes address, h*, pre, dd, dt, ol, ul, and dir, with everything else registering as "Normal". Firefox 6.0a2 recognizes only the arguments it accepts for formatBlock, namely address, h*, p, and pre. Chrome 14 dev recognizes address, div, h*, dd, dl, dt, p, pre plus lots of random other stuff like blockquote and section. I'll go with everything that execCommand("formatblock") accepts as an argument, which at the time of this writing means what Firefox supports plus div.

Value:

If the active range is null, return the empty string.
Block-extend the active range, and let new range be the result.
Let node be the first visible editable node that is contained in new range and has no children. If there is no such node, return the empty string.

Opera 11.11 doesn't require it be editable, so it will return "DIV" instead of "" for
foo
.
While node's parent is editable and in the same editing host as node, and node is not an HTML element whose local name is a formattable block name, set node to its parent.

Chrome 14 dev will report "div" for
foo
or such. Opera 11.11 reports "". IE and Firefox didn't cooperate with testing. Opera makes more sense, and matches the fact that formatBlock now doesn't recognize such a div as a formatBlock candidate, so Opera it is.
We don't really need to specify "editable" here, since it has to be editable if we got to this point.

If node is an editable HTML element whose local name is a formattable block name, and node is not the ancestor of a prohibited paragraph child, return node's local name, converted to ASCII lowercase.
Return the empty string.

The forwardDelete command

This is the same as hitting the delete key (see Additional requirements). It behaves much the same as the delete command, except of course backwards. Also, some of the special cases for backspacing don't apply, as noted in the comments. The one special case you get when deleting forward but not backward is that if the cursor is before a grapheme cluster that consists of multiple characters, like a base character with combining diacritics, we delete the diacritics too. (Backspacing just deletes the last diacritic, so you have to backspace several times to remove the whole cluster.)
Preserves overrides
Copy-pasted from delete, see there for comments.
Action:

If the active range is not collapsed, delete the selection and return true.
Canonicalize whitespace at the active range's start.
Let node and offset be the active range's start node and offset.
Repeat the following steps:

If offset is the length of node and node's nextSibling is an editable invisible node, remove node's nextSibling from its parent.
Otherwise, if node has a child with index offset and that child is an editable invisible node, remove that child from node.
Otherwise, if offset is the length of node and node is an inline node, or if node is invisible, set offset to one plus the index of node, then set node to its parent.

No special link behavior for forwardDelete here, unlike delete.
Otherwise, if node has a child with index offset and that child is neither a block node nor a br nor an img nor a collapsed block prop, set node to that child, then set offset to zero.
Otherwise, break from this loop.

If node is a Text node and offset is not node's length:

Let end offset be offset plus one.

Firefox 7.0a2, Chrome 14 dev, Word 2007, and OpenOffice.org 3.2.1 Ubuntu act as the spec says, getting rid of all diacritics on forward delete. IE9 and Opera 11.50 have no special case and just delete the next character. I go with Firefox/Chrome/Word/OO.
However, when I actually type in the text box as opposed to running semi-automated tests, IE9 has magical behavior: it replaces the base character with something that looks like ◌ U+25CC DOTTED CIRCLE. Further strikes of the delete key remove the diacritics, and the circle vanishes along with the last of them. I wasn't able to get it to actually replace the base character, so I'm not sure what the point is. The circle doesn't seem to appear in the DOM, and apparently it disappears in some circumstances. This might be worth standardizing somehow, I don't know.
TODO: The way we remove diacritics is probably not right. We probably want to normalize to grapheme cluster boundaries, using UAX#29 or something. We also need to handle non-BMP stuff. The idea is that if the cursor is before a character that precedes a combining mark, you need to delete the combining mark too.

While end offset is not node's length and the end offsetth code unit of node's data has general category M when interpreted as a Unicode code point, add one to end offset.
Call collapse(node, offset) on the context object's selection.
Call extend(node, end offset) on the context object's selection.
Delete the selection.
Return true.

If node is an inline node, return true.
If node has a child with index offset and that child is a br or hr or img, but is not a collapsed block prop:

Call collapse(node, offset) on the context object's selection.
Call extend(node, offset + 1) on the context object's selection.
Delete the selection.
Return true.

No special list-item behavior for forwardDelete here, unlike delete.
Let end node equal node and let end offset equal offset.
If end node has a child with index end offset, and that child is a collapsed block prop, add one to end offset.
Repeat the following steps:

If end offset is the length of end node, set end offset to one plus the index of end node and then set end node to its parent.
Otherwise, if end node has an editable invisible child with index end offset, remove it from end node.
Otherwise, break from this loop.

No special indentation element behavior for forwardDelete here, unlike delete.
If the child of end node with index end offset minus one is a table, return true.
If the child of end node with index end offset is a table:

Call collapse(end node, end offset) on the context object's selection.
Call extend(end node, end offset + 1) on the context object's selection.
Return true.

Note, any br will do here: a br immediately after a block is always significant.
If offset is the length of node, and the child of end node with index end offset is an hr or br:

Call collapse(end node, end offset) on the context object's selection.
Call extend(end node, end offset + 1) on the context object's selection.
Delete the selection.
Call collapse(node, offset) on the selection.
Return true.

No special list-item behavior for forwardDelete here, unlike delete.
While end node has a child with index end offset:

If end node's child with index end offset is editable and invisible, remove it from end node.
Otherwise, set end node to its child with index end offset and set end offset to zero.

Call collapse(node, offset) on the context object's selection.
Call extend(end node, end offset) on the context object's selection.
Delete the selection.

The indent command

IE9
Outputs
, or when surrounding RTL blocks,
. The direction seems to go by the end of the selection. The presence of the dir attribute means that any contents that were inheriting a different dir from an ancestor get their direction changed as a side effect, but if they actually have the opposite dir specified, they won't appear to be indented. It doesn't reset top or bottom margins on the blockquote, so it adds them. If it's not wrapping a block element, like if it's only wrapping up until a
, it adds a
.
Firefox 4.0
In styleWithCSS mode, adds style="margin-left: 40px" to the appropriate block container (or margin-right if it's RTL). If there's no appropriate block container, adds a div. If multiple blocks are affected, it goes by the direction of the block whose style it's changing, which winds up being wrong for descendants with different direction. In non-styleWithCSS mode, uses
, so it indents on both sides and also adds top/bottom margins.
Chrome 12 dev
Outputs
in both modes for both LTR and RTL (which is broken for RTL, since it indents only on the left).
Opera 11.00
Outputs
, so it indents on both sides and on the top/bottom.

For repeated indentation, everyone except Opera that outputs
s just puts them at the outermost possible location, which works well. Opera puts them in the innermost position, which is broken, because it will even put them inside
(which will not round-trip through text/html serialization).
Gecko in CSS mode messes up by adding margins even to things like
that already have margins from CSS rules, instead of nesting a div, so it doesn't actually increase the indentation. However, if an element has an explicit left margin (assuming LTR), it will increase the margin to 80px, so it works with WebKit's blockquotes.
We have two strategies for handling directionality: always indent on both sides (Firefox non-CSS, Opera) or try to figure out heuristically which side we want (IE, Firefox CSS). The latter approach is only possible by adding extra markup and complexity, so for now we'll take the easy way out and go with just indenting on both sides.
This reasoning doesn't discuss lists. For research on lists, see the comment for insertOrderedList. List handling is more complicated and I wound up differing from all browsers in lots of ways.

Preserves overrides
Action:

Let items be a list of all lis that are inclusive ancestors of the active range's start and/or end node.

TODO: This overnormalizes, but it seems like the simplest solution for now.
For each item in items, normalize sublists of item.
Block-extend the active range, and let new range be the result.
Let node list be a list of nodes, initially empty.
For each node node contained in new range, if node is editable and is an allowed child of "div" or "ol" and if the last member of node list (if any) is not an ancestor of node, append node to node list.

Without this step, the last child of the previous sibling might be a list, which the li wouldn't get appended to.
If the first visible member of node list is an li whose parent is an ol or ul:

Let sibling be node list's first visible member's previousSibling.
While sibling is invisible, set sibling to its previousSibling.
If sibling is an li, normalize sublists of sibling.

While node list is not empty:

Let sublist be a list of nodes, initially empty.
Remove the first member of node list and append it to sublist.
While the first member of node list is the nextSibling of the last member of sublist, remove the first member of node list and append it to sublist.
Indent sublist.

The insertHorizontalRule command

Preserves overrides
You'd think interop here would be simple, right? Nope: we have three different behaviors across four browsers. Opera 11.00 is the only one that acts more or less like the spec. IE9 and Chrome 12 dev treat the value as an id, which is weird and probably useless, so I don't do it. Firefox 4.0 produces
instead of
, which is also weird and almost definitely useless, so I don't do it. Then you have the varying behavior in splitting up parents to ensure validity . . .
Action:

Let start node, start offset, end node, and end offset be the active range's start and end nodes and offsets.
While start offset is 0 and start node's parent is not null, set start offset to start node's index, then set start node to its parent.
While end offset is end node's length, and end node's parent is not null, set end offset to one plus end node's index, then set end node to its parent.
Call collapse(start node, start offset) on the context object's selection.
Call extend(end node, end offset) on the context object's selection.
Delete the selection, with block merging false.
If the active range's start node is neither editable nor an editing host, return true.

We don't want to call insertNode at the start or end of a text node, because that will leave an empty text node.
If the active range's start node is a Text node and its start offset is zero, call collapse() on the context object's selection, with first argument the active range's start node's parent and second argument the active range's start node's index.
If the active range's start node is a Text node and its start offset is the length of its start node, call collapse() on the context object's selection, with first argument the active range's start node's parent, and the second argument one plus the active range's start node's index.
Let hr be the result of calling createElement("hr") on the context object.
Run insertNode(hr) on the active range.

IE9 and Chrome 13 dev seem to never break up any ancestors, which can lead to unserializable DOMs like
inside
. Opera 11.11 seems to always break up parents going all the way up to the contenteditable root, even ones like
that can contain
. Firefox 5.0a2 acts the most sensibly: it only breaks up things like
or that shouldn't contain
. The spec goes with Firefox here (although the list of what to break up isn't precisely identical).
Fix disallowed ancestors of hr.
Run collapse() on the context object's selection, with first argument hr's parent and the second argument equal to one plus hr's index.
Return true.

The insertHTML command

Preserves overrides

Not supported by IE9. Handling of disallowed children is interesting:

Firefox 5.0a2
Will allow
inside
(doesn't serialize). If you try inserting dir/ol/ul inside an existing dir/ol/ul, it will strip the list element and leave only the li's, so inserting
abc
into
f[o]o
creates
f
abc
o
.
/
/
that don't descend from a list will be left alone, not converted to
. Empty elements seem not to be inserted.
will get put inside
, which breaks serialization. Nothing is allowed inside

We skip non-editable nodes.

IE9: Allows everything to be modified by execCommand(), regardless of whether it's editable.
Firefox 4.0: Ignores execCommand() if the start and end of the selection are not both editable. If the start and end are editable but something in the middle is not, seems to relocate the non-editable part in the middle or something like that.
Chrome 12 dev: Ignores execCommand() if the start and end of the selection are not both editable. If the start and end are editable but something in the middle is not, applies the given command but skips the non-editable parts. But the state doesn't ignore the non-editable parts, so if you bold such a selection you can't unbold it, for instance, since the middle part will remain bold (so it will keep on trying to bold it instead of switching to unbold).
Opera 11.00: Ignores execCommand() if the start and end of the selection are not both editable. If the start and end are editable but something in the middle is not, applies the command to everything, even the non-editable part.

I chose to go with the non-IE behavior, per discussion. Ignoring non-editable things is convenient for the common use-case of an editor, where you don't want the user to bold random parts of the UI when they hit the bold button. For cases where it's not desired, you can always turn designMode on briefly before using execCommand(), so the non-IE behavior is a lot easier to work around than the IE behavior.

I don't see the value in ever just ignoring execCommand(). If the start and end are not editable, I'm going to say you should still style any editable nodes in between. I'm also going to ignore non-editable nodes for the purposes of determining state, so (for instance) if all the editable nodes are bolded, it will unbold instead of bolding.

We have three behaviors to choose from for this one:

Chrome 11 dev and IE 9 RC treat it the same as hiliteColor (although IE 9 RC doesn't support hiliteColor itself).
Firefox 4 in non-CSS mode sets the bgcolor of the nearest td or body, or something like that. In testing, it seems to jump out of contenteditable elements to style non-editable ancestors, which is alarming.
Firefox 4 in CSS mode and Opera 11 set the background of the nearest block container, although it doesn't seem to be very dependable (probably I just don't get what exactly it's doing).

(1) is obviously redundant, but has plurality support, so we could spec it that way if the other ways were useless.

(3) is incoherent from a user perspective. For instance, if you try it on paragraphs the background will have big gaps where the margins are. If you try it on an inline element that's a child of the editing host, it will do nothing or apply the background to everything or such, even though such an inline element is visually indistinguishable from one sitting inside a div. This would only make sense if we take considerable effort to ensure that block elements all have no margins, or if we wrap things in a div if they have margins, or something like that.

That leaves (2). That might be useful if it actually set the document's background color, but it seems like it sets table cell backgrounds sometimes instead, which is really confusing.

The path of least resistance is to standardize this as meaning the same thing as hiliteColor, and make up new commands if we want to do things like set the document background color. See hiliteColor for comments.

The cutoff of 600 matches Chrome 14 dev. The cutoff used by IE9 and Firefox 6.0a2 seems to be 500, and the distinction isn't relevant for Opera 11.11 (it doesn't use CSS here at all AFAICT). On my test systems with default fonts, Chrome 14 dev displays 700 and up as bold, while the other three display 600 and up as bold.
Thus in Chrome on my system, the bold command will behave a bit oddly the first time you hit it if there's anything in the range with font-weight: 600, but it will look right in other browsers. On the other hand, if I followed IE/Firefox, it would look wrong on all my browsers for font-weight: 500.
700 actually makes more sense: then you'd view 100-300 as light, 400-600 as medium, 700-900 as bold. But that's not how it seems to work in browsers, so I'll go with 600 as the cutoff.

If the selection doesn't contain anything (meaning, e.g., deleteContents() doesn't change anything), then Chrome 12 dev inserts a link at the selection start, with the text equal to the link URL. Other browsers don't do it, so I don't either.

IE10PP2, Firefox 7.0a2, Chrome 14 dev, and Opera 11.50 all do not support indeterminate, state, or value for createLink or unlink. I previously defined indeterminate and value anyway because they make sense, but then undefined them. The nontrivial thing is what value to return if there's no link, since any string can occur as a link href, in principle.

What are the use-cases for indeterm, state, or value for createLink/unlink?

There are three approaches here. For instance, if you ask browsers to create a link to "http://example.org" on the "b" here:

Abc

Chrome 10 dev produces:

Firefox 4b11 produces (roughly):

(This doesn't round-trip through text/html serialization.) IE 9 RC and Opera 11 produce simply:

The last behavior probably best matches user expectations. If you happen to miss out a character when selecting the link you want to change, do you really intend to only change the link of part of it?

UAs differ a bit in the details here:

IE 9 RC: Empty string sets
Firefox 4b11: Empty string does nothing
Chrome 11 dev: Empty string does nothing, '"monospace"' same as 'monospace' (i.e., cannot escape font-family keywords because quotes are stripped, clearly wrong)
Opera 11: Empty string sets

Setting an empty font-family has the effect of inheriting the font from the parent (although I don't see where the February 24, 2011 CSS 3 Fonts draft says that). Thus it makes sense that if we special-case this, it should be to unset the font somehow.

Special-casing the empty string to do nothing doesn't make sense to me. With createLink we'd expect the user to enter the URL themselves, so it makes sense to special-case clicking OK without entering anything. But here it's very likely that the font list will be fixed by the author (how many users will understand CSS font-family syntax?), so I don't think such usability concerns apply.

The value is complicated.

IE 9 RC: Always the empty string. Not very useful.
Firefox 4b11: Confusing. Sometimes it returns generic family names, like "sans-serif". Sometimes it gives specific font names, like "tt" when the font is specified as "monospace". Sometimes it gives the literal font-family string. Not sure what it's doing here.
Chrome 11 dev: Gives the literal value of font-family, except if it's inherited from default values (no explicit style declarations anywhere), when it seems to return the exact font name.
Opera 11: Returns the literal value of font-family, except if it's inherited from default values, when it returns the empty string.

I'm just going to punt on this and say it should be the resolved value of font-family. I'll leave CSSOM to decide what that means if there are no applicable style rules.

IE 9: Parses the value as a number (allowing floating-point), rounds to the nearest integer, then clamps to the range 1 to 7. If the value is not a valid number, including if it has trailing characters (like "2em"), does nothing. Normalizes relative sizes, so "+0" is the same as "+3", etc. Treats empty string the same as "1".
Firefox 4.0: Passes the value through literally to , so "2em" gets you . Always uses , even with styleWithCss true. Ignores the command if the value is the empty string.
Chrome 12 dev: Parses the value as a legacy font size, so "2em" becomes "2", then outputs a with the resulting number. If there is no resulting number, like for a value of "xx-small", does nothing. In styleWithCss mode, outputs a span with corresponding CSS keywords: 1 = x-small, 2 = small, . . ., 6 = xx-large, 7 = -webkit-xxx-large. Normalizes relative sizes, so "+0" is the same as "3", etc. Ignores the command if the value is the empty string.
Opera 11: Parses the value as an integer (ignoring floating-point as trailing characters), then outputs that. This means that "+0" becomes instead of or . Non-numeric values get interpreted as 0. Does not clamp, and is willing to output negative numbers. Treats empty string as "0".

What all of these have in common is that they force the author to deal with legacy font values and don't let them use CSS. This is undesirable, but to avoid it we'd really have to create a new command. If nothing else, the value returned by queryCommandValue() has to be numeric, so authors can't really use the command sanely no matter what we do. See bug 14251.

Note that 1 is the same size as x-small in browsers, not xx-small, contrary to the CSS Fonts spec.

IE9: Seems to return a number based on the computed font-size, but only if it's exactly right, otherwise it returns null. Something like that.
Firefox 6.0a2: Seemingly goes up to the nearest ancestor that's a and returns the literal value of that attribute, or "" if there's no such ancestor.
Chrome 14 dev: Gets the computed font-size in pixels, and rounds to the nearest equivalent, rounding up in the event of a tie. Except that if it's small enough, it returns "0", which doesn't make sense because that behaves the same as "1".
Opera 11.11: Like Firefox, except it returns "3" if there's no ancestor, and it converts relative values to absolute ("+1" -> "4").

Chrome's behavior seems the most useful. As usual, IE returns a variable type and all other browsers return strings, and we follow other browsers.

If the selection isn't someplace editable, Chrome works like usual; some other browsers behave differently. I see no reason to behave differently.

Color interpretations:

                        IE10PP2       Firefox 7.0a2            Chrome 14 dev            Opera 11.50
blue                    blue          blue                     #0000ff                  #0000ff
f                       #f            -                        -                        #f00000
#f                      #f            -                        -                        #f00000
00f                     #00f          -                        #0000ff                  #00000f
#00f                    #00f          rgb(0, 0, 255)           #0000ff                  #00000f
0000ff                  #0000ff       -                        #0000ff                  #0000ff
#0000ff                 #0000ff       rgb(0, 0, 255)           #0000ff                  #0000ff
000000fff               #0000ff       -                        -                        -
#000000fff              #0000ff       -                        -                        -
rgb(0, 0, 255)          rgb(0,0,255)  rgb(0, 0, 255)           #0000ff                  #00b000
rgb(0%, 0%, 100%)       rgb(0,0,255)  rgb(0, 0, 255)           #0000ff                  #00b000
rgb( 0 ,0 ,255)         rgb(0,0,255)  rgb(0, 0, 255)           #0000ff                  #00b000
rgba(0, 0, 255, 0.0)    #ba0000       rgba(0, 0, 255, 0)       rgba(0, 0, 255, 0)       #00ba00
rgb(15, -10, 375)       rgb(15,0,255) rgb(15, 0, 255)          #0f00ff                  #00b015
rgba(0, 0, 0, 1)        #ba0010       rgb(0, 0, 0)             -                        #00ba00
rgba(255, 255, 255, 1)  #000055       rgb(255, 255, 255)       #ffffff                  #00ba02
rgba(0, 0, 255, 0.5)    #ba0000       rgba(0, 0, 255, 0.5)     rgba(0, 0, 255, 0.5)     #00ba00
hsl(240, 100%, 50%)     #000150       rgb(0, 0, 255)           #0000ff                  #000024
cornsilk                cornsilk      cornsilk                 #fff8dc                  #fff8dc
potato quiche           #0000c0       -                        -                        #000a00
transparent             transparent   -                        rgba(0, 0, 0, 0)         #00a000
currentColor            #c0e000       currentcolor             rgba(0, 0, 0, 0)         #c000e0

The interpretations given for Firefox are only in styleWithCSS mode. In non-styleWithCSS mode, it just outputs the string literally as the attribute value, which can lead to different results. The given output for Chrome is for ; the output in styleWithCSS mode is the same, but rgb() is used instead of hex notation, and "transparent" and "currentcolor" are passed through under those names. IE and Opera only support to begin with.

Conclusions:

Everyone accepts simple color keywords and #xxxxxx notation.

Opera mangles #xxx, but everyone else handles it fine.

The leading # is optional in all browsers but Gecko.

rgb() is accepted by everyone but Opera.

rgba() and hsl() are accepted by Gecko and WebKit, but rejected by IE and Opera.

IE and Opera mangle unrecognized stuff, Gecko and WebKit ignore.

Browsers will happily output stuff like "transparent" and "rgba()" into even though it won't be uniformly accepted there.

Opera and WebKit normalize the output color very aggressively, Gecko leaves keywords intact but otherwise normalizes for CSS output (but doesn't normalize at all for ), and IE normalizes inconsistently.

What I'm going to say is that it either has to be a valid CSS color, or prefixing it with # must result in a valid CSS color. For , I'll say that the output color should be normalized to #xxxxxx form. If the color is not a simple color (fully opaque with all channels between 0 and 255), I'll force style="" even if styleWithCSS mode is off. Some of this disagrees with all browsers, but it's unlikely to hurt and it makes sense.

Opera 11 seems to return true for the state if there's some color style applied, false otherwise, which seems fairly useless; authors want to use value here, not state. So I'll match other browsers and not define any state.

For value, the spec essentially matches Firefox 6.0a2 and Chrome 14 dev, as far as how to decide what color the node has. IE9 seems to always return the number 0 for some bizarre reason. There are some cases where Firefox returns the empty string for some reason, and it seems to select the active node a little differently. Opera uses #xxxxxx format for getComputedStyle() but rgb() here, and also drops the transparent part of the color if there is any.

	ol indeterm	ol state	ul indeterm	ul state
ol	false	true	false	false
ul	false	false	false	true
mixed	true	false	true	false
mixed ol	true	false	false	false
mixed ul	false	false	true	false
none	false	false	false	false

HTML Editing APIs

Work in Progress — Last Update 13 February 2014

Status of this Document

Table of contents

Introduction

Tests

General remarks

Command development tests

Command conformance tests

The backColor command

The bold command

The createLink command

The fontName command

The fontSize command

The foreColor command

The hiliteColor command

The italic command

The removeFormat command

The strikethrough command

The subscript command

The superscript command

The underline command

The unlink command

Block formatting commands

Block formatting command definitions

Assorted block formatting command algorithms

Block-extending a range

Recording and restoring overrides

Deleting the selection

Splitting a node list's parent

Canonical space sequences

Indenting and outdenting

Toggling lists

Justifying the selection

Automatic linking

The delete command

The formatBlock command

[foo]bar

bar

[foo]bar

foobar

foo

bar

foobar

foo

bar

foobar

foo

bar

The forwardDelete command

The indent command

The insertHorizontalRule command

The insertHTML command

The `backColor` command

The `bold` command

The `createLink` command

The `fontName` command

The `fontSize` command

The `foreColor` command

The `hiliteColor` command

The `italic` command

The `removeFormat` command

The `strikethrough` command

The `subscript` command

The `superscript` command

The `underline` command

The `unlink` command

The `delete` command

The `formatBlock` command

[foo]
bar

[foo]

bar

foo
bar

foo
bar

foo
bar

The `forwardDelete` command

The `indent` command

The `insertHorizontalRule` command

The `insertHTML` command