Does the space character in the HTML Living Standard mean only the ASCII space character?
I know that the HTML4 specification defines whitespace as follows:
9.1 White space
The document character set includes a wide variety of white space characters. Many of these are typographic elements used in some applications to produce particular visual spacing effects. In HTML, only the following characters are defined as white space characters:
- ASCII space (&#x0020;)
- ASCII tab (&#x0009;)
- ASCII form feed (&#x000C;)
- Zero-width space (&#x200B;)
As a result, HTML4 treats every white space character other than those listed above in the same way as an ordinary character. This means that consecutive U+0020 characters are collapsed into one, whereas consecutive U+2009 (thin space) characters are not collapsed and stay as they are.
<h2>U+0020 is combined</h2>
<p>this is loooooooooooo     ng text</p>
<h2>U+2009 (white space that is out of definition) is not combined</h2>
<p>this is loooooooooooo&#x2009;&#x2009;&#x2009;&#x2009;&#x2009;ng text</p>
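To make my understanding concrete, here is a rough Python sketch of it. It is only a simplified model under my own assumptions, not how a browser actually processes white space: the names `HTML4_WHITE_SPACE`, `is_html4_white_space`, and `collapse_ascii_spaces` are hypothetical, line breaks are left out, and collapsing is reduced to runs of U+0020.

```python
import re

# Characters that HTML4 section 9.1 lists as white space.
# Assumption: line breaks are handled separately and are omitted here.
HTML4_WHITE_SPACE = {
    "\u0020",  # ASCII space
    "\u0009",  # ASCII tab
    "\u000C",  # ASCII form feed
    "\u200B",  # zero-width space
}


def is_html4_white_space(ch: str) -> bool:
    """Return True if ch is in the HTML4 white space list above."""
    return ch in HTML4_WHITE_SPACE


def collapse_ascii_spaces(text: str) -> str:
    """Collapse each run of two or more U+0020 into a single U+0020.

    Simplified model: every other character, including U+2009,
    is passed through unchanged.
    """
    return re.sub("\u0020{2,}", "\u0020", text)


ascii_run = "loooooooooooo" + "\u0020" * 5 + "ng"
thin_run = "loooooooooooo" + "\u2009" * 5 + "ng"

print(collapse_ascii_spaces(ascii_run) == "loooooooooooo ng")  # True
print(collapse_ascii_spaces(thin_run) == thin_run)             # True
print(is_html4_white_space("\u2009"))                          # False
```

In this model U+2009 is never recognized as white space, so the run of thin spaces survives, which matches what I see in the second paragraph above.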
I searched the WHATWG HTML Living Standard for a passage corresponding to this HTML4 definition of white space, but I couldn't find one. Where is white space defined in the HTML Living Standard?
I read the following articles, but they did not answer my question.