Skip to content

Is there a handy list of all characters considered (unicode) whitespace by NestedText? #56

Answered by KenKundert
AndydeCleyre asked this question in Q&A
Discussion options

You must be logged in to vote

If I recall correctly, there are basically three places where the language reference mentions white space.

  1. it limits the indentation to consist only of ASCII space characters
  2. it limits the tags to :, -, or > when followed by an ASCII space or a newline.
  3. in inline lists and dictionaries the whitespace can be any non-line breaking whitespace character. So that includes the ASCII space and tab characters and the unicode whitespace characters, which are defined here

My python implementation uses Pythons str.strip() function to remove whitespace, and I am pretty sure it just removes anything that is marked as whitespace in the Unicode standard.

The essential ones are space and tab. I suspect…

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
1 reply
@AndydeCleyre
Comment options

Answer selected by AndydeCleyre
Comment options

You must be logged in to vote
2 replies
@AndydeCleyre
Comment options

@KenKundert
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants