doc: properly handle preformatted blocks #8242

whitslack · 2025-04-19T23:59:43Z

Lowdown requires a blank line before all preformatted blocks, or it doesn't recognize them. tools/md2man.sh contained some ad-hoc efforts at fixing up some locations where these required blank lines are absent from the output of tools/fromschema.py, but it missed some. Instead of playing Whack-a-Mole, use a blanket sed expression to ensure that a blank line precedes every opening ```.
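The blanket rule described above can be illustrated in Python. This is only a sketch of the idea, not the sed expression from the PR (which is not shown here); the function name `ensure_blank_before_fences` is hypothetical, and it assumes fences are lines whose first non-whitespace characters are three backticks.

```python
def ensure_blank_before_fences(text):
    """Insert a blank line before every opening fence that lacks one.

    Hypothetical helper for illustration; the PR does this with sed.
    """
    out = []
    in_block = False
    for line in text.split('\n'):
        is_fence = line.lstrip().startswith('```')
        if is_fence and not in_block:
            # Opening fence: make sure the previous emitted line is blank.
            if out and out[-1].strip():
                out.append('')
            in_block = True
        elif is_fence:
            # Closing fence: leave it alone.
            in_block = False
        out.append(line)
    return '\n'.join(out)


sample = 'Example:\n```\ncode\n```\nafter'
print(ensure_blank_before_fences(sample))
```

Tracking whether a fence opens or closes a block keeps the rule from adding a blank line before closing delimiters, and running the function twice leaves the text unchanged.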

esc_underscores(…) in tools/fromschema.py did not work correctly on strings containing an odd number of backticks, notably the ``` delimiters surrounding preformatted text blocks. Specifically, it was dropping the last backtick since none of the alternatives in the regex matched it. Add a new alternative that matches a whole preformatted block as a single unit.
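To make the dropped backtick concrete, here is a small reproduction using the pre-PR pattern as it appears on the removed line of the diff in the review thread. The wrapper name `old_esc_underscores` is made up for this sketch.

```python
import re

# Pre-PR pattern, copied from the removed line of the diff.
OLD_PATTERN = r'[^`_\\]+|`(?:[^`\\]|\\.)*`|\\.|_'


def old_esc_underscores(s):
    """Backslash-escape underscores outside of backtick-enclosed spans."""
    # re.findall silently skips any character that no alternative matches.
    return ''.join('\\_' if x == '_' else x for x in re.findall(OLD_PATTERN, s))


print(repr(old_esc_underscores('set_config')))    # underscore escaped
print(repr(old_esc_underscores('`set_config`')))  # span left untouched
print(repr(old_esc_underscores('```')))           # only two backticks survive
```

With three backticks, the first two are consumed as an empty backtick span and the third matches no alternative, so `re.findall` skips it and it never reaches the output.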

output_member(…) in tools/fromschema.py was passing each line of a member's description through esc_underscores(…) individually, but that breaks preformatted text blocks that are naturally multi-line and leads to mistakenly escaping underscores inside such blocks. Rewrite the code to make use of the outputs(…) utility function that joins all the provided lines together before passing the whole text through esc_underscores(…).
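The difference between escaping line by line and escaping the joined text can be sketched as follows. The pattern is the post-PR one from the diff, except that the possessive `++` is written as a plain `+` (an assumption made here so the sketch also runs on Python versions before 3.11); the sample description lines are invented for illustration.

```python
import re

# Post-PR pattern from the diff, with `++` relaxed to `+` (assumption, see above).
PATTERN = r'(?ms:^[ \t]*```.*?^[ \t]*```)|[^`_\\\n]+|`(?:[^`\\]|\\.)*`|\\.|[_\n]'


def esc_underscores(s):
    """Backslash-escape underscores outside backtick spans and fenced blocks."""
    return ''.join('\\_' if x == '_' else x for x in re.findall(PATTERN, s))


# Invented description lines containing a preformatted block.
lines = ['Returns the node_id:', '```', 'node_id=02ab', '```']

# Old behavior: each line is escaped on its own, so the fence is never seen whole.
per_line = '\n'.join(esc_underscores(line) for line in lines)

# New behavior: join first, then escape the whole text once.
joined = esc_underscores('\n'.join(lines))

print(per_line)
print(joined)
```

Escaped per line, each fence loses a backtick and the underscore inside the block is escaped; escaped as one string, the block is matched as a single unit and passes through unchanged.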

Drive-by fix a couple of flubbed preformatted blocks in schemas.

Checklist

Before submitting the PR, ensure the following tasks are completed. If an item is not applicable to your PR, please mark it as checked:

The changelog has been updated in the relevant commit(s) according to the guidelines.
Tests have been added or modified to reflect the changes. (Not applicable.)
Documentation has been reviewed and updated as needed. (That's what this PR does.)
Related issues have been listed and linked, including any that this PR closes. (I didn't find any.)

Changelog-None

rustyrussell

Minor fix, generally looks good!

rustyrussell · 2025-04-28T05:17:26Z

tools/fromschema.py

@@ -21,7 +21,7 @@ def output_title(title, underline='-', num_leading_newlines=1, num_trailing_newl

 def esc_underscores(s):
    """Backslash-escape underscores outside of backtick-enclosed spans"""
-    return ''.join(['\\_' if x == '_' else x for x in re.findall(r'[^`_\\]+|`(?:[^`\\]|\\.)*`|\\.|_', s)])
+    return ''.join(['\\_' if x == '_' else x for x in re.findall(r'(?ms:^[ \t]*```.*?^[ \t]*```)|[^`_\\\n]++|`(?:[^`\\]|\\.)*`|\\.|[_\n]', s)])


"++" here is wrong:

re.error: multiple repeat at position 40

whitslack · 2025-04-28

Hmm. Are you using a very old version of Python? The ++ (possessive one-or-more) quantifier works on Python 3.11.12, 3.12.10, and 3.13.3, but it gives the error you quoted on Python 3.10.17. Do you really need to maintain compatibility with ancient versions of Python? Possessive quantifiers avoid needlessly backtracking when we know that backtracking will not find any new matches, although, now that I am looking at this again, I don't think it's going to make any difference in this case since there are no assertions after that repeat, so no backtracking would ever be attempted even if the quantifier were non-possessive.
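One way to stay compatible with interpreters that reject `++` is to fall back to the ordinary `+` quantifier when compilation fails. This is a sketch of that idea only, not what the PR ended up doing; the names `BLOCK`, `REST`, and `compile_tokenizer` are hypothetical.

```python
import re

# Pieces of the post-PR pattern from the diff, split up for readability.
BLOCK = r'(?ms:^[ \t]*```.*?^[ \t]*```)'
REST = r'`(?:[^`\\]|\\.)*`|\\.|[_\n]'


def compile_tokenizer():
    """Prefer the possessive quantifier, fall back where it is unsupported."""
    try:
        # Possessive quantifiers were added to the re module in Python 3.11.
        return re.compile(BLOCK + r'|[^`_\\\n]++|' + REST)
    except re.error:
        # Older interpreters report "multiple repeat"; use the plain quantifier.
        return re.compile(BLOCK + r'|[^`_\\\n]+|' + REST)


TOKENIZER = compile_tokenizer()
print(TOKENIZER.findall('a_b'))
```

Because that alternative is followed by nothing that could force the engine to retry it, both variants should tokenize the same inputs identically, which matches the observation in the reply above.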

whitslack changed the title ~~doc: properly handle \\\preformatted blocks\\\~~ doc: properly handle preformatted blocks Apr 19, 2025

cdecker requested a review from ShahanaFarooqui April 21, 2025 15:26

rustyrussell reviewed Apr 28, 2025

rustyrussell added this to the v25.05 milestone Apr 28, 2025
