Update to commonmark spec 0.29 #156

robinst · 2019-07-12T05:22:25Z

This was a larger one because of the change in link reference definitions, see commonmark/commonmark-spec#395. I also cleaned up parsing of definitions to avoid parsing them twice. We can now also add them as nodes to the document, which is nice.

Fixes #152.


The spec was updated to clarify that link reference definitions can also be part of setext headings. Our parser makes these work by being able to "replace" the active paragraph block. In that case, we still need to collect link reference definitions from it. To implement this, I had to separate link reference definition parsing from InlineParser. The new way is cleaner and also allows us to add a Node to the document (for #98).

I think there's still room for improvement: we could parse the definitions as we go in ParagraphParser and collect them earlier. That might eliminate the current "double parsing" that we do in some cases.

Hopefully this will allow us to reuse it for parsing link reference definitions in an incremental way. Also speeds up parsing a bit:

-SpecBenchmark.parseExamples  thrpt  50  453.493 ± 4.647  ops/s
+SpecBenchmark.parseExamples  thrpt  50  483.467 ± 3.418  ops/s

The old code would parse link reference definitions twice in the worst case. The new one parses them as part of paragraph parsing. Looks like this is faster too:

-SpecBenchmark.parseExamples  thrpt  50  485.743 ± 1.864  ops/s
+SpecBenchmark.parseExamples  thrpt  50  550.071 ± 8.638  ops/s
-SpecBenchmark.parseWholeSpec  thrpt  50  284.494 ± 2.641  ops/s
+SpecBenchmark.parseWholeSpec  thrpt  50  297.277 ± 3.272  ops/s

See commonmark/commonmark.js@c89b35c

Also replaced the regex for escaping with a loop, which speeds up HTML rendering:

-SpecBenchmark.parseAndRenderExamples  thrpt  50  344.820 ± 1.215  ops/s
+SpecBenchmark.parseAndRenderExamples  thrpt  50  374.342 ± 2.445  ops/s
-SpecBenchmark.parseAndRenderWholeSpec  thrpt  50  151.209 ± 1.148  ops/s
+SpecBenchmark.parseAndRenderWholeSpec  thrpt  50  198.357 ± 2.601  ops/s

(Note that these benchmarks include parsing, so rendering itself saw a very nice improvement.)
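
For readers curious what "replacing the regex for escaping with a loop" can look like, here is a minimal sketch. It is not the code from this PR: the class name `EscapeSketch`, the method `escapeHtml`, and the exact set of escaped characters are assumptions made for illustration. The idea is to scan the string once and only allocate a `StringBuilder` when a character actually needs escaping.

```java
// Hypothetical illustration only; not the project's actual implementation.
final class EscapeSketch {

    /** Escapes &, <, > and " in a single pass without using a regex. */
    static String escapeHtml(String input) {
        StringBuilder sb = null; // allocated lazily, only if something needs escaping
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            String replacement;
            switch (c) {
                case '&': replacement = "&amp;"; break;
                case '<': replacement = "&lt;"; break;
                case '>': replacement = "&gt;"; break;
                case '"': replacement = "&quot;"; break;
                default: replacement = null;
            }
            if (replacement == null) {
                // Ordinary character: copy it only if we already started a builder.
                if (sb != null) {
                    sb.append(c);
                }
                continue;
            }
            if (sb == null) {
                // First special character: copy everything seen so far.
                sb = new StringBuilder(input.length() + 16);
                sb.append(input, 0, i);
            }
            sb.append(replacement);
        }
        // No special characters found: return the input unchanged, no allocation.
        return sb == null ? input : sb.toString();
    }
}
```

Calling `EscapeSketch.escapeHtml("a < b")` returns `a &lt; b`, and a string without special characters is returned as the same instance, which is where a loop like this can save work compared to a regex replace.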


robinst · 2019-07-12T06:14:28Z

commonmark/src/main/java/org/commonmark/internal/ReferenceParser.java

-/**
- * Parser for inline references
- */
-public interface ReferenceParser {


FYI @lalunamel this has been removed (to be released). It shouldn't really impact you, but just a heads up.

robinst added 22 commits April 12, 2019 17:59

Update spec to CommonMark 0.29, sync regression tests

6c3bec2

Change how newlines/spaces are handled in inline code (spec 0.29)

d08f5dc

Info strings for tilde code blocks can contain backticks and tildes (spec 0.29)

62d5f10

See commonmark/commonmark-spec#119

Allow internal delim runs to match if both have lengths that are multiples of 3 (spec 0.29)

9e10d39

Fix pathological case with input [\\\\... (a lot of backslashes)

a3cc5f0

Fix pathological case with input []([]([](...

e368db3

See commonmark/commonmark.js#129

Changes to link destination parsing (spec 0.29)

ccff691

* Allow spaces inside link destinations in pointy brackets
* Disallow link destination beginning with `<` unless it is inside `<..>`

Disallow unescaped '(' in link title

79c0a7c

See commonmark/commonmark-spec#526

Cleanup after extracting link reference definition parsing

320d570

Fix edge cases around link parsing

bdbc9d3

Disallow lists indented more than 3 spaces (spec 0.29)

48614f3

Adjust delimiter test to not use code block marker

e760f75

Adjust tests to spec changes

08b7748

Update regression tests from commonmark.js

490af42

No longer treat <meta> as a block tag (spec 0.29)

ab7c1c7

Fix strikethrough test by avoiding confusion with code block

c8ccf85

Adjust max length for decimal/numeric entities

b1d8bb4

See commonmark/commonmark-spec#487

Adjust comment and remove TODO

3011dcb
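
As an aside on commit 9e10d39 in the list above: the spec 0.29 rule says that when one of the two emphasis delimiters can both open and close, the lengths of the two delimiter runs must not sum to a multiple of 3, unless both lengths are themselves multiples of 3. The following is a minimal sketch of only that length check. The class `DelimiterRuleSketch`, the method `lengthsAllowMatch`, and its parameters are hypothetical names for illustration, not the parser's real API, and the sketch assumes flanking and same-character checks happen elsewhere.

```java
// Hypothetical illustration of the "multiple of 3" rule; not the project's code.
final class DelimiterRuleSketch {

    /**
     * @param openerLength        length of the delimiter run containing the opener
     * @param closerLength        length of the delimiter run containing the closer
     * @param eitherOpensAndCloses true if the opener can also close,
     *                             or the closer can also open
     * @return whether the length rule allows the two delimiters to match
     */
    static boolean lengthsAllowMatch(int openerLength, int closerLength,
                                     boolean eitherOpensAndCloses) {
        if (!eitherOpensAndCloses) {
            // The rule only applies when a delimiter can both open and close.
            return true;
        }
        boolean sumIsMultipleOf3 = (openerLength + closerLength) % 3 == 0;
        boolean bothMultiplesOf3 = openerLength % 3 == 0 && closerLength % 3 == 0;
        // Spec 0.29 added the "unless both are multiples of 3" exception.
        return !sumIsMultipleOf3 || bothMultiplesOf3;
    }
}
```

With this check, run lengths 1 and 2 are rejected when either delimiter can both open and close, while lengths 3 and 3 are accepted, which is the case the commit title describes.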

robinst merged commit 813dae3 into master Jul 12, 2019

robinst deleted the issue-152-commonmark-0.29 branch July 16, 2019 01:45
