Add newline to the parsing grammar to resolve some bugs #56

jmbeck15 · 2022-08-17T13:23:32Z

Summary

By adding newline (\n) to the grammar, this pull request will:

Resolve There's a funny bug with the way newlines are handled #44
Resolve Duplicate arguments when a newline precedes arguments without an explicit command #55
Fix some erroneous tests
Add some new tests

It may be easier to review by commit than in bulk.

Description

Newlines are significant in G-code: they mark the end of the current command and the start of a new one. This commit makes newlines a first-class token, and the parser now takes them into account.
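To illustrate the idea, here is a minimal, self-contained sketch. It is not this crate's lexer or parser: `Kind`, `tokenize`, and `split_commands` are hypothetical names made up for the example, and the handling of anything other than letters, numbers, and `\n` is deliberately simplified. It only shows how emitting `\n` as its own token lets a later stage decide where one command ends and the next begins.

```rust
/// Hypothetical token kinds for illustration; not the crate's `TokenType`.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Kind {
    Letter,
    Number,
    Newline,
}

/// Splits `src` into (kind, text, line) triples. Spaces and tabs are skipped,
/// but every `\n` is emitted as its own token.
fn tokenize(src: &str) -> Vec<(Kind, &str, usize)> {
    let bytes = src.as_bytes();
    let mut tokens = Vec::new();
    let mut line = 0;
    let mut pos = 0;
    while pos < bytes.len() {
        let start = pos;
        match bytes[pos] {
            b'\n' => {
                pos += 1;
                // The newline belongs to the line it terminates.
                tokens.push((Kind::Newline, &src[start..pos], line));
                line += 1;
            }
            b if b.is_ascii_alphabetic() => {
                pos += 1;
                tokens.push((Kind::Letter, &src[start..pos], line));
            }
            b if b.is_ascii_digit() || matches!(b, b'.' | b'-' | b'+') => {
                while pos < bytes.len()
                    && (bytes[pos].is_ascii_digit() || matches!(bytes[pos], b'.' | b'-' | b'+'))
                {
                    pos += 1;
                }
                tokens.push((Kind::Number, &src[start..pos], line));
            }
            // Simplification: everything else (spaces, tabs, '\r', ...) is skipped.
            _ => pos += 1,
        }
    }
    tokens
}

/// Groups token text into commands, treating each `Newline` as a terminator.
fn split_commands<'a>(tokens: &[(Kind, &'a str, usize)]) -> Vec<Vec<&'a str>> {
    let mut commands = Vec::new();
    let mut current = Vec::new();
    for &(kind, text, _line) in tokens {
        if kind == Kind::Newline {
            if !current.is_empty() {
                commands.push(std::mem::take(&mut current));
            }
        } else {
            current.push(text);
        }
    }
    if !current.is_empty() {
        commands.push(current);
    }
    commands
}
```

With this sketch, the input `"G1 X1\nY2"` is grouped into two commands (`G 1 X 1` and `Y 2`). If the newline were skipped like other whitespace, the grouping stage would have no way to tell that `Y2` starts a new line.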

How I tested

I added some tests, un-ignored or fixed other tests, and executed cargo test.

I also ran a bunch of gcode files through the parser as a sanity check. In retrospect, I should have created tests from them, so I may go back and do that if I need to make further changes.

Michael-F-Bryan

Thanks for making this PR @jmbeck15! I think adding newlines as an explicit token in the language totally makes sense.

Overall I'm pretty happy with the PR, but there were a couple small questions I'd like to get your thoughts on.

Michael-F-Bryan · 2022-08-17T15:12:40Z

gcode/src/lexer.rs

+    fn tokenize_newline(&mut self) -> Option<Token<'input>> {
+        let start = self.current_position;
+        let line = self.current_line;
+        let value = "\n";
+        self.current_position += 1;
+        self.current_line += 1;
+        Some(Token {
+            kind: TokenType::Newline,
+            value,
+            span: Span {
+                start,
+                line,
+                end: start + 1,
+            },
+        })
+    }


If the only way this function can be called is when we're looking at a \n character, would it be enough to add something like debug_assert!(self.rest().starts_with("\n")) and return a Token<'input>?

Otherwise, we should actually check for the newline character and return None if it isn't found.

Either way, I think we should try to use the newline characters from the original stream instead of writing let value = "\n".

jmbeck15 · 2022-08-17

I think what you're getting at is "this code has a weird smell", and I think you're right. 😁 But I'm honestly not sure how to construct this better. If we don't explicitly set value = "\n", we'd have to chomp() or otherwise pull the newline from the string, which is a computational expense that seems superfluous.

As for your suggestion

would it be enough to add something like debug_assert!(self.rest().starts_with("\n")) and return a Token<'input>

I don't think I understand. The debug_assert is something I could add; seems reasonable. But how would you return the Token<'input>?
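One possible reading of the suggestion, sketched under assumptions: the stand-in `Lexer` below keeps the original input in a field called `src` (a hypothetical name; the real struct may differ), and `rest()` returns the unconsumed tail. `current_position`, `current_line`, `Token`, `Span`, and `TokenType::Newline` mirror the names in the diff, but the definitions here are simplified stand-ins, not the crate's own. Slicing the input yields a `&'input str` that borrows from the original stream without copying, so the function can return `Token<'input>` directly instead of `Option<Token<'input>>`.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum TokenType {
    Newline,
}

#[derive(Debug, Clone, Copy, PartialEq)]
struct Span {
    start: usize,
    end: usize,
    line: usize,
}

#[derive(Debug, Clone, Copy, PartialEq)]
struct Token<'input> {
    kind: TokenType,
    value: &'input str,
    span: Span,
}

/// Simplified stand-in for the lexer; `src` is an assumed field name.
struct Lexer<'input> {
    src: &'input str,
    current_position: usize,
    current_line: usize,
}

impl<'input> Lexer<'input> {
    fn new(src: &'input str) -> Self {
        Lexer { src, current_position: 0, current_line: 0 }
    }

    /// The part of the input that has not been consumed yet.
    fn rest(&self) -> &'input str {
        &self.src[self.current_position..]
    }

    /// Must only be called when the next character is `\n`.
    fn tokenize_newline(&mut self) -> Token<'input> {
        debug_assert!(self.rest().starts_with('\n'));

        let src: &'input str = self.src;
        let start = self.current_position;
        let line = self.current_line;
        let end = start + 1;

        // Borrow the newline from the original input instead of a literal.
        let value = &src[start..end];

        self.current_position = end;
        self.current_line += 1;

        Token {
            kind: TokenType::Newline,
            value,
            span: Span { start, end, line },
        }
    }
}
```

The slice is a pointer-and-length view into the existing string, so no allocation or copy is involved; the cost is a bounds and character-boundary check. Whether that matters in practice for this lexer is a judgment call for the authors.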

Michael-F-Bryan · 2022-08-17T15:17:14Z

gcode/src/lexer.rs

@@ -371,4 +404,44 @@ mod tests {

        assert_eq!(got.value, "+3.14");
    }
+
+    #[test]
+    fn two_multi() {


It would probably be more readable if we rewrite the test to be something like this:

    let expected = vec![
        ("G", 0),
        ("0", 0),
        ("X", 0),
        ...
    ];
    let actual: Vec<_> = Lexer::new("...")
        .map(|tok| (tok.value, tok.span.line))
        .collect();
    assert_eq!(actual, expected);

jmbeck15 · 2022-08-17

For long g-code example tests, that's probably the way to go. Maybe we could create some helper methods that make testing easier. We're really missing these long-form g-code verification tests, so I took this one from the pull request @dr0ps made.

How about we keep this test as is for now, and rework these kinds of tests in the future?

gcode/src/parser.rs

gcode/tests/smoke_test.rs

jmbeck15 · 2022-08-18T11:38:28Z

@Michael-F-Bryan I pushed two more commits, one of which fixes #55. I was going to wait until after this pull request was merged to fix #55, but it turns out to be very much related to the newline handling.

If you prefer these fixes in separate pull requests, just let me know.

jpursell · 2024-05-05T12:19:04Z

Howdy! In case anyone is still out there, I tried using this library and ran into a bug. The library panicked on the gcode generated by DrawingBotV3-Free. Anyways, I am interested to know if this change would fix it. I'm hoping this message will motivate someone to resolve this merge request.

jmbeck15 · 2024-05-25T20:22:59Z

@jpursell if you send me some minimal example code that panics the library, I'll check it out. I'm curious.

jmbeck15 added 4 commits August 16, 2022 17:18

Replace deprecated intra_doc_link_resolution_failure to avoid warnings

5760850

Was replaced with broken_intra_doc_links.

Add newline to grammer to explicitly address end-of-command

c70543a

Fix skip_whitespace test and add respect_newlines test.

Fix smoke test expected_program_2_output

e3e5371

Removed part of the test that was simply invalid.

Add two_multi test from dr0ps

8ec331c

Tweaked it to return the expected newline character.

jmbeck15 force-pushed the fix-newline-detection branch from 8d7dfa8 to c11ad15 Compare August 17, 2022 13:26

Add tests implicit_command_after_newline and two_commands_in_a_row

2107449

Both fail, and are the basis for a new bug report.

jmbeck15 force-pushed the fix-newline-detection branch from c11ad15 to 2107449 Compare August 17, 2022 13:34

jmbeck15 marked this pull request as ready for review August 17, 2022 13:36

Michael-F-Bryan requested changes Aug 17, 2022

jmbeck15 and others added 4 commits August 17, 2022 20:33

Tweak test assert formulation

3fd5a6f

Co-authored-by: Michael Bryan <michaelfbryan@gmail.com>

Omit return statement for standard function exit

6185b90

Co-authored-by: Michael Bryan <michaelfbryan@gmail.com>

Fix implicit commands not respected after a newline

f52e6f6

Remove unused span and next_line_number functions

cec678b
