Fix newline parsing at trailing trivia #1661

Merged

ahoppen merged 7 commits into swiftlang:main from TTOzzi:fix-newline-parsing-at-trailing-trivia

May 15, 2023

Member

TTOzzi commented May 14, 2023

Resolves #1626.

Unlike leadingTrivia, trailingTrivia did not use the newlinePresence value of triviaResult.
This change stores the newlinePresence of the previous token's trailingTrivia and sets the isAtStartOfLine flag when parsing the next token.
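
The idea in the description above can be sketched in isolation. This is a minimal model under stated assumptions, not the code from this PR: `Lexeme`, `newlinePresence(in:)`, and `startOfLineFlags(for:)` are hypothetical names, and the lexemes are handed in already split into leading trivia, text, and trailing trivia instead of being lexed from source.

```swift
// Hypothetical, simplified model; none of these types are the real SwiftParser ones.
enum NewlinePresence { case absent, present }

struct Lexeme {
  var leadingTrivia: String
  var text: String
  var trailingTrivia: String
}

/// Reports whether a piece of trivia contains at least one newline character.
func newlinePresence(in trivia: String) -> NewlinePresence {
  trivia.contains(where: { $0.isNewline }) ? .present : .absent
}

/// Computes, for each lexeme, whether it should be flagged as starting a line.
/// A lexeme starts a line if its own leading trivia contains a newline OR the
/// previous lexeme's trailing trivia did.
func startOfLineFlags(for lexemes: [Lexeme]) -> [Bool] {
  // `nil` means no lexeme has been seen yet.
  var previousTrailing: NewlinePresence? = nil
  var flags: [Bool] = []
  for lexeme in lexemes {
    let fromLeading = newlinePresence(in: lexeme.leadingTrivia) == .present
    let fromPreviousTrailing = previousTrailing == .present
    flags.append(fromLeading || fromPreviousTrailing)
    // Remember this lexeme's trailing trivia for the next iteration.
    previousTrailing = newlinePresence(in: lexeme.trailingTrivia)
  }
  return flags
}
```

In this sketch the first lexeme is only flagged if its own leading trivia contains a newline; how the real lexer treats the very first token is outside the scope of the model.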

TTOzzi added 2 commits

May 14, 2023 21:04


          Add logic to check if previous token's trailing trivia contains newline characters

9b2ea0d


          Add test case

2ab4021

TTOzzi requested a review from ahoppen as a code owner

May 14, 2023 12:15

ahoppen reviewed

Member

ahoppen left a comment

Thanks for figuring this out @TTOzzi. I’ve got a couple questions/comments inline. Otherwise, this looks very good to me 👍🏽

Sources/SwiftParser/Lexer/Cursor.swift Outdated

+                    flags.insert(.isAtStartOfLine)
+                  }
+                  if let previousTokenNewlinePresence, previousTokenNewlinePresence == .present,
+                     !currentState.isParsingMultilineString {

Member

ahoppen May 14, 2023

Why do you need the isParsingMultilineString check? I just tried removing it and it looks like I just need to update three lexer tests, which seems perfectly reasonable to me because the line 1 and line 2 tokens really are on new lines.

Member Author

TTOzzi May 14, 2023

In an effort to minimize side effects, I focused on keeping the existing test cases unchanged, which led me to misunderstand the behavior of the lexer. As you mentioned, the line1, line2, and whitespace tokens (everything except the """ delimiter) really are on new lines.
I will try to make the changes this week. Thank you for your prompt feedback 🙂

Sources/SwiftParser/Lexer/Cursor.swift Outdated

Comment on lines 259 to 261


		/// If we have already lexed a token, the `NewlinePresence` of the previously lexed token
		var previousTokenNewlinePresence: NewlinePresence?

Member

ahoppen May 14, 2023

Just a naming suggestion: I’d like to make it absolutely clear that this is about whether the previous token’s trailing trivia has a newline and avoid any confusion of whether the previous token had a newline inside its leading trivia. So I’d go with

Suggested change

      
-		/// If we have already lexed a token, the `NewlinePresence` of the previously lexed token
-		var previousTokenNewlinePresence: NewlinePresence?
+		/// If we have already lexed a token, stores whether the previous lexeme’s trailing trivia contains a newline.
+		var previousLexemeTrailingTrivaNewlinePresence: NewlinePresence?

Member Author

TTOzzi May 14, 2023

Great suggestion! I will try to make the modifications along with the changes mentioned above. Thank you.

TTOzzi added 2 commits

May 15, 2023 23:47


          Modify parsing logic so that a newline in a multiline string also has the isAtStartOfLine flag

79c1b1a


          Rename the property that stores the previous parse value

d852fb9

TTOzzi requested a review from ahoppen

May 15, 2023 15:13

TTOzzi commented

Sources/SwiftParser/Lexer/Cursor.swift (resolved)

ahoppen reviewed

Sources/SwiftParser/Lexer/Cursor.swift Outdated

@@ -433,28 +445,24 @@ extension Lexer.Cursor {
                   if let stateTransition = result.stateTransition {
                     self.stateStack.perform(stateTransition: stateTransition, stateAllocator: stateAllocator)
                   }

Member

ahoppen May 15, 2023

I think swift-format will complain about this whitespace. Could you run format.py? https://github.com/apple/swift-syntax/blob/main/CONTRIBUTING.md#formatting

Member Author

TTOzzi May 15, 2023

Oops, I missed that!
I'll make sure to run it before every commit.
Thanks for pointing that out 🙇

Sources/SwiftParser/Lexer/Cursor.swift Outdated

+                  if let previousLexemeTrailingNewlinePresence, previousLexemeTrailingNewlinePresence == .present {
+                    flags.insert(.isAtStartOfLine)
+                  }
+                  self.previousLexemeTrailingNewlinePresence = nil

Member

ahoppen May 15, 2023

Resetting to nil doesn’t really make sense here, right? If we follow the same meaning as for previousTokenKind, nil means that there hasn’t been a previous token, but in this case there has been. I think the better implementation would be to add an else branch to the if statement below in which you set self.previousLexemeTrailingNewlinePresence, and in that else branch set self.previousLexemeTrailingNewlinePresence = .absent.

If you do that, I think you can also move this entire code block below the Token text lexing section and initialize flags with result.flags and avoid the union call when forming the result.
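
The three-state meaning discussed in this comment can be illustrated with a small sketch. It assumes a hypothetical `CursorState` type and `recordLexeme(trailingTrivia:)` method that are not part of SwiftParser; only the property name is taken from this thread.

```swift
// Hypothetical illustration of the nil / .absent / .present distinction.
enum NewlinePresence { case absent, present }

struct CursorState {
  /// `nil`      : no lexeme has been lexed yet.
  /// `.absent`  : a lexeme was lexed and its trailing trivia had no newline.
  /// `.present` : a lexeme was lexed and its trailing trivia had a newline.
  var previousLexemeTrailingNewlinePresence: NewlinePresence? = nil

  /// Records the trailing trivia of a lexeme that has just been lexed.
  /// The `else` branch writes `.absent` instead of resetting to `nil`,
  /// so `nil` keeps meaning "there was no previous lexeme".
  mutating func recordLexeme(trailingTrivia: String) {
    if trailingTrivia.contains(where: { $0.isNewline }) {
      previousLexemeTrailingNewlinePresence = .present
    } else {
      previousLexemeTrailingNewlinePresence = .absent
    }
  }
}
```

With this shape, a reader of the property can tell "start of input" apart from "previous lexeme ended without a newline", which a reset to `nil` would conflate.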

Member Author

TTOzzi May 15, 2023 (edited)

As you said, it seems like it should have a non-nil value once the previous token has been lexed.
However, I'm concerned that it would be redundant to assign .absent in an else for every branch that sets previousLexemeTrailingNewlinePresence.

https://github.com/apple/swift-syntax/pull/1661/files#diff-253b01bc981faa185173c1697784a41bf1b586e91ec7b4de806dfd2646db52c3R1900

The code above makes it look like I need to put previousLexemeTrailingNewlinePresence = .absent in the lex logic for each case.
https://github.com/apple/swift-syntax/blob/main/Sources/SwiftParser/Lexer/Cursor.swift#L410-L431

How about assigning .absent as the default value after using previousLexemeTrailingNewlinePresence at that position?
I think we can get some reasonable logic with little modification.

Suggested change

      
-                self.previousLexemeTrailingNewlinePresence = nil
+                self.previousLexemeTrailingNewlinePresence = .absent

Member Author

TTOzzi May 15, 2023

Additionally, if we move this block of code below the token text lexing section, the value assigned in the token text lexing logic will affect the flag value of the current token, so it would be difficult to move it 🤔

Member Author

TTOzzi May 15, 2023

Oh, now that I think about it, inside the lexing logic, it makes more sense to include the flag in the lexer's result and return it.
I'll make a quick fix and commit it.

Member

ahoppen May 15, 2023

Yes, that’s just what I was just about to suggest

Member Author

TTOzzi May 15, 2023 (edited)

Similar to TriviaResult, I also added a trailingNewlinePresence property to Lexer.Result and changed it so that modifications to the previousLexemeTrailingNewlinePresence value can only be made within the nextToken method.
The flag initialization logic was also moved below the token text lexing section.
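
A rough sketch of the design described in this comment, under stated assumptions: `LexResult`, `TokenFlags`, and `ToyCursor` are hypothetical stand-ins for the real `Lexer.Result`, lexeme flags, and `Lexer.Cursor`, and the "lexing" step is replaced by a queue of precomputed results. It only shows the data flow, with the result carrying `trailingNewlinePresence` and `nextToken()` being the single place that reads and writes the stored value.

```swift
// Hypothetical stand-ins; not the actual SwiftParser types.
enum NewlinePresence { case absent, present }

struct TokenFlags: OptionSet {
  let rawValue: UInt8
  static let isAtStartOfLine = TokenFlags(rawValue: 1 << 0)
}

/// Stand-in for a lexer result that reports its own trailing newline presence.
struct LexResult {
  var text: String
  var flags: TokenFlags = []
  var trailingNewlinePresence: NewlinePresence = .absent
}

struct ToyCursor {
  private var pending: [LexResult]
  /// Only `nextToken()` mutates this; outside code can read it but not set it.
  private(set) var previousLexemeTrailingNewlinePresence: NewlinePresence? = nil

  init(results: [LexResult]) { pending = results }

  mutating func nextToken() -> (text: String, flags: TokenFlags)? {
    guard !pending.isEmpty else { return nil }
    // "Token text lexing" is modeled by taking the next precomputed result.
    let result = pending.removeFirst()
    // Flags are initialized from the result, after the token text is known.
    var flags = result.flags
    if previousLexemeTrailingNewlinePresence == .present {
      flags.insert(.isAtStartOfLine)
    }
    // Store this lexeme's value for the next call.
    previousLexemeTrailingNewlinePresence = result.trailingNewlinePresence
    return (result.text, flags)
  }
}
```

Keeping the write inside `nextToken()` means individual lexing routines only have to report a value in their result and never touch cursor state directly.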

TTOzzi added 2 commits

May 16, 2023 01:50


          Add logic to check for newline during token text lexing

1b18fdb


          Formatting

8e6ce86

TTOzzi force-pushed the fix-newline-parsing-at-trailing-trivia branch from 7732f6e to 8e6ce86 Compare

May 15, 2023 16:50


          Rename newlinePresence property of the Lexer Result

fb134e9

TTOzzi requested a review from ahoppen

May 15, 2023 17:10

ahoppen approved these changes

Member

ahoppen left a comment

Thanks. This looks good ✅

Member

ahoppen commented May 15, 2023

@swift-ci Please test

ahoppen merged commit 94959f0 into swiftlang:main

Member Author

TTOzzi commented May 16, 2023

I appreciate your kind feedback. Thank you @ahoppen!

Member

ahoppen commented May 16, 2023

@TTOzzi Could you also open a PR with these changes targeting the release/5.9 branch? Since this fixes a proper parsing bug, I would like to include it in the Swift 5.9 release.

Member Author

TTOzzi commented May 16, 2023

@ahoppen Of course, I can do it! By when should I create the PR?

Member

ahoppen commented May 16, 2023

There is no clear deadline at the moment. Earlier is always better than later though 😉

Member Author

TTOzzi commented May 16, 2023

Ok. It's currently working hours here, so I'll create it after I finish work and leave for the day 😉

Member

ahoppen commented May 16, 2023

Anytime this week is equally good

TTOzzi mentioned this pull request

[5.9] Fix newline parsing at trailing trivia #1671

Merged


Labels

None yet