bpo-30377: Simplify handling of COMMENT and NL in tokenize.py #1607

albertjan · 2017-05-16T13:21:14Z

While porting tokenize.py to JavaScript for skulpt, I ran into this bit of code. It checks whether a line is a comment more often than necessary; this PR addresses that and makes the code a bit more readable.

Should I also do a PR for this against 2.7?

the-knights-who-say-ni · 2017-05-16T13:21:16Z

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA).

Unfortunately we couldn't find an account corresponding to your GitHub username on bugs.python.org (b.p.o) to verify you have signed the CLA. This is necessary for legal reasons before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

Thanks again for your contribution and we look forward to looking at it!

mention-bot · 2017-05-16T13:21:18Z

@albertjan, thanks for your PR! By analyzing the history of the files in this pull request, we identified @tiran, @serhiy-storchaka and @1st1 to be potential reviewers.

put both newlines and comments back into one branch and fall through to adding the newline if it's a comment.
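As a rough illustration of the fall-through this commit message describes, here is a minimal sketch. It is not the code from Lib/tokenize.py: the helper name `comment_or_nl_tokens` and the plain-tuple token format are assumptions made for the example.

```python
def comment_or_nl_tokens(line, lnum, pos):
    """Yield (kind, text, start, end) tuples for a line whose first
    non-blank character, at index pos, is '#' or a line break.

    Hypothetical helper for illustration only.
    """
    if line[pos] == "#":
        # Emit the comment without its trailing line break.
        comment = line[pos:].rstrip("\r\n")
        yield ("COMMENT", comment, (lnum, pos), (lnum, pos + len(comment)))
        pos += len(comment)
    # Fall through: a comment line and a blank line both end with NL.
    yield ("NL", line[pos:], (lnum, pos), (lnum, len(line)))


for tok in comment_or_nl_tokens("    # NL\n", 2, 4):
    print(tok)
for tok in comment_or_nl_tokens("    \n", 3, 4):
    print(tok)
```

The point of the single branch is that the comment check happens once, and the NL token is produced by the same statement in both cases.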

brettcannon · 2017-05-16T21:44:54Z

Due to a new release of Sphinx, we had to fix the documentation to build on Travis again. Please do a merge to get these changes to help get Travis passing on your PR.

albertjan · 2017-05-24T09:06:37Z

Apparently @the-knights-who-say-ni don't agree with bpo 😄 as bpo now says that I've signed the CLA.

serhiy-storchaka · 2017-05-24T11:31:30Z

Finally we convinced him. 😄

ambv · 2018-04-22T18:16:56Z

Lib/test/test_tokenize.py

-    COMMENT    '# NEWLINE'   (3, 17) (3, 26)
-    NEWLINE    '\\n'          (3, 26) (3, 27)
-    DEDENT     ''            (4, 0) (4, 0)
+    NL         '\\n'          (3, 4) (3, 5)


@serhiy-storchaka, this looks invalid to me. Why are there two newlines now? Now the tokenizer claims there are 5 lines in the snippet. There are 4 lines in the snippet.

serhiy-storchaka · 2018-04-22

Because there are two lines between the comment and "True". One ends the comment, and the other ends the blank line (spaces are ignored).

Look closer at the line numbers. "if" is at line 1, the comment is at line 2, "True" is at line 4. Previously the tokenizer reported "True" at line 3.

The fifth line contains only DEDENT. 4 newlines divide the whole text into 5 parts. If you write the sample text to a file and open it with a text editor, you can move the cursor to line 5, just after the 4th newline.

ambv · 2018-04-22

Yes, I was confused by the DEDENT. Thank you for the clarification!

serhiy-storchaka · 2018-04-22

I was confused myself too. It looked to me as if this change also fixed a bug in line numbering, but I was not able to find a bug in the old code. Later I figured out that the input data for this test was changed, so both the old and new code are correct. This PR just cleans up the code, as was claimed.
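To check the line numbering discussed in this thread, the snippet can be run through the standard `tokenize` module. This is only a sketch: the source string is reconstructed from the line and column numbers mentioned above (an assumption, it may not match the test input exactly), and `token_rows` is a hypothetical helper.

```python
import io
import tokenize

# Assumed input: a comment-only line, then a whitespace-only line,
# then an indented statement with a trailing comment.
SOURCE = (
    "if False:\n"
    "    # NL\n"
    "    \n"
    "    True = False # NEWLINE\n"
)


def token_rows(source):
    """Return (type name, string, start row) for each token in source."""
    readline = io.StringIO(source).readline
    return [
        (tokenize.tok_name[tok.type], tok.string, tok.start[0])
        for tok in tokenize.generate_tokens(readline)
    ]


rows = token_rows(SOURCE)
for name, string, row in rows:
    print(f"{name:<10} {string!r:<14} line {row}")
```

With this input, the `True` NAME token is reported on line 4, and two NL tokens appear: one ending the comment on line 2 and one for the whitespace-only line 3.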

albertjan added 2 commits May 16, 2017 14:09

add test to explicitly exercise branches

a6f464a

simplify check for newline or comment

dabd33f

the-knights-who-say-ni added the CLA not signed label May 16, 2017

serhiy-storchaka self-requested a review May 16, 2017 16:12

albertjan added 2 commits May 16, 2017 17:21

changes due to comments by @serhiy-storchaka

6212509

put both newlines and comments back into one branch and fall through to adding the newline if it's a comment.

fix comment, cleaner diff

feaed9d

serhiy-storchaka approved these changes May 16, 2017

serhiy-storchaka added the type-feature A feature request or enhancement label May 16, 2017

serhiy-storchaka closed this May 16, 2017

serhiy-storchaka reopened this May 16, 2017

brettcannon removed the CLA not signed label May 23, 2017

the-knights-who-say-ni added the CLA not signed label May 23, 2017

serhiy-storchaka removed the CLA not signed label May 24, 2017

the-knights-who-say-ni added the CLA signed label May 24, 2017

serhiy-storchaka merged commit c471ca4 into python:master May 24, 2017

albertjan deleted the fix-issue-30377 branch May 24, 2017 11:34

ambv reviewed Apr 22, 2018
