gh-119517: Fixes for pasting in pyrepl #120253

Merged
merged 12 commits into python:main on Jun 11, 2024

Conversation

godlygeek
Contributor

@godlygeek commented Jun 8, 2024

Drastically improve the performance of pyrepl when pasting large amounts of text.

CC: @pablogsal

Do-over of #120251, which I made a mess of 😓.

godlygeek added 10 commits June 7, 2024 23:01
* Remove pyrepl's optimization for self-insert

This will be replaced by a less specialized optimization.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
* Use line-buffering when pyrepl echoes pastes

Previously echoing was totally suppressed until the entire command had
been pasted and the terminal ended paste mode, but this gives the user
no feedback to indicate that an operation is in progress. Drawing
something to the screen once per line strikes a balance between
perceived responsiveness and performance.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
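
For illustration, a minimal sketch of the idea with hypothetical helper names (not the actual pyrepl code): update the buffer for every pasted character, but only redraw the screen when a full line has arrived.

```python
# Illustrative sketch only; `insert` and `refresh` are hypothetical stand-ins
# for the Reader's buffer update and screen redraw.
def echo_paste(pasted_text: str, insert, refresh) -> None:
    for ch in pasted_text:
        insert(ch)        # cheap: just record the character
        if ch == "\n":
            refresh()     # expensive: redraw once per pasted line
    refresh()             # final redraw when the paste ends

# Trivial usage with stand-ins:
buf: list[str] = []
echo_paste("print(1)\nprint(2)\n", buf.append, lambda: None)
```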
* Remove dead code from pyrepl

`msg_at_bottom` is always true.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
* Speed up pyrepl's screen rendering computation

The Reader in pyrepl doesn't hold a complete representation of the
screen area being drawn as persistent state. Instead, it recomputes it
on each keypress. This is fast enough for a few hundred bytes, but
incredibly slow as the input buffer grows into the kilobytes (likely
because of pasting).

Rather than making some expensive and expansive changes to the repl's
internal representation of the screen, add some caching: remember some
data from one refresh to the next about what was drawn to the screen
and, if we don't find anything that has invalidated the results that
were computed last time around, reuse them. To keep this caching as
simple as possible, all we'll do is look for lines in the buffer that
were above the cursor the last time we were asked to update the screen,
and that are still above the cursor now. We assume that nothing can
affect a line that comes before both the old and new cursor location
without us being informed. Based on this assumption, we can reuse old
lines, which drastically speeds up the overwhelmingly common case where
the user is typing near the end of the buffer.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
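
A minimal sketch of this caching, assuming (for simplicity) one rendered line per buffer line and using hypothetical names; the real Reader also handles wrapping, prompts, and cursor columns.

```python
from dataclasses import dataclass

def render_line(line: str) -> str:
    # Stand-in for the expensive per-line rendering work.
    return line.expandtabs()

@dataclass
class ScreenCache:
    buffer: list[str]      # buffer contents at the last refresh
    screen: list[str]      # rendered lines from the last refresh
    cursor_line: int       # cursor row at the last refresh

def compute_screen(buffer: list[str], cursor_line: int,
                   cache: ScreenCache | None) -> list[str]:
    reusable = 0
    if cache is not None:
        # Lines above both the old and the new cursor can't have changed
        # without the Reader being told, so they can be reused verbatim.
        limit = min(cursor_line, cache.cursor_line,
                    len(buffer), len(cache.buffer))
        while reusable < limit and buffer[reusable] == cache.buffer[reusable]:
            reusable += 1
    screen = cache.screen[:reusable] if cache is not None else []
    for line in buffer[reusable:]:
        screen.append(render_line(line))
    return screen
```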
* Speed up pyrepl prompt drawing

Cache the `can_colorize()` call rather than repeatedly recomputing it.
This call looks up an environment variable, and is called once per
character typed at the REPL. The environment variable lookup shows up as
a hot spot when profiling, and we don't expect this to change while the
REPL is running.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
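
A minimal sketch of the same idea; the `NO_COLOR`/isatty check below is only illustrative (the real `can_colorize()` consults more state), and the point is simply that the answer is computed once per process rather than once per keypress.

```python
import functools
import os
import sys

@functools.cache
def colors_enabled() -> bool:
    # Computed once and memoized, instead of re-reading the environment
    # on every character typed at the REPL.
    if os.environ.get("NO_COLOR"):
        return False
    return sys.stdout.isatty()
```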
* Speed up pasting multiple lines into the REPL

Previously, we were checking whether the command should be accepted each
time a line break was encountered, but that's not the expected behavior.
In bracketed paste mode, we expect everything pasted to be part of
a single block of code, and encountering a newline shouldn't behave like
a user pressing <Enter> to execute a command. The user should always
have a chance to review the pasted command before running it.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
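
A sketch of the intended behavior, using a tiny stand-in for the Reader (hypothetical names, not pyrepl's actual API); `command_complete` stands in for whatever check decides that a typed command is ready to run.

```python
from dataclasses import dataclass, field

@dataclass
class FakeReader:
    buffer: list[str] = field(default_factory=list)
    in_bracketed_paste: bool = False
    accepted: bool = False

    def insert(self, text: str) -> None:
        self.buffer.append(text)

def handle_newline(reader: FakeReader, command_complete: bool) -> None:
    if reader.in_bracketed_paste:
        # Inside a paste, Enter is just data: keep accumulating the block.
        reader.insert("\n")
    elif command_complete:
        # Only a newline typed by the user may accept and run the command.
        reader.accepted = True
    else:
        reader.insert("\n")
```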
* Use a read buffer for input in pyrepl

Previously we were reading one byte at a time, which caused much slower
IO than necessary. Instead, read in chunks, processing previously read
data before asking for more.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
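
A minimal sketch of chunked reading (hypothetical class and buffer size, not the actual pyrepl code): bytes are still handed out one at a time, but the underlying `read()` syscall happens once per chunk.

```python
import os

class BufferedInput:
    def __init__(self, fd: int, bufsize: int = 1024):
        self.fd = fd
        self.bufsize = bufsize
        self.buf = b""

    def next_byte(self) -> bytes:
        # Refill only when the local buffer runs dry: one read() syscall
        # per chunk instead of one per byte.
        if not self.buf:
            self.buf = os.read(self.fd, self.bufsize)
        b, self.buf = self.buf[:1], self.buf[1:]
        return b   # b"" signals EOF
```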
* Optimize finding width of a single character

`wlen` finds the width of a multi-character string by adding up the
width of each character, and then subtracting the width of any escape
sequences. It's often called for single character strings, however,
which can't possibly contain escape sequences. Optimize for that case.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
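
Roughly, the fast path looks like this (simplified sketch; the real `str_width` and escape-sequence handling are more involved):

```python
import unicodedata

def str_width(c: str) -> int:
    # Simplified single-character width.
    return 2 if unicodedata.east_asian_width(c) in ("W", "F") else 1

def wlen(s: str) -> int:
    if len(s) == 1:
        # A single character can't contain an escape sequence,
        # so skip the escape-sequence accounting entirely.
        return str_width(s)
    width = sum(str_width(ch) for ch in s)
    # ... subtract the width of any escape sequences here (omitted) ...
    return width
```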
* Optimize disp_str for ASCII characters

Since every ASCII character is known to display as single width, we can
avoid not only the Unicode data lookup in `disp_str` but also the one
hidden in `str_width` for them.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
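
A simplified sketch of the ASCII fast path (illustrative only; the real change applies the same check inside `disp_str` and `str_width`):

```python
import unicodedata

def char_width(c: str) -> int:
    if "\x20" <= c <= "\x7e":
        # Printable ASCII is always single width: skip the Unicode DB lookup.
        return 1
    return 2 if unicodedata.east_asian_width(c) in ("W", "F") else 1
```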
* Speed up cursor movements in long pyrepl commands

When the current pyrepl command buffer contains many lines, scrolling up
becomes slow. We have optimizations in place to reuse lines above the
cursor position from one refresh to the next, but don't currently try to
reuse lines below the cursor position in the same way, so we wind up
with quadratic behavior where all lines of the buffer below the cursor
are recomputed each time the cursor moves up another line.

Optimize this by only computing one screen's worth of lines beyond the
cursor position. Any lines beyond that can't possibly be shown by the
console, and bounding this makes scrolling up have linear time
complexity instead.

Signed-off-by: Matt Wozniski <mwozniski@bloomberg.net>
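
A minimal sketch of that bound (hypothetical names): only lines within one screen height of the cursor ever need rendering, so the per-keypress work stops growing with the total buffer length.

```python
def lines_to_render(buffer: list[str], cursor_line: int,
                    screen_height: int) -> list[str]:
    # Anything more than a screenful below the cursor can't be visible.
    last_needed = min(len(buffer), cursor_line + screen_height)
    return buffer[:last_needed]
```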
@pablogsal
Member

pablogsal commented Jun 10, 2024

@treyhunner can you give this PR a test? This PR doesn't use the lazy refresh approach and, in my testing, it makes pasting as fast as the old REPL when pasting the full text of Frankenstein.

@treyhunner
Member

@pablogsal This works great for me! 👍

I was able to paste the entire text of Frankenstein in under 4 seconds! (was ~20 seconds before this change)

I can see the full output of the pasted text. This is particularly helpful when pasting just a few hundred lines of text.

This will be a very big improvement for my use case. 👏


The only (very slight) odd thing I noticed is that the (paste) prompt and the ... prompt are both used (with ... appearing just at the end) in different parts of the pasted text. That leads to a slight misalignment, since (paste) is 4 characters longer than ....

Example snippet:

Python 3.14.0a0 (main, Jun 10 2024, 09:49:48) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
(paste) us_states = [
   [removed states]
(paste)     "Montana",
(paste)     "Nebraska",
(paste)     "Nevada",
...     "New Hampshire",
...     "New Jersey",
...     "New Mexico",
    [removed states]
>>>

As @pablogsal noted in #120234, IPython only shows the last terminal-sized chunk of output when scrolling up through history, so this is a fairly minor issue.

However, I do anticipate that the current behavior will occasionally raise eyebrows among new Python users and may require a bit of hand-waving to explain it away.

Personally, I'd love to see ... used in paste mode (for consistency with the old REPL) because I plan to only ever use paste mode via bracketed paste. If the lack of a distinct prompt for paste mode might confuse some folks, maybe using (p) as the prompt (since it's 3 characters) would make sense.

Again though, this issue is minor and I doubt it would cause frequent confusion for new users.

Thank you for the big improvement @godlygeek.

@pablogsal
Member

I think there is an easy fix for the prompt issue: just use the paste prompt in non-bracketed-paste mode (which is the original reason we added it, so a user typing manually knows they're in paste mode). WDYT @godlygeek?

@godlygeek
Contributor Author

I think that only showing the indicator when the user has explicitly toggled paste mode on is a good idea. I also think that showing the indicator on every line is surprising. Even in manual paste mode, I'd only show a paste indicator either on the line currently being edited, or (preferably) on the message line (where messages like "[ not unique ]" show up during tab completion).

@pablogsal
Member

I'm happy to do either, so let's change it.

@pablogsal
Member

@godlygeek I have made some small changes, like encapsulating the cached data into a class, and deactivated the paste prompt in bracketed paste mode. I will land this after the tests pass.

@pablogsal enabled auto-merge (squash) June 11, 2024 15:44
@lysnikolaou added the topic-repl (Related to the interactive shell) label Jun 11, 2024
@hugovk
Member

hugovk commented Jun 11, 2024

This needs a NEWS file or skip-news label to auto-merge.

@pablogsal added the skip news and needs backport to 3.13 labels Jun 11, 2024
@pablogsal merged commit 32a0fab into python:main Jun 11, 2024
42 checks passed
@miss-islington-app

Thanks @godlygeek for the PR, and @pablogsal for merging it 🌮🎉.. I'm working now to backport this PR to: 3.13.
🐍🍒⛏🤖

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request Jun 11, 2024
@bedevere-app

bedevere-app bot commented Jun 11, 2024

GH-120353 is a backport of this pull request to the 3.13 branch.

@lysnikolaou
Member

FWIW it still takes about 20 seconds to paste in the entire text of Frankenstein for me.

@godlygeek
Contributor Author

FWIW it still takes about 20 seconds to paste in the entire text of Frankenstein for me.

Did you check how long it takes in 3.13b1? Is this PR faster, or no improvement whatsoever?

@pablogsal
Member

FWIW it still takes about 20 seconds to paste in the entire text of Frankenstein for me.

Maybe you are testing a debug version? Also I noticed that different terminals paste at different rates, so always check against PYTHON_BASIC_REPL=1

@godlygeek
Contributor Author

godlygeek commented Jun 11, 2024

I have made some small changes like encapsulating the cached data into a class

Thanks - I was waffling between keeping the changes localized vs doing that refactoring.

and deactivated the prompt for paste in bracketed paste mode

👍 I looked at what it would take to get the paste mode indicator to show up in the message line, as I had proposed. It doesn't work as nicely as I would have hoped: the Console class may decide not to show that line if you're scrolled up, or if the cursor line is the bottom line of the screen. I may send another PR tweaking some things there to improve it, but ultimately it won't work as nicely as I had hoped, so just disabling it for bracketed pastes is definitely the quick fix.

@lysnikolaou
Member

Did you check how long it takes in 3.13b1? Is this PR faster, or no improvement whatsoever?

Oh yeah, this is a significant improvement over 3.13b1. The old REPL is still faster and only takes ~6 seconds.

@pablogsal
Member

The old REPL is still faster and only takes ~6 seconds.

Are you comparing using the same compiled version and setting PYTHON_BASIC_REPL on and off?

@godlygeek
Contributor Author

The old REPL is still faster and only takes ~6 seconds.

What terminal are you using? Does it support bracketed pastes? One of the optimizations here only applies when bracketed paste mode is enabled.

@lysnikolaou
Member

Are you comparing using the same compiled version and setting PYTHON_BASIC_REPL on and off?

Yes.

What terminal are you using? Does it support bracketed pastes? One of the optimizations here only applies when bracketed paste mode is enabled.

I'm using iTerm2, which supports bracketed paste mode AFAIK.

pablogsal added a commit that referenced this pull request Jun 11, 2024
gh-119517: Fixes for pasting in pyrepl (GH-120253)

@pablogsal
Member

Wow, on my Mac, using iTerm2 and a non-debug build, it takes exactly the same amount of time to paste the entire text. I wonder what's going on here...

@pablogsal
Member

@lysnikolaou If you go to iTerm > Edit > Paste special > Paste faster, what value do you get?

[Screenshot: iTerm2 Paste special settings, 2024-06-11 at 18:28:41]

@lysnikolaou
Member

If you go to iTerm > Edit > Paste special > Paste faster, what value do you get?

I'm getting 173 kB/sec.

I tested with a non-debug --enable-optimizations build and it now takes about the same, ~7 seconds with both the old and the new REPL. So false alarm.

Thanks again @godlygeek!

mrahtz pushed a commit to mrahtz/cpython that referenced this pull request Jun 30, 2024
noahbkim pushed a commit to hudson-trading/cpython that referenced this pull request Jul 11, 2024
estyxx pushed a commit to estyxx/cpython that referenced this pull request Jul 17, 2024