Conversation

@dutow dutow commented Aug 18, 2025

There is at least one corner-case scenario where we have to load the last record into the cache during a write:

  • replica crashes, receives last segment from primary
  • replica replays last segment, reaches end
  • replica activates new key
  • replica replays prepared transaction, has to use old keys again
  • the old-key write function sees that we generated a new key and tries to load it

In this scenario we could get away with detecting that we are in a write and asserting if we tried to use the last key.

But in a release build assertions do not fire, so we would end up writing unencrypted data to disk, and recovery, if we later have to run it, would fail.

It could be a FATAL, but that would still crash the server, and the next startup would crash again and again...

Instead, to properly avoid this situation, we preallocate memory for one more key in the cache during initialization. Since we can add at most one extra key to the cache during the server's run, this means we no longer try to allocate in the critical section in any corner case.

While this is not the nicest solution, it is simple and keeps the current cache and decrypt/encrypt logic the same as before. Any other solution would be more complex, and even more of a hack, as it would require dealing with a possibly out-of-date cache.
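To make the idea concrete, here is a minimal sketch of the preallocation pattern described above. All names (KeyCache, cache_init, cache_add_key, etc.) are illustrative, not the actual pg_tde identifiers: the point is only that the one spare entry is allocated at init time, so adding a key inside a critical section never calls malloc().

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical sketch of a key cache with one preallocated spare entry. */
typedef struct KeyEntry
{
	int		generation;
	char	key[32];
} KeyEntry;

typedef struct KeyCache
{
	KeyEntry  **entries;
	int			nentries;
	KeyEntry   *spare;		/* preallocated outside any critical section */
} KeyCache;

/* Called during initialization, never inside a critical section. */
static void
cache_init(KeyCache *cache, int capacity)
{
	cache->entries = calloc(capacity, sizeof(KeyEntry *));
	cache->nentries = 0;
	cache->spare = malloc(sizeof(KeyEntry));	/* the "one more key" */
}

/* Safe inside a critical section: only consumes the spare, no malloc(). */
static KeyEntry *
cache_add_key(KeyCache *cache, int generation)
{
	KeyEntry   *e = cache->spare;

	/* At most one extra key can be added during the server's run. */
	assert(e != NULL);
	cache->spare = NULL;
	e->generation = generation;
	cache->entries[cache->nentries++] = e;
	return e;
}
```

The design choice mirrors the PR: the cache layout and the encrypt/decrypt paths stay unchanged; only the allocation is moved out of the critical section.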

@codecov-commenter commented Aug 18, 2025

Codecov Report

❌ Patch coverage is 85.71429% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.49%. Comparing base (167aef2) to head (d01a7c0).

❌ Your project status has failed because the head coverage (82.49%) is below the target coverage (90.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@                  Coverage Diff                  @@
##           TDE_REL_17_STABLE     #539      +/-   ##
=====================================================
- Coverage              82.50%   82.49%   -0.01%     
=====================================================
  Files                     25       25              
  Lines                   3207     3217      +10     
  Branches                 508      510       +2     
=====================================================
+ Hits                    2646     2654       +8     
  Misses                   452      452              
- Partials                 109      111       +2     
Components   Coverage          Δ
access       84.84% <85.71%>   (-0.07%) ⬇️
catalog      87.68% <ø>        (ø)
common       77.77% <ø>        (ø)
encryption   72.97% <ø>        (ø)
keyring      73.21% <ø>        (ø)
src          94.15% <ø>        (ø)
smgr         96.53% <ø>        (ø)
transam      ∅ <ø>             (∅)

@AndersAstrand
Collaborator

AndersAstrand commented Aug 18, 2025

I find the names a bit confusing. Maybe prealloc would be clearer than extra, and record clearer than alloc for the cache record.

To me it's not clear what "extra" refers to at all at least, while "preallocated" is clear.

@AndersAstrand left a comment

This is a good solution for now. We need to clean up the whole WAL key cache setup at some point anyway.

I think the naming could be made clearer. And maybe InitWrite should always call the preallocate function, with a comment like "Ensure there are preallocated key and cache entries because they're sometimes needed in the critical section". That would be easier to understand than working out why this is only done when loading all of the keys.
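The reviewer's suggestion could look roughly like the sketch below. WalKeyCache, preallocate_cache_record, and InitWrite are hypothetical names standing in for the real pg_tde functions; the point is that InitWrite unconditionally ensures the spare record exists, documented with the suggested comment, rather than doing so only on the load-all-keys path.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical cache holding one spare record for critical-section use. */
typedef struct WalKeyCache
{
	void	   *prealloc_record;
} WalKeyCache;

/* Idempotent: allocates the spare record only if it is missing. */
static void
preallocate_cache_record(WalKeyCache *cache)
{
	if (cache->prealloc_record == NULL)
		cache->prealloc_record = malloc(64);
}

static void
InitWrite(WalKeyCache *cache)
{
	/*
	 * Ensure there are preallocated key and cache entries because they're
	 * sometimes needed in the critical section (e.g. the prepared-
	 * transaction replay corner case described above).
	 */
	preallocate_cache_record(cache);

	/* ... fetch keys and continue initialization ... */
}
```

Because the preallocate call is idempotent, calling it unconditionally here costs nothing on the common path and makes the invariant obvious at the call site.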

@dutow
Collaborator Author

dutow commented Aug 18, 2025

> And maybe InitWrite should always call the preallocate function with a comment like

Technically, that will happen; the check there is only a safety net. It could be an assert. But since the init function is the first function that tries to use the WAL cache, it will also be the first function to fetch the keys.

@dutow dutow merged commit ff8a389 into percona:TDE_REL_17_STABLE Aug 18, 2025
18 of 19 checks passed
@dutow dutow deleted the pg1604-2 branch August 18, 2025 16:59