Lock free hash set for fstrings #12921

jhawthorn · 2025-03-12T22:06:23Z

This implements a hash set which is wait-free for lookup and lock-free for insert (unless resizing) to use for fstring de-duplication.

As highlighted in https://bugs.ruby-lang.org/issues/19288, heavy use of fstrings (frozen interned strings) can significantly reduce the parallelism of Ractors.

I was at first intimidated by writing the lock-free version. I previously tried a few other approaches: using an RWLock, striping a series of RWlocks (partitioning the hash N-ways to reduce lock contention), and putting a cache in front of it. All of these improved the situation, but were unsatisfying as all still required locks for writes (and granular locks are awkward, since we run the risk of needing to reach a vm barrier) and this table is somewhat write-heavy.

My main reference for this was Cliff Click's talk on a lock free hash-table for java https://www.youtube.com/watch?v=HJ-719EGIts. It turns out this lock-free hash set is made easier to implement by a few properties:

We only need a hash set rather than a hash table (we only need keys, not values), and so the full entry can be written as a single VALUE
As a set we only need lookup/insert/delete, no update
Delete is only run inside GC so does not need to be atomic (It could be made concurrent)
~~I use rb_vm_barrier for the (rare) table rebuilds (It could be made concurrent)~~ We VM lock (but don't require other threads to stop) for table rebuilds, as those are rare
The conservative garbage collector makes deferred replication easy, using a T_DATA object

Another benefits of having a table specific to fstrings is that we compare by value on lookup/insert, but by identity on delete, as we only want to remove the exact string which is being freed. This is faster and provides a second way to avoid the race condition in https://bugs.ruby-lang.org/issues/21172.

This is a pretty standard open-addressing hash table with quadratic probing. Similar to our existing st_table or id_table. Deletes (which happen on GC) replace existing keys with a tombstone, which is the only type of update which can occur. Tombstones are only cleared out on resize.

Unlike st_table, the VALUEs are stored in the hash table itself (st_table's bins) rather than as a compact index. This avoids an extra pointer dereference and is possible because we don't need to preserve insertion order. The table targets a load factor of 2 (it is enlarged once it is half full).

string.c

internal/string.h

string.c

This allows more flexibility in how we deal with the fstring table

This implements a hash set which is wait-free for lookup and lock-free for insert (unless resizing) to use for fstring de-duplication. As highlighted in https://bugs.ruby-lang.org/issues/19288, heavy use of fstrings (frozen interned strings) can significantly reduce the parallelism of Ractors. I tried a few other approaches first: using an RWLock, striping a series of RWlocks (partitioning the hash N-ways to reduce lock contention), and putting a cache in front of it. All of these improved the situation, but were unsatisfying as all still required locks for writes (and granular locks are awkward, since we run the risk of needing to reach a vm barrier) and this table is somewhat write-heavy. My main reference for this was Cliff Click's talk on a lock free hash-table for java https://www.youtube.com/watch?v=HJ-719EGIts. It turns out this lock-free hash set is made easier to implement by a few properties: * We only need a hash set rather than a hash table (we only need keys, not values), and so the full entry can be written as a single VALUE * As a set we only need lookup/insert/delete, no update * Delete is only run inside GC so does not need to be atomic (It could be made concurrent) * I use rb_vm_barrier for the (rare) table rebuilds (It could be made concurrent) We VM lock (but don't require other threads to stop) for table rebuilds, as those are rare * The conservative garbage collector makes deferred replication easy, using a T_DATA object Another benefits of having a table specific to fstrings is that we compare by value on lookup/insert, but by identity on delete, as we only want to remove the exact string which is being freed. This is faster and provides a second way to avoid the race condition in https://bugs.ruby-lang.org/issues/21172. This is a pretty standard open-addressing hash table with quadratic probing. Similar to our existing st_table or id_table. Deletes (which happen on GC) replace existing keys with a tombstone, which is the only type of update which can occur. Tombstones are only cleared out on resize. Unlike st_table, the VALUEs are stored in the hash table itself (st_table's bins) rather than as a compact index. This avoids an extra pointer dereference and is possible because we don't need to preserve insertion order. The table targets a load factor of 2 (it is enlarged once it is half full).

jhawthorn force-pushed the ractor_fstring_hash_set branch from edd59a6 to ca42f48 Compare March 13, 2025 02:12

byroot reviewed Mar 13, 2025

View reviewed changes

string.c Outdated Show resolved Hide resolved

luke-gru reviewed Mar 14, 2025

View reviewed changes

string.c Outdated Show resolved Hide resolved

luke-gru reviewed Mar 15, 2025

View reviewed changes

string.c Outdated Show resolved Hide resolved

nobu reviewed Mar 20, 2025

View reviewed changes

internal/string.h Show resolved Hide resolved

string.c Show resolved Hide resolved

nobu reviewed Mar 21, 2025

View reviewed changes

string.c Outdated Show resolved Hide resolved

This comment has been minimized.

Sign in to view

jhawthorn force-pushed the ractor_fstring_hash_set branch 2 times, most recently from 299423d to 6100413 Compare April 9, 2025 18:18

jhawthorn marked this pull request as ready for review April 16, 2025 01:36

jhawthorn force-pushed the ractor_fstring_hash_set branch 2 times, most recently from e8a13b6 to 20acd94 Compare April 16, 2025 03:10

jhawthorn added 4 commits April 17, 2025 20:11

Extract rb_gc_free_fstring to string.c

cd894ca

This allows more flexibility in how we deal with the fstring table

Work on ATOMIC_VALUE_SET

1cb2fba

Add benchmarks for fstring de-duplication

cf08359

jhawthorn force-pushed the ractor_fstring_hash_set branch from aaaf3ab to cf08359 Compare April 18, 2025 03:12

jhawthorn enabled auto-merge (rebase) April 18, 2025 03:15

jhawthorn merged commit 3a29e83 into ruby:master Apr 18, 2025
76 checks passed

jhawthorn deleted the ractor_fstring_hash_set branch April 18, 2025 04:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Lock free hash set for fstrings #12921

Lock free hash set for fstrings #12921

Uh oh!

jhawthorn commented Mar 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

Lock free hash set for fstrings #12921

Lock free hash set for fstrings #12921

Uh oh!

Conversation

jhawthorn commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

jhawthorn commented Mar 12, 2025 •

edited

Loading