set.c: Store `set_table->bins` at the end of `set_table->entries` #14179

byroot · 2025-08-12T08:48:54Z

Followup: #14134

This saves one pointer in struct set_table, which would allow Set objects to still fit in 80B TypedData slots even if RTypedData goes from 32B to 40B large.

The existing set benchmark seem to show this doesn't have a very significant impact. Smaller sets are a bit faster, larger sets a bit slower.

It seem consistent over multiple runs, but it's unclear how much of that is just error margin.

compare-ruby: ruby 3.5.0dev (2025-08-12T02:14:57Z master 428937a536) +YJIT +PRISM [arm64-darwin24]
built-ruby: ruby 3.5.0dev (2025-08-12T07:22:26Z set-entries-bounds da30024fdc) +YJIT +PRISM [arm64-darwin24]
warming up........

|                         |compare-ruby|built-ruby|
|:------------------------|-----------:|---------:|
|new_0                    |     15.459M|   15.823M|
|                         |           -|     1.02x|
|new_10                   |      3.484M|    3.574M|
|                         |           -|     1.03x|
|new_100                  |    546.992k|  564.679k|
|                         |           -|     1.03x|
|new_1000                 |     49.391k|   48.169k|
|                         |       1.03x|         -|
|aref_0                   |     18.643M|   19.350M|
|                         |           -|     1.04x|
|aref_10                  |      5.941M|    6.006M|
|                         |           -|     1.01x|
|aref_100                 |    822.197k|  814.219k|
|                         |       1.01x|         -|
|aref_1000                |     83.230k|   79.411k|
|                         |       1.05x|         -|

This saves one pointer in `struct set_table`, which would allow `Set` objects to still fit in 80B TypedData slots even if RTypedData goes from 32B to 40B large. The existing set benchmark seem to show this doesn't have a very significant impact. Smaller sets are a bit faster, larger sets a bit slower. It seem consistent over multiple runs, but it's unclear how much of that is just error margin. ``` compare-ruby: ruby 3.5.0dev (2025-08-12T02:14:57Z master 428937a) +YJIT +PRISM [arm64-darwin24] built-ruby: ruby 3.5.0dev (2025-08-12T07:22:26Z set-entries-bounds da30024fdc) +YJIT +PRISM [arm64-darwin24] warming up........ | |compare-ruby|built-ruby| |:------------------------|-----------:|---------:| |new_0 | 15.459M| 15.823M| | | -| 1.02x| |new_10 | 3.484M| 3.574M| | | -| 1.03x| |new_100 | 546.992k| 564.679k| | | -| 1.03x| |new_1000 | 49.391k| 48.169k| | | 1.03x| -| |aref_0 | 18.643M| 19.350M| | | -| 1.04x| |aref_10 | 5.941M| 6.006M| | | -| 1.01x| |aref_100 | 822.197k| 814.219k| | | 1.01x| -| |aref_1000 | 83.230k| 79.411k| | | 1.05x| -| ```

jeremyevans

Great work! I think it would be worth considering this approach for st_table, and potentially shrinking hash from 160 bytes to 80 bytes (reducing AR table size from 8 to 3). There are far more hashes than sets, and I would guess that the majority of hashes are likely to have 0-3 entries. Not sure if the potential performance hit for 4-8 entry hashes is worth the memory savings. Hopefully the benchmarks could help determine that.

byroot · 2025-08-12T19:56:53Z

I think it would be worth considering this approach for st_table,

AFAIK, RHash is only 24B, so st_table can already fit in a 80B slot AFAIK.

@peterzhu2118 do you remember if there is a reason hashes default to 160B slots?

peterzhu2118 · 2025-08-12T21:03:07Z

do you remember if there is a reason hashes default to 160B slots?

Yes, because Hashes using AR tables are 160 bytes in size.

byroot · 2025-08-12T21:47:05Z

because Hashes using AR tables are 160 bytes in size.

Yes we know that, but they could be whatever size we want. I suppose they were this size before WVA?

byroot marked this pull request as draft August 12, 2025 08:48

byroot force-pushed the shink-set-table branch from e00968e to b45da2b Compare August 12, 2025 09:08

byroot marked this pull request as ready for review August 12, 2025 09:49

byroot requested a review from jeremyevans August 12, 2025 09:49

byroot mentioned this pull request Aug 12, 2025

RTypedData: keep direct reference to IMEMO/fields #14134

Merged

jeremyevans approved these changes Aug 12, 2025

View reviewed changes

byroot merged commit 85c5207 into ruby:master Aug 12, 2025
93 of 95 checks passed

byroot deleted the shink-set-table branch August 12, 2025 21:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

set.c: Store `set_table->bins` at the end of `set_table->entries` #14179

set.c: Store `set_table->bins` at the end of `set_table->entries` #14179

byroot commented Aug 12, 2025

Uh oh!

jeremyevans left a comment

Uh oh!

byroot commented Aug 12, 2025

Uh oh!

Uh oh!

peterzhu2118 commented Aug 12, 2025

Uh oh!

byroot commented Aug 12, 2025

Uh oh!

Uh oh!

set.c: Store set_table->bins at the end of set_table->entries #14179

set.c: Store set_table->bins at the end of set_table->entries #14179

Conversation

byroot commented Aug 12, 2025

Uh oh!

jeremyevans left a comment

Choose a reason for hiding this comment

Uh oh!

byroot commented Aug 12, 2025

Uh oh!

Uh oh!

peterzhu2118 commented Aug 12, 2025

Uh oh!

byroot commented Aug 12, 2025

Uh oh!

Uh oh!

set.c: Store `set_table->bins` at the end of `set_table->entries` #14179

set.c: Store `set_table->bins` at the end of `set_table->entries` #14179