Towards possibility of precise garbage collector #1161

pfalcon · 2015-03-17T00:29:35Z

It seems that many approaches of further evolution of GC in uPy depend on being able to do precise GC, i.e. tell which fields with an object are pointers are which are not. To achieve this, each allocated memory block has to be explicitly typed (and from type, internal layout can be inferred). Way to achieve this while staying compatible with conservative GC and without updating code largely is to have to set of memory allocation routines: one set is to deal with uPy objects (which already have type header is structure member) and another set to deal with "raw" memory allocations (which will need type header added implicitly for precise GC, and nothing added for conservative GC).

We currently have ~dozen functions to do memalloc. The above means doubling them. Other approach would be to reserve all current functions to uPy objects, add just add 2 funcs for "raw" memory: m_malloc() (will use implicit "all pointers inside" type) and m_malloc_typed() which takes explicit type for allocation.

Other thoughts?

dpgeorge · 2015-03-17T00:40:38Z

It's difficult to be 100% precise because you also need to track the "type" of objects on the stack and in registers.

pfalcon · 2015-03-17T00:55:51Z

Well, I don't know what to say here, but there gotta be way to deal with that. E.g., make sure that any pointers are flushed to the heap or static rootset at checkpoints when GC may happen. There're surely many steps to have memory-efficient compacted heap, and having obvious deficiencies in current memalloc preclude even experimenting with them.

dpgeorge · 2015-03-17T22:12:57Z

We can learn from high quality GC implementations such as V8. See eg this blog post http://jayconrod.com/posts/55/a-tour-of-v8-garbage-collection for a nice overview.

pfalcon · 2015-03-17T22:22:37Z

Sure, but what about deciding on the down-to-earth question this ticket poses: how to distinguish object and non-object allocation?

dpgeorge · 2015-03-17T22:39:37Z

I have thought a lot about how to improve the memory manager and the reason I haven't done anything concrete yet is because of these issues of being 100% precise and of compacting in the presence of interrupts. I wanted to understand how to fully solve the problems before going in a particular direction, and writing code that may end up being unused.

We currently have ~dozen functions to do memalloc. The above means doubling them.

I don't see a problem with having more functions (they are mostly macro wrappers anyway). A more specific malloc function gives more information about your intentions for that memory, and may allow further optimisations with other kinds of memory managers.

To achieve this, each allocated memory block has to be explicitly typed

Yes. For raw memory regions could make a special uPy type which tags them (eg mp_type_raw_memory) and then all allocated chunks within the heap start with a 1-word pointer to an object of type struct mp_obj_type_t.

pfalcon · 2015-03-17T23:49:02Z

I have thought a lot about how to improve the memory manager and the reason I haven't done anything concrete yet is

Yes, I thought a lot too, but I didn't try to do much, because even for experimenting, a lot of prerequisites are required. Now that there're even new people appearing who would like to experiment, maybe it's to move in that direction ;-).

I don't see a problem with having more functions (they are mostly macro wrappers anyway).

I don't have problems with having many functions as we have already, but doubling them - I'm not sure. One of the aim of the refactoring is to make memory management/GC more pluggable, and just understanding (re)defining couple of dozens of functions is a chore and not really welcoming. So, what about 2nd approach I propose - we keep existing functions for objects, and get just couple of functions for "raw" allocations. The latter will of course require auditing and possible updating of existing code, and by the amount of refactoring required, we can see if it makes sense, and if we lose any additional semantics by using bare "malloc" for raw allocations.

all allocated chunks within the heap start with a 1-word pointer to an object of type struct mp_obj_type_t.

Yes, that's the idea (to make that possible, why at the same time not enforce it, and work with existing conservative C with minimal changes).

pfalcon · 2015-03-19T21:58:24Z

Well, not that easy. What I was thinking is that mp_raw_alloc() would allocate block with type header, offset it, and return. But for GC, ptrs stored in objects should be to the head of block. So "transparent" scheme won't work, all requesters of raw memory should be fully aware that they receive a kind of "handle", use that to store in ptr fields, but do explicit cast to access the underlying memory with something like MP_RAW_DATA().

Yeah, it's not easy to retrofit more advanced memory management into existing code, but the longer we won't do that, the more it will fall behind the competition.

And we should have the same cast-accessor to get an mp_obj_t from memory (because in memory it may be stored in other form than just a full pointer).

dpgeorge · 2015-03-24T00:10:16Z

But for GC, ptrs stored in objects should be to the head of block.

Why? Why can't the invariant be that all ptrs to head allocated RAM point to 1 word beyond the head?

pfalcon · 2015-03-24T09:26:28Z

Why? Why can't the invariant be that all ptrs to head allocated RAM point to 1 word beyond the head?

Invariant for all can be anything. But here we talk about how to differentiate pointers to objects from pointers to non-objects (both of which should be differentiated from non-pointers). And you can't store pointers to objects as ptr-to-head and to non-objects as ptr-past-head.

dpgeorge · 2015-03-24T10:29:44Z

But here we talk about how to differentiate pointers to objects from pointers to non-objects

I still don't understand. Aren't all pointers into the heap now pointers to objects? (Because all allocated chunks on the heap start with a type identification word.)

pfalcon · 2015-03-24T11:05:42Z

Hello,

On Tue, 24 Mar 2015 03:29:47 -0700
Damien George notifications@github.com wrote:

But here we talk about how to differentiate pointers to objects
from pointers to non-objects

I still don't understand. Aren't all pointers into the heap now
pointers to objects? (Because all allocated chunks on the heap start
with a type identification word.)

Just have a look at "typedef struct _mp_obj_str_t": "const byte *data;"
is not pointer to object, it's pointer to "raw memory" (which may
have its own structure, irrelevant), and being worked on as such
(str->data[0] is first byte of raw memory, not anything else).

So, we have 2 choices:

My original, underthought way: we allocate block, put type header at
first word, store pointer past that to str->data. Problem: ->data no
longer recognized as pointer per invariant "all ('liveness') pointers
point at the beginning of allocated block, not inside".
Have an explicit cast from "raw memory handle" to "raw memory
pointer". Handles are stored in objects, pointers are used to access
memory. This is parallel to similar explicit distinction between object
pointers and handles discussed in the other ticket.

Best regards,
Paul mailto:pmiscml@gmail.com

dpgeorge · 2015-03-27T23:16:34Z

Ok I now understand, thanks for clarification!

My original thinking was that type-tagging of memory chunks would be transparent to the application, but I see what you are describing is a tight coupling between the memory manager and the application (which it needs to be if we want efficiency).

We can do the type-tagging many different ways. For example:

GC allocates a chunk of memory, stores "type" info in the first word, offsets, and returns the offset pointer. Invariant is that all pointers point to 1-word beyond the start of the chunk. A uPy object must then store the uPy type pointer in the "first" (actually second) word of the allocated memory. A raw pointer (such as str data) would just store characters straight away. This wastes 1 word for all uPy objects.
GC allocates a chunk of memory, stores "type" info in the first word. If "type" is a uPy object then the GC sets a special bit for that chunk and returns the pointer without offset. If "type" is not a uPy object then the GC clears the special bit for that chunk and returns the pointer with a 1-word offset. The special bit is used to distinguish pointers that point to start, and those that point inside.
GC allocates a chunk of memory, stores "type" info in the first word and returns pointer without offset. It's up to the application to offset the pointer for raw memory access.

The first 2 approaches are transparent to the application. Approach 2 is more complicated than approach 1 but uses less RAM overhead. Approach 3 is not transparent to the application and potentially uses more ROM to do the offsetting at many points in the code.

Anyway, anyway, I'm sure there are many other ways of doing it. Point is that we should keep an open mind and make sure the macro wrappers we use can accommodate all these different schemes, since it's not clear a priori which is better.

Neon22 · 2015-03-28T02:28:07Z

Or one (or more) bitfield arrays which indicate the type or other useful aspect. (unreferenced so can be collected, pointers that start, pointers that start inside, type, etc)
However this could slow down as many accesses might be needed for some specific ops vs saving memory.

stinos · 2015-03-28T09:16:01Z

Was wondering: is it ok for a malloc-like allocator function to return pointers which are word-aligned?

dpgeorge · 2015-03-28T13:20:24Z

is it ok for a malloc-like allocator function to return pointers which are word-aligned?

As opposed to..? The specs are that malloc returns a pointer "suitably aligned for any built-in type".

danicampora · 2015-03-28T13:26:06Z

True, AFAIK, every malloc implementation works as @dpgeorge says.

On Mar 28, 2015, at 2:20 PM, Damien George notifications@github.com wrote:

is it ok for a malloc-like allocator function to return pointers which are word-aligned?

As opposed to..? The specs are that malloc returns a pointer "suitably aligned for any built-in type".

—
Reply to this email directly or view it on GitHub.

stinos · 2015-03-28T17:30:44Z

Ok, just asking.

adritium · 2018-05-31T20:10:04Z

This issue dead? It's been 3+ years.

dpgeorge · 2018-06-01T06:59:04Z

It's still open/active, and still interesting.

adritium · 2018-06-01T07:01:05Z

It seems that a breaking change in pointer management and API must be introduced because of need for double indirection.

Is this something you’d consider?

dpgeorge · 2018-06-01T11:57:22Z

It seems that a breaking change in pointer management and API must be introduced because of need for double indirection.

For sure there needs to be a big change to make the GC precise. And even more change to get to a compacting collector. IMO this can't be done by incremental changes to master, but rather it would be worked on in a separate branch to investigate how feasible it is first, and find out if there are any big issues that have not yet been understood.

Nrf52 neopixel

adritium · 2018-09-20T15:18:33Z

@pfalcon

Well, I don't know what to say here, but there gotta be way to deal with that.

Guile traces the C stack and static data segment conservatively. That is to say, Guile just treats every word on the C stack and every C global variable as a potential reference in to the Scheme heap

pfalcon added the rfc Request for Comment label Mar 17, 2015

ryannathans mentioned this issue May 9, 2016

Failure to allocate when free memory is very large - severe fragmentation? #2057

Closed

nevercast mentioned this issue Jan 28, 2018

Backport CircuitPython GC changes in to uPy #3586

Open

jpochyla mentioned this issue May 31, 2018

enhancement - garbage collection - add Concurrent Mark and Sweep (CMS) #3792

Open

nickzoic pushed a commit to nickzoic/micropython that referenced this issue Sep 6, 2018

Merge pull request micropython#1161 from hathach/nrf52_neopixel

a0872e8

Nrf52 neopixel

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Towards possibility of precise garbage collector #1161

Towards possibility of precise garbage collector #1161

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

pfalcon commented Mar 19, 2015

dpgeorge commented Mar 24, 2015

pfalcon commented Mar 24, 2015

dpgeorge commented Mar 24, 2015

pfalcon commented Mar 24, 2015

dpgeorge commented Mar 27, 2015

Neon22 commented Mar 28, 2015

stinos commented Mar 28, 2015

dpgeorge commented Mar 28, 2015

danicampora commented Mar 28, 2015

stinos commented Mar 28, 2015

adritium commented May 31, 2018

dpgeorge commented Jun 1, 2018

adritium commented Jun 1, 2018

dpgeorge commented Jun 1, 2018

adritium commented Sep 20, 2018

Towards possibility of precise garbage collector #1161

Towards possibility of precise garbage collector #1161

Comments

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

dpgeorge commented Mar 17, 2015

pfalcon commented Mar 17, 2015

pfalcon commented Mar 19, 2015

dpgeorge commented Mar 24, 2015

pfalcon commented Mar 24, 2015

dpgeorge commented Mar 24, 2015

pfalcon commented Mar 24, 2015

dpgeorge commented Mar 27, 2015

Neon22 commented Mar 28, 2015

stinos commented Mar 28, 2015

dpgeorge commented Mar 28, 2015

danicampora commented Mar 28, 2015

stinos commented Mar 28, 2015

adritium commented May 31, 2018

dpgeorge commented Jun 1, 2018

adritium commented Jun 1, 2018

dpgeorge commented Jun 1, 2018

adritium commented Sep 20, 2018