-
-
Notifications
You must be signed in to change notification settings - Fork 8.2k
ESP32 without psram. Socket and Memory Split or ... . #14421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
“It appears that the issue arises when an asynchronous function is invoked in a synchronous manner. This behavior is particularly noticeable when there’s a significant delay(some time.sleep) in the code execution or when a piece of code takes an extended period to complete its operation in synchronous mode. It seems that asyncio has an impact on these scenarios.” |
When a call is executed within a thread, an error occurs - like that. However, when the call is executed outside of a thread, not that errors. But the WiFi functionality is compromised. Despite showing a connected status, data transmission is not occurring. Neither sending nor receiving of data is happening, and the ping operation is also failing. |
MicroPython is consuming all of the available memory in the ESP32 for its heap, and ESP-IDF is running out of memory for allocating new sockets. After memory has run out, it's likely the Wi-Fi will also stop working as it regularly allocates and frees buffers. You can see this happening at the moment the "GC: total" value goes up in mem_info output, and "max new split" drops to a very low number. "max new split" is the largest free memory block that ESP-IDF can use to allocate buffers for sockets and Wi-Fi. You can also call esp32.idf_heap_info() to confirm this. MicroPython only grows its heap when it needs to avoid a If you're not sure what is causing MicroPython to grow the heap, add some more |
It turns out that ESP-IDF does not have enough memory for wifi operation. Wifi stops working, but sta.isconnected() == True. Ping timeout from PC to ESP32. Everyone thinks everything is fine. Asyncio stream keeps writing as there are no errors. Only if you try to create new sockets, asyncio crashes with Error 5 EIO. Push Ctrl+c stop asyncio.
That look same - (#12819) Try build with CONFIG_LWIP_TCP_MSL=6000. build with LWIPCONFIG_LWIP_TCP_MSL=6000
Stop frame: result -> ping timeout and ... . |
@straga It might be related to the linked issue, but the root cause of ESP-IDF running out of memory is that MicroPython has already moved all the memory into the "Python heap". There are two separate heaps, Python heap and ESP-IDF heap. Even if you have free memory in the Python heap, ESP-IDF can't use it. So to prevent this issue, you need to stop the "GC: total: ..." number from ever increasing. Once this memory is added to the Python heap, it's no longer available for ESP-IDF even if it's free for Python... |
@projectgus I think it is necessary to return back that the buffer for the board without PS
RAM is static, then we will have a little less RAM, but wi-fi will not fall off and everything will work as before.
There will be 20 kilobytes less RAM but everything will work as before.
Or it is necessary to realise this redistribution in some other way.
|
@straga MicroPython is growing the heap automatically as your code is running, to prevent a MemoryError. From your logs:
This is early, there is enough other RAM for ESP-IDF.
This is later, not enough other RAM for ESP-IDF. If you find the function in your code that causes this number to increase and refactor it, then ESP-IDF will start working again. We might be able to add an option to MicroPython to make this simpler, as it's hard for MicroPython to know if it should choose to throw a MemoryError or to grow the heap. |
@projectgus Thanks for information. I am rewrite my code for use less usage ram as possible. Now works better. While free RAM enought for python and esp-idf, all right. Micropython got first all RAM or more agresive method. If micropython not shows not enought allocated RAM it ok for micropython. But when for esp-idf need more RAM and not free. ESP-IDF not print any just randomly can stop something (in my case wifi stack). Right ? |
@straga Yes, that's how it is at the moment. Glad that you got everything working. |
Using the ESP32-C3, after applying a patch and opening a new socket, the board freezes completely. The only way to recover is by pressing the hardware reset button or enabling the watchdog. This issue occurs on different boards as well, suggesting it’s not just related to RAM but could involve other factors too. The ping stops as a result of actions I take. At that moment, I open another socket connection to the board. Despite this, the board remains connected to MQTT and continues sending messages. However, when I start using multiple active socket connections (Telnet, FTP, HTTP), the board freezes. |
@straga Do you have a way for us to reproduce this hang? |
|
The fix from #15952 may help with this issue. Please try the latest nightly build |
ESP32 board However, when all components are active (which only involves up to six sockets), I encounter random freezes. The situation has improved compared to before, as the watchdog now restarts the board when it freezes.
When I attempt to connect to the board and download a 3KB file, the ping drops, and the left side freezes. You can see this behavior in this video: https://youtu.be/-Vujb0btDwc . If I run the asyncio code in a separate thread, you can observe the behavior in this video: https://youtu.be/MfxXQfAYu9g. It demonstrates how memory leaks occur due to the split between the split new and MicroPython, but the system doesn’t freeze—it continues running. The code remains unchanged, with the only difference being that asyncio is now executed in a thread. When running asyncio in a thread, the Wi-Fi occasionally stops working, even though ifconfig still shows an IP, and isconnected() == True. This behavior also seems to occur randomly. |
just in secret file set (ssid password), in board.yml mqtt (ip). main.py https://github.com/cpopp/MicroTelnetServer |
@straga Does it still hang on the latest nightly build? See #14421 (comment) |
@projectgus That was: |
@straga There shouldn't be a "dirty" label in the tag if you used the firmware from the download page. Did you build yourself with modifications? |
@dpgeorge Yes with my board configuration. Now same with: video: https://youtu.be/3zo4YU3zZ-w Only comment binary sensor: That with night build: Result same after loaded all, Try use FTP upload or download file. Freeze all, just wait watchdog reset. |
@straga Some changes recently merged (#16015) that may help with avoiding MemoryErrors in Python, so you could test with a recently nightly preview build and check for any improvement. However, looking at this latest screenshot you are simply running out of RAM. Low memory in ESP-IDF is causing the other weird behaviours like dropped packets and lack of response. There probably isn't anything MicroPython can do to help with this. You'll need to either rewrite your code to use less RAM (for example, by freezing modules into the firmware or other techniques) or you could look at switching to an ESP32 with PSRAM onboard, this will give you much more RAM to use. |
@projectgus I have an old board with an older version, maybe 1.21 or 1.22, that works the same way with no problems. That's why it confused me. I am now looking at defragmenting the memory. And if it's heavily defragmented. The maximum possible piece that can be allocated is too small, although the total amount of free memory is enough. I just did a board reset. I haven't figured out how to understand what went wrong at the system level. |
Is this in the ESP-IDF heap? |
How I am understand
Each tuple shows (total_size, free_size, largest_free_block, min_free_since_boot): 113840, 33744, 29696, 27544), # Region 5: Main heap (DRAM)
|
If you only care about data allocations then it's better to call To determine fragmentation you can do something like: info = esp32.idf_heap_info(esp32.HEAP_DATA)
largest_free_block = max(h[2] for h in info)
total_free = sum(h[1] for h in info) Will give you the largest free block and the total free bytes across all heaps. Comparing these two numbers will help you understand if this is actually fragmentation, or if you're simply running out of RAM (for fragmentation, total_free is high but largest_free_block is very low. If out of RAM, both numbers will be low.) |
I've found this discussion as I'm experiencing similar effects as @straga , although in slightly different use case. I'm attempting to make an MQTT TLS connection, which fails with different random errors, or sometimes just hangs. However, it succeeds if I have more than approx. 40KB of "max new split" (e.g. by trimming down the code). In my case, the memory usage is not high, but fragmented:
The behavior of doubling MicroPython heap doesn't help either - I cross 56000 bytes of heap only upon initialization/precomputation, then fall back below that. Unfortunately, the heap remains at 112000 bytes. One of the workarounds I'm thinking about is to allocate memory outside of MicroPython heap. This way, just after starting, one could allocate 100KB of memory, run initialization part of the code, which would effectively squeeze everything to available ~50KB, and then release those 100KB before connecting to MQTT. The end result would be a max new split of 100k, instead of 31k. I've tested this theory successfully with allocating a byte array (but unfortunately, since it's ending up on the heap, which I cannot resize back down, it's practically useless). So, all in all, I'm wondering if there is a way to allocate memory outside of MicroPython heap? Could this be a viable workaround in some cases? |
Quick update, the idea above is certainly possible and works... at least in my case.
After the "hack", the same code, same place (note I bumped initial stack from 56k->70k as well):
this doesn't give the whole picture, the available blocks here are 64k, 40k and 14k! Connection to wifi and subsequent MQTT w/ TLS is now possible. If this is useful for anyone, those are two functions which I added in micropython:
and this is how I'm using them:
Of course, you will need to tweak the block sizes for your situation. |
We've actually been down the path of trying to tackle this a bit, before. I had this PR which added a What we added instead was MICROPY_GC_INITIAL_HEAP_SIZE. If you're building a custom firmware anyway then suggest setting this value to the amount of heap that you know you'll need, so MicroPython doesn't need to try and grow the heap during startup. You could even set
If you're using v1.24 or earlier, there is another bug that causes TLS buffers to not be freed as early as they could be during a disconnect/reconnect cycle. The fix is in #16015 and is available in the nightly preview builds or the upcoming v1.25 release. |
Checks
I agree to follow the MicroPython Code of Conduct to ensure a safe and respectful space for everyone.
I've searched for existing issues matching this bug, and didn't find any.
Port, board and/or hardware
ESP32 withou PSRAM / ESP32-C3
MicroPython version
Micropython 23.0-preview.346.g64f28dc1e on 2024-05-03
Reproduction
Expected behaviour
No response
Observed behaviour
Connected to WIFI.
Already open 3 socket with FTP, TELNET, MQTT
All works
However, when I attempt to create an additional socket for testing purposes, I encounter an error:
OSError: [Errno 105] ENOBUFS
. Following this error, the WiFi functionality ceases to work.Additional Information
Any guidance on how to resolve this issue would be greatly appreciated.
The text was updated successfully, but these errors were encountered: