buildsystem/CI: Use of `BOARD_INSUFFCIENT_MEMORY` becomes a maintainace burdon #11128

maribu · 2019-03-07T14:22:40Z

Description

Currently for every test and application the BOARD_INSUFFICIENT_MEMORY variable needs to be manually maintained. This makes adding new boards with low RAM/FLASH a nightmare. Also, boards will practically never be removed from BOARD_INSUFFCIENT_MEMORY, even though newer toolchains and improvements in code could result in lower RAM/ROM requirements.

Additionally, the BOARD_INSUFFICIENT_MEMORY approach reduces compilation test coverage. E.g. a test will not be compiled at all when blacklisted via BOARD_INSUFFCIENT_MEMORY, but only the linking stage will fail because of insufficient RAM/flash. Any possible issue that the compilation stage would uncover remain unrevealed.
Update: Only the linking step is skipped when blacklisted via BOARD_INSUFFICIENT_MEMORY

Brainstorming of Possible Alternatives

Let make fail with a custom exit code upon linking stage. The CI can handle treat this return code not as an error
- Pros:
  - Relatively straight forward
- Cons:
  - No obvious disadvantages
Add a special Make target that overrides link time checks
- Pros:
  - Relatively straight forward
  - No changes in the CI required except for using that specific target
- Cons:
  - A really ugly hack
  - A user might not read the doc, notice that the make target "ci_build" magically makes the error go away and will try to flash the result
Add something like RAM_PROVIDED and FLASH_PROVIDED to every MCU, let boards override those in case of bootloaders. Add RAM_REQUIRED and FLASH_REQUIRED to every test and example
- Pros:
  - No obvious advantages
- Cons:
  - A lot of effort to add those to every board
  - A maintenance burden to keep them up to date
  - RAM/flash requirements depend on the toolchain, the CPU/board specific code, the name of the git branch baked into the "hello message" upon boot, and the alignment of the stars

The text was updated successfully, but these errors were encountered:

maribu · 2019-03-07T14:23:49Z

@RIOT-OS/maintainers: Please have a look and extend the brainstorming above as you see fit. (Both new possible approaches and pros/cons I didn't thought about.)

smlng · 2019-03-07T14:50:46Z

Funny, we had the same discussion over lunch today 😄

My conclusion was that all these lists, i.e., BOARD_WHITELIST, BOARD_BLACKLIST, BOARD_INSUFFICIENT_MEMORY, TEST_ON_CI_WHITELIST, and so on, are only used by Murdock-CI and to make that pass without errors by avoid building+testing stuff that will likely fail. And also to safe exec time on the CI.

So ideally (IMHO) we should get rid of such manually maintained lists in RIOT, and e.g. let compiling or linking fail. Because, as you already pointed out, the lists are hard to maintain and once a board is on such a list it likely stays there, even if issues are fixed. Again, these lists mostly serve one purpose: to make Murdock not fail and to me that's not how a CI should run.

I'm in favour of option 1. My 2 cents ...

jcarrano · 2019-03-07T14:51:56Z

I like (1). I'm trying to investigate what is the best way to reliably report that certain step in make failed, and also if there is a reliable way of knowing how much space it would have taken.

kaspar030 · 2019-03-08T16:43:42Z

Additionally, the BOARD_INSUFFICIENT_MEMORY approach reduces compilation test coverage. E.g. a test will not be compiled at all when blacklisted via BOARD_INSUFFCIENT_MEMORY, but only the linking stage will fail because of insufficient RAM/flash.

Not true, currently for all boards in BOARD_INSUFFICIENT_MEMORY, all compilation is done, just the linking step is skipped.

kaspar030 · 2019-03-08T17:18:34Z

So ideally (IMHO) we should get rid of such manually maintained lists in RIOT, and e.g. let compiling or linking fail. Because, as you already pointed out, the lists are hard to maintain and once a board is on such a list it likely stays there, even if issues are fixed.

Again, these lists mostly serve one purpose: to make Murdock not fail and to me that's not how a CI should run.

I don't think it is that simple. The lists represent what we expect to succeed. In these things, it is beneficial to be explicit.

In case of the "BOARDS_INSUFFICIENT_MEMORY", if we'd come up with a way to determine whether the link failed because - well, it failed, or because of insufficient memory, and treat the latter as an "OK fail", boards would cross the line without anyone noticing. A change could increase code size so much that a couple of low end boards suddenly don't fit anymore. The currently used size determination wouldn't catch this, as it only works with completed binaries.

In case of BOARD_BLACKLIST, there are many reasons why a board might be listed there.
Failing compilation is just one of the reasons. Others are e.g., known runtime problems, ....
While compilation could eaily be cought by CI, others are not.
So we'd end up with:

CI always failing. Unless we have a list of which compiles are "OK to fail".

This is important, as "CI succeeded" is a prerequisite for merging a PR.

CI producing binaries that don't work at runtime and the developer knows. Would be nice to document that somewhere. Maybe in a list that both devs and CI can use?
CI having much longer build times, because it wastes substantial amount of time building known failures.

I propose two things:

reduce maintenance burden. There are a couple of tries to make boards groupable, e.g., make: add board grouping #8062 and make: add architecture features and feature blacklisting #9081. Maybe we can have BOARD_INSUFFICIENT_MEMORY be populated by a group, e.g.,
BOARD_INSUFFICIENT_MEMORY += board_has_little_ram.
Report if the link would actually succeed. Thus, instead of just skipping the link if "CI_NO_LINK" is set, try, and if it succeeds, make that an error.

maribu · 2019-03-08T19:41:37Z

A change could increase code size so much that a couple of low end boards suddenly don't fit anymore.

The current approach is hardly solving this issue. Most tests and examples are already blacklisted for the lower end hardware anyway. And e.g. if a board still has 512B RAM free and one PR adds 511B more RAM requirements, this PR will likely get merged without anyone noticing. The next PR that adds 2B more and will however get all the blame.

To actually get feedback on which PRs do bloat RIOT, it would be better to provide statistics at the end of Murdock. E.g. did the number of link failures increase/decrease compared to the current master HEAD? What is the impact on the size .bss, .data and .text in average, what is the standard derivation and what are the outliers?

I'm aware this is something that will not happen over night. But considering the man power invested to keep BOARDS_INSUFFICIENT_MEMORY up to date, the time to develop this seems to be well spent.

kaspar030 · 2019-03-08T23:27:24Z

To actually get feedback on which PRs do bloat RIOT, it would be better to provide statistics at the end of Murdock. E.g. did the number of link failures increase/decrease compared to the current master HEAD?

The sizes are already collected. There are no comparisons ATM because it would require a re-build of master after each merge in order to get a proper baseline.

kaspar030 · 2019-03-08T23:35:12Z

if there is a reliable way of knowing how much space it would have taken.

AFAIK the ld flag --print-memory-usage also outputs stats if the link fails due to size.

maribu · 2019-03-09T06:03:04Z

it would require a re-build of master after each merge in order to get a proper baseline.

This might be a good idea regardless of this discussion. E.g. lets say for PR A and PR B the CI tests pass individually, but when both are merged they fail. Currently both could end up being merged with noone noticing the issue.

jcarrano · 2019-03-12T17:37:55Z

Currently both could end up being merged with noone noticing the issue.

True, but that's what nightly is for.

If we keep all the sizes from nighly and keep track of them we could spot any jump in FLASH usage.

The "real" solution would be to have reliable incremental builds. We'll get there eventually.

cladmi · 2019-03-13T16:16:46Z

The size are saved and accessible already:

Just look at the nightlies and for any run replace "output.html" by "sizes.json"

https://ci.riot-os.org/RIOT-OS/RIOT/master/d562af40e625485bc3e2eefdca8659c5b942ffc5/sizes.json

It was monitored for some time by @bergzand here https://riot-graphs.snt.utwente.nl/d/000000004/full-grid-diff-boards-vertical?orgId=1

stale · 2019-09-14T16:27:52Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. If you want me to ignore this issue, please mark it with the "State: don't stale" label. Thank you for your contributions.

cladmi mentioned this issue Mar 12, 2019

Makefile.include: RIOTNOLINK ensure linking fails #11168

Closed

cladmi mentioned this issue Jun 12, 2019

makefiles/murdock.inc.mk: change policy to run tests by default #11680

Merged

2 tasks

miri64 mentioned this issue Sep 10, 2019

BOARD_INSUFFICIENT_MEMORY alignment #9965

Closed

stale bot added the State: stale State: The issue / PR has no activity for >185 days label Sep 14, 2019

maribu mentioned this issue Oct 9, 2019

examples: Moved CI infos to Makefile.ci #12406

Merged

stale bot closed this as completed Oct 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

buildsystem/CI: Use of `BOARD_INSUFFCIENT_MEMORY` becomes a maintainace burdon #11128

buildsystem/CI: Use of `BOARD_INSUFFCIENT_MEMORY` becomes a maintainace burdon #11128

maribu commented Mar 7, 2019 •

edited

Loading

maribu commented Mar 7, 2019

smlng commented Mar 7, 2019

jcarrano commented Mar 7, 2019

kaspar030 commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

maribu commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

maribu commented Mar 9, 2019

jcarrano commented Mar 12, 2019

cladmi commented Mar 13, 2019

stale bot commented Sep 14, 2019

buildsystem/CI: Use of BOARD_INSUFFCIENT_MEMORY becomes a maintainace burdon #11128

buildsystem/CI: Use of BOARD_INSUFFCIENT_MEMORY becomes a maintainace burdon #11128

Comments

maribu commented Mar 7, 2019 • edited Loading

Description

Brainstorming of Possible Alternatives

maribu commented Mar 7, 2019

smlng commented Mar 7, 2019

jcarrano commented Mar 7, 2019

kaspar030 commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

maribu commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

kaspar030 commented Mar 8, 2019

maribu commented Mar 9, 2019

jcarrano commented Mar 12, 2019

cladmi commented Mar 13, 2019

stale bot commented Sep 14, 2019

buildsystem/CI: Use of `BOARD_INSUFFCIENT_MEMORY` becomes a maintainace burdon #11128

buildsystem/CI: Use of `BOARD_INSUFFCIENT_MEMORY` becomes a maintainace burdon #11128

maribu commented Mar 7, 2019 •

edited

Loading