[FEA] Consider disabling `--expt-relaxed-constexpr` #7795

jrhemstad · 2021-03-31T20:14:16Z

Is your feature request related to a problem? Please describe.

--expt-relaxed-constexpr is a convenient way to reuse existing constexpr host code, e.g., things like std::max.

However, it can lead to some pretty surprising behavior. Consider:

constexpr int bar(int j){
    if(j<0){
        throw;
    }
    return 42;
}
__global__ void kernel(int * i){
    *i = bar(-1);
}

https://godbolt.org/z/frb8c6cd7

One might expect this to fail to compile as throw is not valid in device code. However, not only does it happily compile, but it just stores the value 42.

This example looks pretty harmless:

int foo(int i){
    return i * 2;
}
constexpr int bar(int j){
    if(j<0){
        return foo(j);
    }
    return 42;
}
__global__ void kernel( int * i){
    *i = bar(-1);
}

But this too results in an ill-formed program without a diagnostic.

https://godbolt.org/z/aTzGaMrGd

Describe the solution you'd like

We should think pretty hard about if we want to risk such egregious undefined behavior in libcudf.

As such, we may want to consider moving towards disabling --expt-relaxed-constexpr. At the very least, we should be preferring CUDA_HOST_DEVICE_CALLABLE whenever possible (for functions that need be called from both host and device).

Additional Context

The only place it is 100% safe to use a constexpr function in device code with --expt-relaxed-constexpr is when used in a context that requires constant evaluation. Then it will fail to compile if the constexpr function contains things that would result in an ill-formed program: https://godbolt.org/z/47qfnPnc9

The text was updated successfully, but these errors were encountered:

harrism · 2021-04-01T02:13:32Z

@karthikeyann please be aware of this with respect to #7713

harrism · 2021-04-01T02:14:57Z

At the very least, we should be preferring CUDA_HOST_DEVICE_CALLABLE whenever possible.

No, we should prefer CUDA_DEVICE_CALLABLE or no annotation whenever possible. We should use CUDA_HOST_DEVICE_CALLABLE only for functions that are required to be called from both host and device.

jrhemstad · 2021-04-01T13:46:47Z

only for functions that are required to be called from both host and device.

This was implied. The whole point of --expt-relaxed-constexpr is to call existing host constexpr functions from device, which implies it can be/is called from both host and device.

harrism · 2021-04-05T12:09:14Z

I want to be very explicit because it is not obvious to everyone (a fact that is evident in our code).

github-actions · 2021-05-05T12:25:15Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

harrism · 2021-05-06T01:08:43Z

Still valid

PointKernel · 2024-08-30T22:02:36Z

This seems to be unblocked with the recent CCCL update.

The changes in NVIDIA cuCollections PR #595 show that cuco can now build and run properly without relaxed constexprs.

Previously, calling `narrow<>()` from CUDA device code should have resulted in a compile error but but did not due to a weird bug in NVCC pertaining to the experimental "--expt-relaxed-constexpr" flag: rapidsai/cudf#7795 This commit makes two changes to mitigate this problem: (1) `narrow<>()` no longer responds to `gsl_CONFIG_CONTRACT_VIOLATION_THROWS` because it does not do contract checking. Therefore, it plainly fails to compile for `gsl_CONFIG( NARROW_THROWS_ON_TRUNCATION )` if exceptions are unavailable (e.g. in device code). (2) For `!gsl_CONFIG( NARROW_THROWS_ON_TRUNCATION )`, `narrow<>()` now makes sure that the program is terminated by issuing a trap instruction if `std::terminate()` is not available.

* Do not call `std::terminate()` in CUDA device code Previously, calling `narrow<>()` from CUDA device code should have resulted in a compile error but but did not due to a weird bug in NVCC pertaining to the experimental "--expt-relaxed-constexpr" flag: rapidsai/cudf#7795 This commit makes two changes to mitigate this problem: (1) `narrow<>()` no longer responds to `gsl_CONFIG_CONTRACT_VIOLATION_THROWS` because it does not do contract checking. Therefore, it plainly fails to compile for `gsl_CONFIG( NARROW_THROWS_ON_TRUNCATION )` if exceptions are unavailable (e.g. in device code). (2) For `!gsl_CONFIG( NARROW_THROWS_ON_TRUNCATION )`, `narrow<>()` now makes sure that the program is terminated by issuing a trap instruction if `std::terminate()` is not available. Some drive-by changes: * Update Xcode toolset versions [ci skip] * Update CI configuration, add newer compilers * Remove GCC 10 from macOS test suite Results in weird errors in libunwind ("_Unwind_GetTextRelBase - _Unwind_GetTextRelBase() not implemented") which I have no interest in tracking down.

vyasr · 2024-12-06T18:49:44Z

NVIDIA/cuCollections#595 and rapidsai/cuspatial#1494 inspired me to look into this again. It's slow going (but low effort) to investigate, so I'll report back when I get compilation far enough to make a determination on whether removing this setting will be possible in cudf. I've run into one very odd case that seemed like a compiler bug, but other than that it seems largely like a lot of tedious but not difficult work.

Contributes to #7795. Also contributes to rapidsai/build-planning#76. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Nghia Truong (https://github.com/ttnghia) - Yunsong Wang (https://github.com/PointKernel) - Bradley Dice (https://github.com/bdice) - David Wendt (https://github.com/davidwendt) URL: #17545

Contributes to #7795 This PR updates `text` to build without depending on the relaxed constexpr build option. Authors: - Yunsong Wang (https://github.com/PointKernel) Approvers: - Basit Ayantunde (https://github.com/lamarrr) - Bradley Dice (https://github.com/bdice) - David Wendt (https://github.com/davidwendt) URL: #17647

Contributes to #7795 This PR updates `binaryop` to build without depending on the relaxed constexpr build option. Authors: - Yunsong Wang (https://github.com/PointKernel) Approvers: - David Wendt (https://github.com/davidwendt) - Vukasin Milovanovic (https://github.com/vuule) URL: #17598

jrhemstad added feature request New feature or request Needs Triage Need team to review and classify labels Mar 31, 2021

jrhemstad added libcudf Affects libcudf (C++/CUDA) code. code quality and removed Needs Triage Need team to review and classify labels Mar 31, 2021

harrism mentioned this issue Apr 26, 2021

[FEA] use constexpr instead of __host__ __device__ whenever possible. #7713

Closed

github-actions bot added the inactive-30d label May 5, 2021

jrhemstad added 0 - Backlog In queue waiting for assignment and removed inactive-30d labels May 5, 2021

harrism mentioned this issue May 5, 2021

Use rmm::device_uvector in place of rmm::device_vector in cuIO #8151

Merged

jrhemstad mentioned this issue Jun 30, 2021

Enable AST-based joining #8214

Merged

jrhemstad mentioned this issue Sep 30, 2021

Use optional-iterator for copy-if-else kernel #9324

Merged

jrhemstad mentioned this issue Mar 7, 2022

Faster struct row comparator #10164

Merged

jrhemstad mentioned this issue May 18, 2022

Make sentinel constructors constexpr NVIDIA/cuCollections#155

Merged

PointKernel mentioned this issue Jun 14, 2022

Refactor lists::contains #11019

Merged

davidwendt mentioned this issue Jun 29, 2022

Support nth_element for window functions #11158

Merged

ttnghia mentioned this issue Jul 15, 2022

Use stod in cuio floating point parsing #11190

Closed

vyasr mentioned this issue Dec 20, 2022

Compile times are growing significantly #581

Closed

PointKernel mentioned this issue Oct 10, 2023

Enable indexalator for device code #14206

Merged

3 tasks

vyasr removed the code quality label Feb 23, 2024

mbeutel mentioned this issue Oct 31, 2024

Do not call std::terminate() in CUDA device code gsl-lite/gsl-lite#351

Merged

harrism mentioned this issue Dec 3, 2024

[FEA]: Remove --expt-relaxed-constexpr from NVCC options rapidsai/cuspatial#1494

Closed

davidwendt mentioned this issue Dec 6, 2024

Fix nvcc-imposed UB in constexpr functions #17534

Merged

3 tasks

vyasr mentioned this issue Dec 6, 2024

Mark more constexpr functions as device-available #17545

Merged

3 tasks

vyasr assigned PointKernel and vyasr Dec 6, 2024

This was referenced Dec 16, 2024

Enable binaryop build without relying on relaxed constexpr #17598

Merged

Enable text build without relying on relaxed constexpr #17647

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Consider disabling `--expt-relaxed-constexpr` #7795

[FEA] Consider disabling `--expt-relaxed-constexpr` #7795

jrhemstad commented Mar 31, 2021 •

edited

Loading

harrism commented Apr 1, 2021

harrism commented Apr 1, 2021 •

edited

Loading

jrhemstad commented Apr 1, 2021

harrism commented Apr 5, 2021

github-actions bot commented May 5, 2021

harrism commented May 6, 2021

PointKernel commented Aug 30, 2024

vyasr commented Dec 6, 2024

[FEA] Consider disabling --expt-relaxed-constexpr #7795

[FEA] Consider disabling --expt-relaxed-constexpr #7795

Comments

jrhemstad commented Mar 31, 2021 • edited Loading

harrism commented Apr 1, 2021

harrism commented Apr 1, 2021 • edited Loading

jrhemstad commented Apr 1, 2021

harrism commented Apr 5, 2021

github-actions bot commented May 5, 2021

harrism commented May 6, 2021

PointKernel commented Aug 30, 2024

vyasr commented Dec 6, 2024

[FEA] Consider disabling `--expt-relaxed-constexpr` #7795

[FEA] Consider disabling `--expt-relaxed-constexpr` #7795

jrhemstad commented Mar 31, 2021 •

edited

Loading

harrism commented Apr 1, 2021 •

edited

Loading