Fix sat math #100

calebzulawski · 2021-04-20T01:00:00Z

Looks good to me, just want to merge it in

workingjubilee · 2021-04-21T06:22:03Z

Looks like PPC now emits an LLVM Error on saturating math on SimdI128. How do we want to proceed?

programmerjake · 2021-04-21T07:20:49Z

iirc, current PPC (until SimpleV comes around) only has 128-bit SIMD: how about just leave off #[repr(simd)] for 128-bit integers?

programmerjake · 2021-04-21T07:22:04Z

or, just get rustc to use the fall-back scalar code for that case for the simd intrinsics

calebzulawski · 2021-04-21T14:15:07Z

There's no reason you can't do 128 bit saturating math in a 128 bit vector. Definitely an LLVM issue. We should probably report it and use a scalar fallback for now.

Also worth noting this would fail on master, and only shows up because of the fixed tests.

workingjubilee · 2021-04-21T15:56:02Z

Something was sus about the last merge, so rerunning CI.

programmerjake · 2021-04-21T16:06:27Z

There's no reason you can't do 128 bit saturating math in a 128 bit vector. Definitely an LLVM issue. We should probably report it and use a scalar fallback for now.

True. I guess my point was that since PPC only supports 128-bit vectors and since a 128-bit vector of i128 should be equivalent to scalar i128, having #[repr(simd)] shouldn't give any performance advantage.

workingjubilee · 2021-04-21T16:09:32Z

error: failed to unpack package console_error_panic_hook v0.1.6

Caused by:
failed to iterate over archive

well that wasn't what I was expecting.
Going to assume the merge errors were spurious.

workingjubilee · 2021-04-21T16:12:17Z

There's no reason you can't do 128 bit saturating math in a 128 bit vector. Definitely an LLVM issue. We should probably report it and use a scalar fallback for now.

True. I guess my point was that since PPC only supports 128-bit vectors and since a 128-bit vector of i128 should be equivalent to scalar i128, having #[repr(simd)] shouldn't give any performance advantage.

#[repr(simd)] is about defining SIMD compatible types from our perspective, and it must be platform agnostic. I would be willing to drop SimdI128 and SimdU128 if we discern however that even AVX2 implements u128x2 / i128x2 "in software" rather than having any special instructions for them and that such is a disqualifying metric.

programmerjake · 2021-04-21T16:25:52Z

#[repr(simd)] is about defining SIMD compatible types from our perspective, and it must be platform agnostic. I would be willing to drop SimdI128 and SimdU128 if we discern however that even AVX2 implements u128x2 / i128x2 "in software" rather than having any special instructions for them and that such is a disqualifying metric.

Guess we're dropping i128 vectors then:
https://gcc.godbolt.org/z/o1bGT859a

even bitwise or is scalarized when avx512dq is enabled -- for both i128x2 and i128x4

programmerjake · 2021-04-21T16:30:42Z

#[repr(simd)] is about defining SIMD compatible types from our perspective, and it must be platform agnostic. I would be willing to drop SimdI128 and SimdU128 if we discern however that even AVX2 implements u128x2 / i128x2 "in software" rather than having any special instructions for them and that such is a disqualifying metric.

Guess we're dropping i128 vectors then:
https://gcc.godbolt.org/z/o1bGT859a

even bitwise or is scalarized when avx512dq is enabled -- for both i128x2 and i128x4

(Edit: misread assembly, clang and gcc do have matching abis) gcc and clang don't even agree on how to pass i128x2 and i128x4 -- gcc passes them in avx2 and avx512 registers, clang passes them in memory.

Both gcc and clang scalarize a bitwise or:
https://gcc.godbolt.org/z/cTKTsfvrv

calebzulawski · 2021-04-21T16:40:16Z

I'd be more interested in implementing these in rust with smaller integers than dropping them entirely.

calebzulawski · 2021-04-23T00:30:57Z

@workingjubilee I opened an LLVM bug and stdsimd bug (#104)

workingjubilee · 2021-04-24T05:02:04Z

I'd be more interested in implementing these in rust with smaller integers than dropping them entirely.

Well, if we're going to write a 2-limb vectorized BigInt why not just add full-featured vectorized BigInts?

calebzulawski · 2021-04-25T01:58:33Z

Because rust only has i128 right now :)

workingjubilee · 2021-04-25T21:21:09Z

Mm.

Even if we resolve the kind of quasi-philosophical objection here, I am somewhat concerned about adding operations we don't actually have reasonably efficient implementations for, unless we intend to bugfix them immediately, or if we are confident that they are useful if you enable higher-level SIMD features. We have actually managed to take things out and then (ahem) "circle back" and readd them once we solved the problem on some level, too, so I think we should do that again here.

It's not much of a SimdU128 if it's not actually doing any SIMD operations even with all the bells and whistles turned on, is it?

calebzulawski · 2021-04-25T21:52:22Z

Well, we can transmute to another type for bit ops, and then use an explicit non-vector implementation for arithmetic with the intention of adding better arithmetic in the future.

calebzulawski · 2021-04-26T01:01:30Z

crates/core_simd/src/math.rs

-            /// let unsat = x.abs();
-            /// let sat = x.saturating_abs();
-            #[doc = concat!("assert_eq!(unsat, ", stringify!($name), "::from_array([MIN, 2, 0, 3]);")]
+            #[doc = concat!("let xs = ", stringify!($name), "::from_array([MIN, -2, 0, 3]);")]


hmm why xs?

mild case of whim / there's a common pattern of for x in xs
thus, implied plurality (a vector)

Personally I would just go with x I think, it is a vector but we're not really concerned about the particular lanes

Also I realized this is my own PR so I can't "approve" anything, but everything looks good to me, don't let this comment hold up a merge

calebzulawski requested a review from workingjubilee April 20, 2021 01:00

workingjubilee closed this Apr 21, 2021

workingjubilee reopened this Apr 21, 2021

workingjubilee mentioned this pull request Apr 25, 2021

Simd{U,I}128 blocked on... LLVM maybe? #108

Open

3 tasks

calebzulawski and others added 5 commits April 25, 2021 16:42

Fix saturating math docs

1f4e902

Finish fixing up abs docs

e8b6bca

Branchless abs

91134e6

Move lanes_at_most_64 to _32

f06427f

Remove Simd{U,I}128

92d643b

workingjubilee force-pushed the fix-sat-math branch from f9a2d83 to 92d643b Compare April 25, 2021 23:45

calebzulawski commented Apr 26, 2021

View reviewed changes

workingjubilee merged commit a9a1c9d into master Apr 26, 2021

calebzulawski deleted the fix-sat-math branch August 7, 2021 18:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix sat math #100

Fix sat math #100

calebzulawski commented Apr 20, 2021 •

edited

Loading

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

programmerjake commented Apr 21, 2021

calebzulawski commented Apr 21, 2021 •

edited

Loading

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

workingjubilee commented Apr 21, 2021

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

programmerjake commented Apr 21, 2021 •

edited

Loading

calebzulawski commented Apr 21, 2021

calebzulawski commented Apr 23, 2021

workingjubilee commented Apr 24, 2021

calebzulawski commented Apr 25, 2021

workingjubilee commented Apr 25, 2021 •

edited

Loading

calebzulawski commented Apr 25, 2021

calebzulawski Apr 26, 2021

workingjubilee Apr 26, 2021 •

edited

Loading

calebzulawski Apr 26, 2021

calebzulawski Apr 26, 2021

Fix sat math #100

Fix sat math #100

Conversation

calebzulawski commented Apr 20, 2021 • edited Loading

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

programmerjake commented Apr 21, 2021

calebzulawski commented Apr 21, 2021 • edited Loading

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

workingjubilee commented Apr 21, 2021

workingjubilee commented Apr 21, 2021

programmerjake commented Apr 21, 2021

programmerjake commented Apr 21, 2021 • edited Loading

calebzulawski commented Apr 21, 2021

calebzulawski commented Apr 23, 2021

workingjubilee commented Apr 24, 2021

calebzulawski commented Apr 25, 2021

workingjubilee commented Apr 25, 2021 • edited Loading

calebzulawski commented Apr 25, 2021

calebzulawski Apr 26, 2021

Choose a reason for hiding this comment

workingjubilee Apr 26, 2021 • edited Loading

Choose a reason for hiding this comment

calebzulawski Apr 26, 2021

Choose a reason for hiding this comment

calebzulawski Apr 26, 2021

Choose a reason for hiding this comment

calebzulawski commented Apr 20, 2021 •

edited

Loading

calebzulawski commented Apr 21, 2021 •

edited

Loading

programmerjake commented Apr 21, 2021 •

edited

Loading

workingjubilee commented Apr 25, 2021 •

edited

Loading

workingjubilee Apr 26, 2021 •

edited

Loading