Initial Support for the RISC-V Vector Extension in ARM NEON #1130

eric900115 · 2024-02-14T15:04:02Z

Hi everyone,

This is Eric from the National Tsing Hua University (NTHU) pllab. This PR begins the conversion of NEON intrinsics to the RISC-V Vector Extension (RVV) in SIMDe.

NTHU pllab and Andes Technology have collaborated to convert NEON intrinsics to the RISC-V Vector Extension, and we have converted all NEON intrinsics to RVV intrinsics. This PR marks the beginning of our work. We will soon upstream all of our work.

We made a few changes in the SIMDe repo to suit our needs:

We modified type.h to enable RVV types.
We modified files related to memcpy due to memory pollution issues in our implementation.
We modified files related to memory load and store for RVV implementation.
The initial conversion from Neon to RVV can be viewed in add.h and mul.h.
For CI, we've added clang-qemu-rvv in the CI.yml file for RVV testing. The testing runs on a Docker container with Ubuntu 23.10 because we need QEMU 8.0 (which is not available in Ubuntu 22.04). Later, when Ubuntu 24.04 is released on GitHub Actions, testing can be hosted on the Ubuntu 24.04 GitHub Actions machine.

We have included clang-qemu-rvv testing for the following RISC-V V Extension architectures, both with and without ZVFH enabled:

vlen = 128 & elen = 64
vlen = 256 & elen = 64
vlen = 512 & elen = 64

To compile SIMDe with support for the conversion from NEON to the RISC-V Vector Extension, please use Clang-17 and include the flag -mrvv-vector-bits=<vector_length_of_vector_machine> during compilation. Replace <vector_length_of_vector_machine> with the actual vector length (VLEN, in bits) of the target RISC-V vector machine.

* gh-actions macos: skip coverage report

fix : ci.yml
feat : modify types.h for risc-v vector extension
feat : modify simde utilities for rvv
fix : type.h copyright
fix : ci files name
feat : modify load & store for risc-v v extension
feat : modify load & store for risc-v vector
fix : ci file
fix : reinterpret (due to rvv mem pollution)
feat : add and mul neon to rvv
fix : copyright
feat : modify ci.yml
feat : remove TODO

mr-c

Very exciting! Has any testing been done on real RISCV64 RVV 1.0 hardware?

mr-c · 2024-02-14T15:18:29Z

simde/arm/neon/ld1.h

-    simde_memcpy(&r_, ptr, sizeof(r_));
+    #if defined(SIMDE_RISCV_V_NATIVE) && SIMDE_ARCH_RISCV_ZVFH
+      r_.sv64 = __riscv_vle16_v_f16m1((_Float16 *)ptr , 4);
+    #else
+      simde_memcpy(&r_, ptr, 8);


Can the sizeof version stay in the non-RISCV_V branches?

Is that not working for the native RVV types a compiler bug?

Yes, sizeof can remain in the non-RVV branch. I will modify them.

Update : I've already reverted sizeof in SIMDe implementation.

There are no bugs when compiling RVV types. However, the size of simde_xxx_private unions in type.h may not be as expected. The size of these unions can vary due to the differing lengths of RVV vector machines.

To be more specific, the size of simde_uint16x4_private will be 64 bits on vector machines other than RVV. However, for RVV with a VLEN (vector length) of 512, the union size of simde_uint16x4_private may expand to 512 bits, due to the limitations of types in RVV.
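
The size effect described above can be modelled in portable C. This is only a sketch: `SKETCH_VLEN_BITS` and every `sketch_` name are hypothetical, and the `rvv_register` byte array merely stands in for a fixed-length LMUL=1 RVV member, which occupies a whole vector register. It is not SIMDe's actual type definition.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumed stand-in for the value passed to -mrvv-vector-bits. */
#ifndef SKETCH_VLEN_BITS
#define SKETCH_VLEN_BITS 512
#endif

/* Hypothetical model of a private union: the NEON payload is 64 bits, but
 * the second member takes up a full vector register (VLEN bits). */
typedef union {
  uint16_t values[4];
  uint8_t  rvv_register[SKETCH_VLEN_BITS / 8];
} sketch_uint16x4_private;

/* Bytes that actually belong to the NEON vector. */
static size_t sketch_payload_bytes(void) {
  return sizeof(((sketch_uint16x4_private *)0)->values);
}

/* Bytes occupied by the whole union, which grows with VLEN. */
static size_t sketch_union_bytes(void) {
  return sizeof(sketch_uint16x4_private);
}

/* Load that copies only the payload. Copying sizeof(union) bytes instead
 * would read past the end of a 4-element source array. */
static sketch_uint16x4_private sketch_load(const uint16_t src[4]) {
  sketch_uint16x4_private r;
  memset(&r, 0, sizeof(r));
  memcpy(r.values, src, sizeof(r.values));
  return r;
}
```

With the assumed VLEN of 512, the payload is 8 bytes while the union is 64 bytes, which is why a `sizeof(r_)`-based copy is only safe in the branches where the union is exactly as large as the NEON vector.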

eric900115 · 2024-02-14T15:29:26Z

> Very exciting! Has any testing been done on real RISCV64 RVV 1.0 hardware?

So far we have only tested the code with QEMU and the Spike simulator, not on real RISC-V RVV 1.0 hardware.

camel-cdr · 2024-02-16T21:09:17Z

Amazing work!
I ran a quick benchmark on the kendryte k230 (thead C908) with this neon mandelbrot code and my handwritten rvv mandelbrot code (slightly adjusted to fit the neon, godbolt link):

rvv LMUL=2:  287907470 cycles
rvv LMUL=1:  419831245 cycles
neon:        536969360 cycles
scalar:     1695304921 cycles

(this was run with 256 iterations and generated a 1440x1080 image)

This is a 3.1x speedup, and close to the hand-optimized rvv LMUL=1 implementation!

rvv LMUL=2 is faster, but we can't really expect SIMDe to widen the vector length. A future avx2 implementation might be able to generate such code. See C910 and C908 for a comparison of rvv implementations.
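
For reference, the ratios behind these figures can be recomputed from the cycle counts in the table. The helper below is a hypothetical sketch, not part of the benchmark code.

```c
/* Cycle counts copied from the benchmark table above. */
#define SKETCH_CYCLES_RVV_M2 287907470ULL
#define SKETCH_CYCLES_RVV_M1 419831245ULL
#define SKETCH_CYCLES_NEON   536969360ULL
#define SKETCH_CYCLES_SCALAR 1695304921ULL

/* Ratio of baseline cycles to candidate cycles; a value above 1 means the
 * candidate needed fewer cycles than the baseline. */
static double sketch_speedup(unsigned long long baseline_cycles,
                             unsigned long long candidate_cycles) {
  return (double)baseline_cycles / (double)candidate_cycles;
}
```

Calling `sketch_speedup(SKETCH_CYCLES_SCALAR, SKETCH_CYCLES_NEON)` gives about 3.16, and `sketch_speedup(SKETCH_CYCLES_NEON, SKETCH_CYCLES_RVV_M1)` gives about 1.28, so the handwritten LMUL=1 code is roughly 28% faster than the NEON code compiled through SIMDe.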

Edit: ~~give me a minute, the numbers should be roughly correct, but I'm revising the neon code slightly~~ done

mr-c · 2024-02-16T21:23:23Z

Thanks @camel-cdr ! Can you run all the SIMDe tests from this PR on the k230?

camel-cdr · 2024-02-16T21:49:17Z

> Thanks @camel-cdr ! Can you run all the SIMDe tests from this PR on the k230?

I'm currently working on that, but I ran into problems with the glibc version on the k230. I used a freestanding build for the benchmark.

camel-cdr · 2024-02-16T22:42:56Z

I couldn't figure out how to get the glibc versions to align.

mr-c · 2024-02-17T12:02:40Z

simde/simde-features.h

+  #elif defined(SIMDE_RISCV_V_NATIVE) && defined(__riscv_v_fixed_vlen)
+        //FIXME : SIMDE_NATURAL_VECTOR_SIZE == __riscv_v_fixed_vlen
+        #define SIMDE_NATURAL_VECTOR_SIZE (128)


Does this need fixing before merging? If not, then let's make an issue.
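
One possible reading of the FIXME is that the natural vector size should follow `__riscv_v_fixed_vlen`, the macro the diff already tests for, instead of being pinned to 128. A minimal sketch of that idea follows. The `SKETCH_` and `sketch_` names are hypothetical and this is not the change that was merged.

```c
/* Hypothetical sketch: follow the compiler-provided fixed VLEN when it is
 * available, otherwise fall back to the 128 bits used in the diff. */
#if defined(__riscv_v_fixed_vlen)
  #define SKETCH_NATURAL_VECTOR_SIZE (__riscv_v_fixed_vlen)
#else
  #define SKETCH_NATURAL_VECTOR_SIZE (128)
#endif

/* Expose the selected size so it can be inspected at run time. */
static int sketch_natural_vector_size(void) {
  return SKETCH_NATURAL_VECTOR_SIZE;
}
```

Whether a larger natural vector size is actually desirable for the NEON API, whose vectors are at most 128 bits wide, is a separate design question that this sketch does not answer.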

mr-c · 2024-02-17T12:04:20Z

> To compile SIMDe with support for the conversion from NEON to the RISC-V Vector Extension, please use Clang-17 and include the flag -mrvv-vector-bits=<vector_length_of_vector_machine> during compilation.

What's the plan for GCC support?

What about portable binaries, when will we not have to specify -mrvv-vector-bits?

eric900115 · 2024-02-17T13:22:39Z

> Can you add some text to the README.md about using SIMDe on RISC-V?

Sure !

eric900115 · 2024-02-17T13:30:14Z

> What about portable binaries, when will we not have to specify -mrvv-vector-bits?

Creating portable binaries for RVV (RISC-V Vector Extension) is not feasible, as explained in the discussion at https://news.ycombinator.com/item?id=37706070. To summarize, the vector size in RVV is determined at compile time, making it impossible to create binaries that can be ported seamlessly between RVV machines with different vector lengths.

mr-c · 2024-02-17T13:52:00Z

Would a binary for a smaller vector size work on a CPU with a larger vector size?

Maybe a future RISC-V profile will mandate a larger vector size.

I guess for Debian and others that want to maximize the performance of SIMDe-using apps, we will have to compile multiple times based on the vector widths that are commercially available. That is what we already do to support the various x86-64 SIMD intrinsics (https://wiki.debian.org/SIMDEverywhere and https://packages.debian.org/source/testing/subarch-select).

camel-cdr · 2024-02-17T14:08:21Z

@eric900115 For NEON, RVV codegen can be 100% portable. If we require the standard V extension (VLEN>=128 and ELEN=64), then we can use LMUL=1 on all implementations, because even for e.g. VLEN=512 a single vector register contains at least 128 bits. We just need to vsetivli to the fixed element count properly.
I'm not sure if -mrvv-vector-bits=128 is guaranteed to work with VLEN>=128, but it would certainly be possible.
I played around with using fixed element count loads/stores to implement something like this when the fixed-width support didn't exist, but at the time compilers couldn't do the load/store elimination, so it was kind of useless.

#include <riscv_vector.h>
#include <stddef.h>
#include <stdint.h>

typedef struct { uint8_t arr[16]; } V128;

static
V128 vadd8(V128 a, V128 b)
{
    vuint8m1_t A = __riscv_vle8_v_u8m1((void*)&a,16);
    vuint8m1_t B = __riscv_vle8_v_u8m1((void*)&b,16);
    vuint8m1_t C = __riscv_vadd_vv_u8m1(A, B, 16);
    V128 c;
    __riscv_vse8_v_u8m1((void*)&c, C, 16);
    return c;
}


V128 test1(V128 a, V128 b, V128 c)
{
    return vadd8(vadd8(a, b), vadd8(c, c));
}

V128 test2(V128 a, V128 b, V128 c)
{
    vuint8m1_t A = __riscv_vle8_v_u8m1((void*)&a,16);
    vuint8m1_t B = __riscv_vle8_v_u8m1((void*)&b,16);
    vuint8m1_t C = __riscv_vle8_v_u8m1((void*)&c,16);
    V128 r;
    __riscv_vse8_v_u8m1((void*)&r, __riscv_vadd_vv_u8m1(__riscv_vadd_vv_u8m1(A, B, 16), __riscv_vadd_vv_u8m1(C, C, 16), 16), 16);
    return r;
}

mr-c · 2024-02-17T18:00:01Z

> I couldn't figure out how to get the glibc versions to align.

Here's my meson setup --cross ... config for using https://packages.debian.org/unstable/clang-18 and running on the official Debian image:

[binaries]
c = 'clang-18'
cpp = 'clang++-18'
ar = 'llvm-ar-18'
strip = 'llvm-strip-18'
objcopy = 'llvm-objcopy-18'
ld = 'riscv64-linux-gnu-ld'

[properties]
c_args   = ['--target=riscv64-linux-gnu', '-isystem=/usr/riscv64-linux-gnu/include', '-Wextra', '-Werror', '-march=rv64imafdcv_zihintpause_zfh_zba_zbb_zbc_zbs_zicsr_zve32f_zve32x_zve64d_zve64f_zve64x_zvl128b_zvl32b_zvl64b', '-O3', '-mrvv-vector-bits=128']
cpp_args = ['--target=riscv64-linux-gnu', '-isystem=/usr/riscv64-linux-gnu/include', '-Wextra', '-Werror', '-march=rv64imafdcv_zihintpause_zfh_zba_zbb_zbc_zbs_zicsr_zve32f_zve32x_zve64d_zve64f_zve64x_zvl128b_zvl32b_zvl64b', '-O3', '-mrvv-vector-bits=128']
c_link_args = ['--target=riscv64-linux-gnu', '-static', '-static-libgcc']
cpp_link_args = ['--target=riscv64-linux-gnu', '-static', '-static-libgcc', '-static-libstdc++']

[host_machine]
system = 'linux'
cpu_family = 'riscv64'
cpu = 'thead-c906'
endian = 'little'

camel-cdr · 2024-02-17T22:56:28Z

> Here's my meson setup --cross ... config for using https://packages.debian.org/unstable/clang-18 and running on the official Debian image:

Thanks, it worked. I didn't know about the Debian image, and was using the k230_sdk thingy.

Running for i in *native*; do ./$i; done in the arm/neon directory results in the following errors:

../test/arm/neon/fma_lane.c:1163: assertion failed: r1[0] ~= simde_vld1q_f32(test_vec[i].r1)[0] (-382857.250000 ~= 503169.843750)
test/arm/neon/fma_lane.cpp:1163: assertion failed: r1[0] ~= simde_vld1q_f32(test_vec[i].r1)[0] (-382857.250000 ~= 503169.843750)
../test/arm/neon/fms_lane.c:873: assertion failed: r1[0] ~= simde_vld1q_f32(test_vec[i].r1)[0] (-506717.906250 ~= 554470.187500)
test/arm/neon/fms_lane.cpp:873: assertion failed: r1[0] ~= simde_vld1q_f32(test_vec[i].r1)[0] (-506717.906250 ~= 554470.187500)
../test/arm/neon/mul_lane.c:865: assertion failed: r[0] ~= simde_vld1q_f32(test_vec[i].r)[0] (132874.984375 ~= 347499.312500)
test/arm/neon/mul_lane.cpp:865: assertion failed: r[0] ~= simde_vld1q_f32(test_vec[i].r)[0] (132874.984375 ~= 347499.312500)
../test/arm/neon/mulx_lane.c:315: assertion failed: r[0] ~= simde_vld1q_f32(test_vec[i].r)[0] (132874.984375 ~= 347499.312500)
test/arm/neon/mulx_lane.cpp:315: assertion failed: r[0] ~= simde_vld1q_f32(test_vec[i].r)[0] (132874.984375 ~= 347499.312500)
../test/arm/neon/qrdmlah.c:193: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (17480 == 3378)
test/arm/neon/qrdmlah.cpp:193: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (17480 == 3378)
../test/arm/neon/qrdmlah_lane.c:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
test/arm/neon/qrdmlah_lane.cpp:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
../test/arm/neon/qrdmlsh.c:197: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (9556 == -32768)
test/arm/neon/qrdmlsh.cpp:197: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (9556 == -32768)
../test/arm/neon/qrdmlsh_lane.c:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
../test/arm/neon/qrdmulh_lane.c:264: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r0)[0] (0 == 14610)
test/arm/neon/qrdmulh_lane.cpp:264: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r0)[0] (0 == 14610)
../test/arm/neon/uqadd.c:339: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (181 == 32767)
test/arm/neon/uqadd.cpp:339: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (181 == 32767)

mr-c · 2024-02-18T07:57:40Z

@camel-cdr Thanks!

Yeah, I'm now also seeing those errors.

I wonder why the qemu setup in this PR isn't reproducing them?

I hope it isn't a hardware error! :-)

mr-c · 2024-02-18T10:07:07Z

Also, hello @eric900115 and @camel-cdr from the Debian Med Sprint in Berlin. Maybe you can join us in person next year? https://wiki.debian.org/Sprints/2023/DebianMed2024

camel-cdr · 2024-02-18T12:01:49Z

Sounds interesting, Berlin is only 2-3 hours away from me. But I'm not really involved with Debian (except for running it).

Btw, do you know how Debian deals with compiler bugs? I just ran into a gcc-13.2.0 codegen bug that causes a valid program to not work: vsetvli a5,a1,e8,m8,ta,ma should be vsetvli a5,a1,e8,m8,tu,ma. It's been fixed on trunk, but is this something that would be back-ported?
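
To illustrate why the ta/tu difference matters, here is a scalar model of the two tail policies. It is only a sketch with hypothetical names. Under tail-agnostic (ta), the RVV specification lets tail elements either keep their old values or be overwritten with all 1s, and this model picks the overwrite outcome to make the difference visible. Under tail-undisturbed (tu), elements past `vl` always keep their previous values.

```c
#include <stddef.h>
#include <stdint.h>

typedef enum { SKETCH_TAIL_UNDISTURBED, SKETCH_TAIL_AGNOSTIC } sketch_tail_policy;

/* Scalar model of an e8 vector add writing `vl` active elements into a
 * destination register that holds `vlmax` elements. */
static void sketch_vadd_e8(uint8_t *dest, const uint8_t *a, const uint8_t *b,
                           size_t vl, size_t vlmax, sketch_tail_policy policy) {
  for (size_t i = 0; i < vl; i++) {
    dest[i] = (uint8_t)(a[i] + b[i]);
  }
  if (policy == SKETCH_TAIL_AGNOSTIC) {
    /* One outcome the spec permits: the tail is overwritten with all 1s. */
    for (size_t i = vl; i < vlmax; i++) {
      dest[i] = 0xFF;
    }
  }
  /* SKETCH_TAIL_UNDISTURBED: elements vl..vlmax-1 are left untouched. */
}

/* Returns 1 if the tail of the destination survived a 4-element add. */
static int sketch_tail_preserved(sketch_tail_policy policy) {
  uint8_t dest[8] = {9, 9, 9, 9, 9, 9, 9, 9};
  const uint8_t a[8] = {1, 2, 3, 4, 5, 6, 7, 8};
  const uint8_t b[8] = {1, 1, 1, 1, 1, 1, 1, 1};
  sketch_vadd_e8(dest, a, b, 4, 8, policy);
  return dest[4] == 9 && dest[5] == 9 && dest[6] == 9 && dest[7] == 9;
}
```

A program that relies on the old tail values is only correct if the compiler emits tu, which is why emitting ta in that situation breaks a valid program.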

mr-c · 2024-02-18T12:23:16Z

> Btw, do you know how Debian deals with compiler bugs? I just ran into a gcc-13.2.0 codegen bug that causes a valid program to not work: vsetvli a5,a1,e8,m8,ta,ma should be vsetvli a5,a1,e8,m8,tu,ma. It's been fixed on trunk, but is this something that would be back-ported?

I would personally respond positively to a reportbug gcc-13 with a link to the upstream fix, but I don't know that team so I can't make promises.

> Sounds interesting, Berlin is only 2-3 hours away from me. But I'm not really involved with Debian (except for running it).

Anyone is welcome! We appreciate the user perspective!

eric900115 · 2024-02-19T16:06:58Z

> @camel-cdr Thanks!
>
> Yeah, I'm now also seeing those errors.
>
> I wonder why the qemu setup in this PR isn't reproducing them?
>
> I hope it isn't a hardware error! :-)

I am also wondering. I'll try to run QEMU with the same configuration for testing (using the thead-c906 CPU).

mr-c · 2024-02-20T17:38:28Z

@camel-cdr Do you also get failures on the k230 with the current master branch of SIMDe?

I'm seeing failures in

arm/neon/{q,}abs/{emul,native}/{c,cpp}
arm/neon/qrdml{a,s}h_lane/{emul,native}/{c,cpp}
arm/neon/st3/{emul,native}/{c,cpp}

So I guess there are some clang and/or CPU errors .. ?

camel-cdr · 2024-02-20T20:56:51Z

@mr-c Yes, I get similar errors when testing master:

../test/arm/neon/abs.c:711: assertion failed: r[0] == simde_vld1q_s32(test_vec[i].r)[0] (0 == -2147483648)
test/arm/neon/abs.cpp:711: assertion failed: r[0] == simde_vld1q_s32(test_vec[i].r)[0] (0 == -2147483648)
timeout qabs-native-c
timeout qabs-native-cpp
../test/arm/neon/qrdmlah_lane.c:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
../test/arm/neon/qrdmlah_lane.c:678: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (26528 == 18592)
../test/arm/neon/qrdmlah_lane.c:901: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-1308 == 32767)
../test/arm/neon/qrdmlah_lane.c:1128: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-13600 == 25250)
test/arm/neon/qrdmlah_lane.cpp:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
test/arm/neon/qrdmlah_lane.cpp:678: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (26528 == 18592)
test/arm/neon/qrdmlah_lane.cpp:901: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-1308 == 32767)
test/arm/neon/qrdmlah_lane.cpp:1128: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-13600 == 25250)
../test/arm/neon/qrdmlsh_lane.c:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
../test/arm/neon/qrdmlsh_lane.c:1128: assertion failed: r[3] == simde_vld1q_s16(test_vec[i].r)[3] (26847 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:1128: assertion failed: r[3] == simde_vld1q_s16(test_vec[i].r)[3] (26847 == -32768)

I'm somewhat inclined to believe it's a clang miscompilation, because I had a gcc-13.2 miscompilation yesterday. I suppose we need to investigate this somehow.

mr-c · 2024-02-22T15:54:00Z

Hey @camel-cdr ; in #1141 I fixed some of the NEON abs functions. Maybe you have time to re-run the tests?

camel-cdr · 2024-02-22T18:56:38Z

@mr-c Here we go, looks like the abs errors are gone, great work.

../test/arm/neon/qrdmlah_lane.c:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
../test/arm/neon/qrdmlah_lane.c:678: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (26528 == 18592)
../test/arm/neon/qrdmlah_lane.c:901: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-1308 == 32767)
../test/arm/neon/qrdmlah_lane.c:1128: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-13600 == 25250)
test/arm/neon/qrdmlah_lane.cpp:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
test/arm/neon/qrdmlah_lane.cpp:678: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (26528 == 18592)
test/arm/neon/qrdmlah_lane.cpp:901: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-1308 == 32767)
test/arm/neon/qrdmlah_lane.cpp:1128: assertion failed: r[0] == simde_vld1q_s16(test_vec[i].r)[0] (-13600 == 25250)
../test/arm/neon/qrdmlsh_lane.c:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
../test/arm/neon/qrdmlsh_lane.c:1128: assertion failed: r[3] == simde_vld1q_s16(test_vec[i].r)[3] (26847 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:1128: assertion failed: r[3] == simde_vld1q_s16(test_vec[i].r)[3] (26847 == -32768)

eric900115 · 2024-03-05T17:47:29Z

I have modified mul_lane and mulx_lane. Hopefully the errors in fms_lane, fma_lane, mul_lane, and mulx_lane are now resolved.

OMaghiarIMG · 2024-03-08T11:08:36Z

Hello @eric900115, this is really good stuff.
I have a question, you mentioned you converted all Neon intrinsics to RVV, does that exclude bf16 and cryptography instructions which may not be easily replicated with base V? I think trunk LLVM contains experimental intrinsics for Zvfbfwma and Vector crypto.

Is there anything you might need help with?

mr-c · 2024-03-08T17:33:36Z

Thanks @camel-cdr ; can you retest the latest?

camel-cdr · 2024-03-08T23:37:28Z

@mr-c the errors are still there, but the values are different now:

../test/arm/neon/qrdmlah.c:193: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (17480 == 3378)
test/arm/neon/qrdmlah.cpp:193: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (17480 == 3378)
../test/arm/neon/qrdmlah_lane.c:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
test/arm/neon/qrdmlah_lane.cpp:475: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (-18972 == -13752)
../test/arm/neon/qrdmlsh.c:197: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (9556 == -32768)
test/arm/neon/qrdmlsh.cpp:197: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (9556 == -32768)
../test/arm/neon/qrdmlsh_lane.c:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
test/arm/neon/qrdmlsh_lane.cpp:475: assertion failed: r[2] == simde_vld1_s16(test_vec[i].r)[2] (30372 == -32768)
../test/arm/neon/qrdmulh_lane.c:264: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r0)[0] (0 == 14610)
test/arm/neon/qrdmulh_lane.cpp:264: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r0)[0] (0 == 14610)
../test/arm/neon/uqadd.c:339: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (181 == 32767)
test/arm/neon/uqadd.cpp:339: assertion failed: r[0] == simde_vld1_s16(test_vec[i].r)[0] (181 == 32767)

mr-c · 2024-03-11T07:53:05Z

@camel-cdr are those errors from the emul or native tests?

camel-cdr · 2024-03-11T16:51:37Z

@mr-c it was the native tests, I ran it via: for i in *native*; do ./$i; done > /dev/null

eric900115 · 2024-03-13T06:38:05Z

@OMaghiarIMG

Hi! Yes, we excluded BF16 and cryptography for conversion.

For the conversion from NEON to RVV, if the performance (instruction counts) of using single or multiple RVV intrinsics is better than automatic vectorization, then we use RVV intrinsics for implementation. Otherwise, we use loop automatic vectorization from SIMDe.
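
The pattern described above can be sketched as follows: an RVV intrinsic path where it pays off, and a plain loop that the compiler may auto-vectorize otherwise. This is only an illustration. The `sketch_` names are hypothetical, the guard uses the compiler-provided `__riscv_v_intrinsic` macro instead of SIMDe's own feature macros, and SIMDe's real functions are structured differently.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__riscv_v_intrinsic)
  #include <riscv_vector.h>
  #define SKETCH_RVV_NATIVE 1
#endif

/* Hypothetical 128-bit vector of four 32-bit lanes. */
typedef struct { int32_t values[4]; } sketch_int32x4;

/* Lane-wise add, in the spirit of NEON's vaddq_s32. */
static sketch_int32x4 sketch_vaddq_s32(sketch_int32x4 a, sketch_int32x4 b) {
  sketch_int32x4 r;
#if defined(SKETCH_RVV_NATIVE)
  /* RVV path: load four lanes, add them with one intrinsic, store back. */
  vint32m1_t va = __riscv_vle32_v_i32m1(a.values, 4);
  vint32m1_t vb = __riscv_vle32_v_i32m1(b.values, 4);
  __riscv_vse32_v_i32m1(r.values, __riscv_vadd_vv_i32m1(va, vb, 4), 4);
#else
  /* Fallback path: a simple loop left to the auto-vectorizer. The unsigned
   * cast gives wrap-around on overflow without undefined behaviour. */
  for (size_t i = 0; i < 4; i++) {
    r.values[i] = (int32_t)((uint32_t)a.values[i] + (uint32_t)b.values[i]);
  }
#endif
  return r;
}
```

Both paths produce the same lanes, so the choice between them can be made per function, based on which one yields fewer instructions.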

mr-c · 2024-03-14T13:19:30Z

Thank you @eric900115 ! Now that SIMDe 0.8.0 is released we can focus the next development cycle on RVV 1.0 implementations.

mr-c and others added 17 commits February 8, 2024 14:46

gh-actions: add clang-17 (simd-everywhere#1127)

cd8f3eb

* gh-actions macos: skip coverage report

feat : add ci test for RISC-V Vector

00436c5

fix : ci.yml

d73932f

feat : modify types.h for risc-v vector extension

6dfcaa1

feat : modify simde utilities for rvv

dd7a360

fix : type.h copyright

22d9cb3

fix : ci files name

b62a934

feat : modify load & store for risc-v v extension

5074218

feat : modify load & store for risc-v vector

d2e417f

fix : ci file

5511f63

fix : reinterpret (due to rvv mem pollution)

dd4583c

feat : add and mul neon to rvv

28e5bf1

fix : copyright

6adbd47

feat : modify ci.yml

74994a5

feat : remove TODO

93bc7ab

Merge branch 'master' of https://github.com/eric900115/simde

b75d5fd

mr-c reviewed Feb 14, 2024


eric900115 added 5 commits February 17, 2024 15:10

feat : add rvv CI without zvfh

357cf2d

fix : ld & st sizeof

bbd6b9c

fix : ld2

8a1b4dc

fix : ld1 & st1 rvv ZVFH

8353355

fix : revert reinterpret sizeof

d8c5401

mr-c reviewed Feb 17, 2024


mr-c mentioned this pull request Feb 20, 2024

Consider merging with SIMDe? howjmay/neon2rvv#50

Open

eric900115 and others added 3 commits March 5, 2024 11:14

feat : add rvv implementation (mul_lane)

f26120c

Merge branch 'master' into master

36d8995

feat : add mulx_lane neon2rvv

f34f7a6

mr-c merged commit b4e805a into simd-everywhere:master Mar 14, 2024
81 of 85 checks passed

mr-c mentioned this pull request Apr 11, 2024

How to support fixed-size rvv intrinsic type in gcc ? howjmay/neon2rvv#373

Open

chenrui333 mentioned this pull request May 2, 2024

simde 0.8.2 Homebrew/homebrew-core#170651

Merged
