forked from CIS565-Fall-2017/Project2-Stream-Compaction
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Output.txt
88 lines (86 loc) · 3.69 KB
/
Output.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
[ 48 27 7 35 22 1 46 38 31 23 36 14 10 ... 10 0 ]
SIZE: 16777216
*****************************
**** BEGIN RADIX TESTS ******
*****************************
==== std::sort, power-of-two(full array) ====
elapsed time: 1252.06ms (std::chrono Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
==== std::sort, non-power-of-two ====
elapsed time: 1228.38ms (std::chrono Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
==== CPU radix sort, power-of-two ====
elapsed time: 2722.52ms (std::chrono Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
passed
==== CPU radix sort, non power-of-two ====
elapsed time: 2313.88ms (std::chrono Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
passed
==== GPU radix sort, power-of-two ====
elapsed time: 380.855ms (CUDA Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
passed
==== GPU radix sort, non power-of-two ====
elapsed time: 367.632ms (CUDA Measured)
[ 0 0 0 0 0 0 0 0 0 0 0 0 0 ... 49 49 ]
passed
****************
** SCAN TESTS **
****************
==== cpu scan, power-of-two ====
elapsed time: 82.2523ms (std::chrono Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943203 410943213 ]
==== cpu scan, non-power-of-two ====
elapsed time: 83.7772ms (std::chrono Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943129 410943167 ]
passed
==== naive scan, power-of-two ====
elapsed time: 54.4172ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943203 410943213 ]
passed
==== naive scan, non-power-of-two ====
elapsed time: 54.393ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 0 0 ]
passed
==== work-efficient scan, power-of-two ====
elapsed time: 20.877ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943203 410943213 ]
passed
==== work-efficient scan, non-power-of-two ====
elapsed time: 20.83ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943129 410943167 ]
passed
==== thrust scan, power-of-two ====
elapsed time: 4.25725ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943203 410943213 ]
passed
==== thrust scan, non-power-of-two ====
elapsed time: 4.14429ms (CUDA Measured)
[ 0 48 75 82 117 139 140 186 224 255 278 314 328 ... 410943129 410943167 ]
passed
*****************************
** STREAM COMPACTION TESTS **
*****************************
[ 0 3 0 0 0 1 0 0 0 2 3 1 3 ... 0 0 ]
==== cpu compact without scan, power-of-two ====
elapsed time: 123.166ms (std::chrono Measured)
[ 3 1 2 3 1 3 2 3 2 2 2 1 2 ... 2 2 ]
passed
==== cpu compact without scan, non-power-of-two ====
elapsed time: 121.839ms (std::chrono Measured)
[ 3 1 2 3 1 3 2 3 2 2 2 1 2 ... 1 2 ]
passed
==== cpu compact with scan ====
elapsed time: 471.391ms (std::chrono Measured)
[ 3 1 2 3 1 3 2 3 2 2 2 1 2 ... 2 2 ]
passed
==== work-efficient compact, power-of-two ====
elapsed time: 29.8227ms (CUDA Measured)
[ 3 1 2 3 1 3 2 3 2 2 2 1 2 ... 2 2 ]
passed
==== work-efficient compact, non-power-of-two ====
elapsed time: 29.8634ms (CUDA Measured)
[ 3 1 2 3 1 3 2 3 2 2 2 1 2 ... 1 2 ]
passed
Press any key to continue . . .