• Name: the name of the matrix (test)
• N: the dimension of the matrix (number of rows and columns)
• NNZ: number of non-zeros
• K: the half-bandwidth after reordering and drop-off
• Solves: indicates whether we managed to solve the problem or not. OK means solved fine, otherwise a reason is provided for failure
• SolAcc: infinity norm of the array storing the relative errors
• NPrtns: the number of partitions used to solve the problem
• T-LU: LU time
• T-SwDef: Deflation sweep time (BCR only, Eq. 5a, 5b)
• T-MMDef: Deflation matrix multiplication time (BCR only, Eq. 5c-5e)
• T-PreP: the sum of all preprocessing times, see NOTES
• Kry-M: the method used in Krylov solving stage (can be BiCGStab2 (0), BiCGStab (1), or CG(2))
• nItrs: the number of Krylov-solve iterations to solve the problem
• T-SwInf: Inflation sweep time (BCR only, Eq. 5f)
• T-MVInf: Inflation matrix vector multiplication time (BCR only, Eq. 5g, 5h)
• T-Kry: time spent in the Krylov solver (on the GPU)
• T-KryPIt: time per Krylov iteration, see NOTES
• Total: total time to solve the problem, sum of T-PreP + T-Kry
NOTES:
1) All times reported are in miliseconds (1E-3 second)
2) T-KryPIt = T-Kry / max(1, nItrs)
Name |
N |
NNZ |
K |
Solves |
SolAcc |
NPrtns |
UseBCR |
T-LU |
T-SwDef |
T-MMDef |
T-PreP |
Kry-M |
nItrs |
T-SwInf |
T-MVInf |
T-Kry |
T-KryPIt |
T-Total |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
3.66382e-13 |
1 |
1 |
1022.82 |
1051.26 |
1091.99 |
3335.39 |
P-B2(SI) |
0.25 |
32.1473 |
33.43 |
68.569 |
68.569 |
3403.96 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
3.48565e-11 |
4 |
1 |
155.701 |
180.545 |
163.684 |
637.172 |
P-B2(SI) |
68.75 |
1398.07 |
1396.56 |
2982.8 |
43.3862 |
3619.97 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
1.44723e-10 |
8 |
1 |
102.793 |
94.0903 |
109.224 |
444.646 |
P-B2(SI) |
79.25 |
1358.21 |
1690.34 |
3266.5 |
41.2177 |
3711.15 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
8.97172e-11 |
16 |
1 |
50.716 |
88.3005 |
83.6695 |
379.554 |
P-B2(SI) |
103.75 |
1497.61 |
2574.81 |
4355.16 |
41.9775 |
4734.72 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
9.56015e-11 |
20 |
1 |
47.9605 |
48.3964 |
78.7607 |
340.028 |
P-B2(SI) |
133.25 |
1842.69 |
3609.59 |
5815.88 |
43.6464 |
6155.91 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
2.55569e-13 |
1 |
0 |
693.714 |
|
|
777.527 |
P-B2(SI) |
0.25 |
|
|
432.099 |
432.099 |
1209.63 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
1.32918e-10 |
4 |
0 |
260.632 |
|
|
377.804 |
P-B2(SI) |
66.25 |
|
|
6436.48 |
97.1544 |
6814.29 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
1.06018e-10 |
8 |
0 |
135.699 |
|
|
261.716 |
P-B2(SI) |
84.5 |
|
|
4158.25 |
49.21 |
4419.96 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
1.10074e-10 |
16 |
0 |
80.2771 |
|
|
232.238 |
P-B2(SI) |
95.75 |
|
|
2453.34 |
25.6223 |
2685.58 |
NetANCF_40by40 |
63603 |
569262 |
607 |
OK |
4.34413e-11 |
20 |
0 |
61.4491 |
|
|
226.507 |
P-B2(SI) |
139.25 |
|
|
2891.96 |
20.7681 |
3118.46 |
ANCF31770 |
31770 |
183540 |
248 |
NConv |
4.22808e+07 |
1 |
1 |
95.4856 |
134.568 |
113.656 |
398.23 |
P-B2(SI) |
150.75 |
2248.18 |
1708.78 |
4361.96 |
28.9351 |
4760.19 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
1.65299e-10 |
4 |
1 |
49.7792 |
72.4741 |
51.0232 |
233.944 |
P-B2(SI) |
20.25 |
215.183 |
169.344 |
440.503 |
21.7532 |
674.447 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
6.88379e-10 |
8 |
1 |
21.9066 |
28.5957 |
28.4337 |
137.889 |
P-B2(SI) |
24.5 |
164.73 |
199.82 |
429.971 |
17.5498 |
567.86 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
2.7774e-09 |
16 |
1 |
24.4993 |
28.7084 |
31.7548 |
147.38 |
P-B2(SI) |
37.25 |
223.805 |
350.34 |
672.202 |
18.0457 |
819.582 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
6.70513e-10 |
20 |
1 |
25.2705 |
31.1571 |
24.5468 |
143.266 |
P-B2(SI) |
55.25 |
352.51 |
433.917 |
929.256 |
16.8191 |
1072.52 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
8.95056e-12 |
1 |
0 |
219.048 |
|
|
258.696 |
P-B2(SI) |
0.25 |
|
|
149.594 |
149.594 |
408.29 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
7.76564e-11 |
4 |
0 |
81.9056 |
|
|
135.88 |
P-B2(SI) |
20.25 |
|
|
997.232 |
49.246 |
1133.11 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
1.79715e-10 |
8 |
0 |
57.1762 |
|
|
113.46 |
P-B2(SI) |
24.75 |
|
|
620.847 |
25.0847 |
734.307 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
3.06668e-09 |
16 |
0 |
32.1565 |
|
|
91.428 |
P-B2(SI) |
36.75 |
|
|
507.344 |
13.8053 |
598.772 |
ANCF31770 |
31770 |
183540 |
248 |
OK |
8.55813e-10 |
20 |
0 |
25.7966 |
|
|
85.105 |
P-B2(SI) |
55.25 |
|
|
638.128 |
11.5498 |
723.233 |
ANCF88950 |
88950 |
513900 |
410 |
NConv |
1.57368e+08 |
1 |
1 |
629.179 |
815.612 |
707.106 |
2356.15 |
P-B2(SI) |
40.75 |
1437.9 |
1730.83 |
3298.78 |
80.9516 |
5654.93 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
6.00758e-10 |
4 |
1 |
193.154 |
216.788 |
220.53 |
821.692 |
P-B2(SI) |
23.75 |
566.853 |
619.764 |
1263.81 |
53.2128 |
2085.5 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
6.2538e-09 |
8 |
1 |
85.9367 |
99.7917 |
106.59 |
465.853 |
P-B2(SI) |
42.5 |
846.581 |
1023.58 |
2007.44 |
47.234 |
2473.3 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
6.75068e-10 |
16 |
1 |
66.1119 |
71.2695 |
94.2902 |
414.12 |
P-B2(SI) |
52.75 |
922.988 |
1406.96 |
2491.08 |
47.2243 |
2905.2 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
1.69943e-09 |
20 |
1 |
63.574 |
72.8407 |
96.9791 |
418.972 |
P-B2(SI) |
33.75 |
576.764 |
982.373 |
1664.41 |
49.316 |
2083.39 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
1.10227e-11 |
1 |
0 |
855.102 |
|
|
969.213 |
P-B2(SI) |
0.25 |
|
|
501.764 |
501.764 |
1470.98 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
5.55827e-10 |
4 |
0 |
378.167 |
|
|
539.452 |
P-B2(SI) |
23.25 |
|
|
3151.26 |
135.538 |
3690.71 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
5.0742e-09 |
8 |
0 |
170.641 |
|
|
327.401 |
P-B2(SI) |
57.75 |
|
|
3901.38 |
67.5564 |
4228.78 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
1.46678e-09 |
16 |
0 |
106.784 |
|
|
277.711 |
P-B2(SI) |
51.25 |
|
|
1786.64 |
34.8612 |
2064.35 |
ANCF88950 |
88950 |
513900 |
410 |
OK |
4.34103e-10 |
20 |
0 |
93.5688 |
|
|
273.097 |
P-B2(SI) |
36.75 |
|
|
1063.08 |
28.9274 |
1336.18 |