1-17-2016 Banded Results BCR Test



• Name: the name of the matrix (test)
• N: the dimension of the matrix (number of rows and columns)
• NNZ: number of non-zeros
• SPD: whether the matrix is specified by the user to be symmetric positive definite (values: 0 or 1)
• DB: indicates whether DB reordering is performed. Values: 0 or 1.
• K-DB: the half-bandwidth after DB reordering (without any drop-off). If DB reordering is not executed, this reports the original half-bandwidth
• KnoDrp: the half-bandwidth after DB and CM reordering but before drop-off
• K: the half-bandwidth after reordering and drop-off
• FRate: fill-in rate. See NOTES below
• nuKf: non-uniform K factor. Indicates whether the K changes a lot from row to row. Values are between 0 and 1, with 0 indicating a perfectly uniform bandwidth over the entire matrix. See NOTES below
• Solves: indicates whether we managed to solve the problem or not. OK means solved fine, otherwise a reason is provided for failure
• Bstng: indicates whether diagonal boosting is enabled during factorization. Values: 0 or 1
• SolAcc: infinity norm of the array storing the relative errors
• T-DB: time to run DB reordering for the matrix on the CPU
• T-CM: time to run CM reordering for the matrix on the CPU
• T-Drop: time to drop off off-diagonal elements to decrease bandwidth. Done on the CPU.
• T-Dtransf: time to transfer data from CPU to GPU
• T-Asmbl: time to copy the sparse matrix, after reordering and drop-off, into a banded matrix stored in GPU memory
• LU-M: LU method (complete, ILUT or ILUULT)
• Fill-in: the fill-in factor of ILUT (-1 indicates complete LU)
• NPrtns: the number of partitions used to solve the problem
• T-BC: time required to get off-diagonal right hand sides (Bs and Cs) from the banded matrix - done on the GPU
• T-LU: LU time
• GFlps-LU: LU GFLOPs
• T-SPK: time to solve for the spikes Vs and Ws - done on the GPU
• T-LUrdcd: time required to factorize the reduced matrices - done on the GPU
• T-PreP: the sum of all preprocessing times, see NOTES
• Kry-M: the method used in the Krylov solve stage. Values: BiCGStab2 (0), BiCGStab (1), or CG (2)
• nItrs: the number of Krylov-solve iterations to solve the problem
• T-Kry: time spent in the Krylov solver (on the GPU)
• Total: total time to solve the problem, i.e. the sum of T-PreP and T-Kry
• Pardiso: the time for the commercial tool "Pardiso" to solve the problem
• SlwD: the slowdown ratio of our solver compared to Pardiso. A value less than one means that we are faster than Pardiso. The value is shown in green if we run faster and in red if we run more than 5 times slower.
• Fastest: the time when SaP runs fastest historically
• SpdUp: the speedup of this run compared to the historical fastest run (the value is shown in green if the speedup is more than 5% and shown in red if the slowdown is more than 5%)
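
The SlwD and SpdUp entries can be sketched as follows. This is only an illustration of the descriptions in the list: the function names are hypothetical, the ratios are assumed to be Total/Pardiso and Fastest/Total, and "slowdown of more than 5%" is assumed to mean a speedup below 0.95.

```python
def slowdown(total_ms, pardiso_ms):
    """SlwD: our total time divided by the Pardiso time (assumed definition)."""
    return total_ms / pardiso_ms


def slwd_colour(slwd):
    """Green if we are faster than Pardiso, red if more than 5 times slower."""
    if slwd < 1.0:
        return "green"
    if slwd > 5.0:
        return "red"
    return None  # no highlighting


def speedup(total_ms, fastest_ms):
    """SpdUp: historical fastest time divided by this run's time (assumed)."""
    return fastest_ms / total_ms


def spdup_colour(spdup, margin=0.05):
    """Green above 1 + margin, red below 1 - margin (thresholds assumed)."""
    if spdup > 1.0 + margin:
        return "green"
    if spdup < 1.0 - margin:
        return "red"
    return None  # within the 5% margin
```

For example, a run taking 50 ms against a Pardiso time of 100 ms gives a SlwD of 0.5, which would be shown in green.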


NOTES:
1) nuKf = 1/(2KN)*[sum over i from 1 to N of (2K - K_{iLeft} - K_{iRight})], where K_{iLeft} is the row half-bandwidth to the left of the diagonal while K_{iRight} is the row half-bandwidth to the right of the diagonal.
2) FRate = the actual number of NNZ / ((2K+1)N).
3) All times reported are in milliseconds (1E-3 second)
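
A minimal sketch of notes 1) and 2), assuming the matrix is given as a list of unique 0-based (row, col) positions of its non-zeros and that K is the largest row half-bandwidth on either side. The function name `band_stats` and the input format are illustrative, not part of the solver.

```python
def band_stats(n, entries):
    """Return (K, nuKf, FRate) for an n x n matrix.

    entries: iterable of unique (row, col) index pairs of the non-zeros.
    """
    left = [0] * n   # K_{iLeft}: row half-bandwidth left of the diagonal
    right = [0] * n  # K_{iRight}: row half-bandwidth right of the diagonal
    nnz = 0
    for i, j in entries:
        nnz += 1
        if j < i:
            left[i] = max(left[i], i - j)
        else:
            right[i] = max(right[i], j - i)

    k = max(max(left), max(right))

    # Note 2: FRate = NNZ / ((2K+1)N)
    frate = nnz / ((2 * k + 1) * n)

    # Note 1: nuKf = 1/(2KN) * sum_i (2K - K_iLeft - K_iRight)
    if k == 0:
        return k, 0.0, frate  # diagonal matrix: treat the band as uniform
    nukf = sum(2 * k - l - r for l, r in zip(left, right)) / (2 * k * n)
    return k, nukf, frate


# 4 x 4 tridiagonal matrix: 10 non-zeros, K = 1
tridiag = [(i, j) for i in range(4) for j in range(4) if abs(i - j) <= 1]
k, nukf, frate = band_stats(4, tridiag)
```

For this tridiagonal example the first and last rows each have one empty side, so the formula gives nuKf = 2/8 = 0.25 rather than 0, and FRate = 10/12.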


| N | K | d | NPrtns | Solves | T-LU | T-SwDef | T-MMDef | T-PreP | nItrs | T-SwInf | T-MVInf | T-Kry | T-KryPIt | T-Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 200000 | 10 | 1 | 1 | OK | 20.1776 | 12.2331 | 34.2778 | 76.519 | 0.75 | 10.8142 | 30.1041 | 53.527 | 53.527 | 130.046 |
| 200000 | 10 | 1 | 1 | OK | 263.954 | | | 268.019 | 0.75 | | | 1207.76 | 1207.76 | 1475.78 |
| 200000 | 10 | 1 | 4 | OK | 18.7701 | 11.9884 | 33.3859 | 73.939 | 2.75 | 27.1903 | 77.206 | 131.912 | 47.968 | 205.851 |
| 200000 | 10 | 1 | 4 | OK | 66.5346 | | | 70.797 | 2.75 | | | 1153.14 | 419.324 | 1223.94 |
| 200000 | 10 | 1 | 8 | OK | 18.0989 | 11.8455 | 33.0313 | 72.733 | 2.75 | 26.7507 | 76.7581 | 130.777 | 47.5553 | 203.51 |
| 200000 | 10 | 1 | 8 | OK | 33.6417 | | | 37.776 | 2.75 | | | 592.413 | 215.423 | 630.189 |
| 200000 | 10 | 1 | 16 | OK | 17.3545 | 11.7221 | 32.5925 | 71.363 | 2.75 | 26.3156 | 76.4494 | 129.604 | 47.1287 | 200.967 |
| 200000 | 10 | 1 | 16 | OK | 17.1686 | | | 21.309 | 2.75 | | | 312.163 | 113.514 | 333.472 |
| 200000 | 10 | 1 | 20 | OK | 17.9363 | 11.6456 | 32.3343 | 71.734 | 3.25 | 29.9155 | 87.8072 | 147.762 | 45.4652 | 219.496 |
| 200000 | 10 | 1 | 20 | OK | 13.9137 | | | 18.088 | 3.25 | | | 294.134 | 90.5028 | 312.222 |
| 200000 | 10 | 1 | 32 | OK | 16.6304 | 11.5866 | 32.0842 | 70.056 | 3.25 | 33.8743 | 99.6309 | 166.565 | 51.2508 | 236.621 |
| 200000 | 10 | 1 | 32 | OK | 9.03418 | | | 13.192 | 3.25 | | | 221.602 | 68.1852 | 234.794 |
| 200000 | 10 | 1 | 50 | OK | 15.9555 | 11.5021 | 31.5156 | 68.702 | 3.25 | 33.3183 | 99.0887 | 165.061 | 50.788 | 233.763 |
| 200000 | 10 | 1 | 50 | OK | 6.09616 | | | 10.261 | 3.25 | | | 155.612 | 47.8806 | 165.873 |
| 200000 | 20 | 1 | 1 | OK | 28.5589 | 20.7635 | 41.8198 | 101.645 | 0.75 | 10.7778 | 32.3187 | 57.732 | 57.732 | 159.377 |
| 200000 | 20 | 1 | 1 | OK | 285.392 | | | 291.724 | 0.75 | | | 1269.21 | 1269.21 | 1560.94 |
| 200000 | 20 | 1 | 4 | OK | 27.1405 | 20.0945 | 41.2422 | 98.899 | 2.75 | 26.7403 | 82.9522 | 143.059 | 52.0215 | 241.958 |
| 200000 | 20 | 1 | 4 | OK | 72.2766 | | | 78.755 | 2.75 | | | 1208.28 | 439.374 | 1287.03 |
| 200000 | 20 | 1 | 8 | OK | 26.5242 | 19.9859 | 40.6567 | 97.613 | 2.75 | 26.3711 | 83.7064 | 143.187 | 52.068 | 240.8 |
| 200000 | 20 | 1 | 8 | OK | 36.8024 | | | 43.232 | 2.75 | | | 624.008 | 226.912 | 667.24 |
| 200000 | 20 | 1 | 16 | OK | 30.8517 | 19.7066 | 53.396 | 114.541 | 2.75 | 26.1296 | 84.2331 | 146.611 | 53.3131 | 261.152 |
| 200000 | 20 | 1 | 16 | OK | 19.938 | | | 26.438 | 2.75 | | | 331.069 | 120.389 | 357.507 |
| 200000 | 20 | 1 | 20 | OK | 25.0187 | 19.4276 | 39.5193 | 94.413 | 2.75 | 25.0196 | 82.5858 | 140.058 | 50.9302 | 234.471 |
| 200000 | 20 | 1 | 20 | OK | 16.1734 | | | 22.656 | 2.75 | | | 271.544 | 98.7433 | 294.2 |
| 200000 | 20 | 1 | 32 | OK | 28.3975 | 19.1157 | 53.0335 | 110.991 | 2.75 | 25.3271 | 82.7323 | 144.36 | 52.4945 | 255.351 |
| 200000 | 20 | 1 | 32 | OK | 11.4661 | | | 17.963 | 2.75 | | | 184.051 | 66.9276 | 202.014 |
| 200000 | 20 | 1 | 50 | OK | 24.0285 | 18.8317 | 33.9805 | 87.287 | 2.75 | 24.1632 | 81.5533 | 137.548 | 50.0175 | 224.835 |
| 200000 | 20 | 1 | 50 | OK | 8.39261 | | | 14.862 | 2.75 | | | 131.399 | 47.7815 | 146.261 |
| 200000 | 50 | 1 | 1 | OK | 58.0309 | 40.8333 | 72.2963 | 189.013 | 0.75 | 13.3109 | 32.774 | 63.572 | 63.572 | 252.585 |
| 200000 | 50 | 1 | 1 | OK | 1025.44 | | | 1038.49 | 0.75 | | | 1442.67 | 1442.67 | 2481.16 |
| 200000 | 50 | 1 | 4 | OK | 56.7111 | 39.2087 | 72.1443 | 185.816 | 2.25 | 27.4534 | 71.9446 | 133.868 | 59.4969 | 319.684 |
| 200000 | 50 | 1 | 4 | OK | 262.625 | | | 276.043 | 2.25 | | | 1131.02 | 502.676 | 1407.07 |
| 200000 | 50 | 1 | 8 | OK | 55.807 | 38.382 | 70.9378 | 182.941 | 2.25 | 26.3346 | 71.3542 | 131.849 | 58.5996 | 314.79 |
| 200000 | 50 | 1 | 8 | OK | 130.457 | | | 143.904 | 2.25 | | | 579.296 | 257.465 | 723.2 |
| 200000 | 50 | 1 | 16 | OK | 54.31 | 36.6615 | 63.63 | 172.336 | 2.25 | 25.0014 | 69.7521 | 128.493 | 57.108 | 300.829 |
| 200000 | 50 | 1 | 16 | OK | 77.556 | | | 91.055 | 2.25 | | | 308.566 | 137.14 | 399.621 |
| 200000 | 50 | 1 | 20 | OK | 55.1252 | 37.136 | 64.398 | 174.509 | 2.25 | 29.848 | 83.8569 | 152.886 | 67.9493 | 327.395 |
| 200000 | 50 | 1 | 20 | OK | 63.7569 | | | 77.266 | 2.25 | | | 301.749 | 134.111 | 379.015 |
| 200000 | 50 | 1 | 32 | OK | 53.3911 | 35.2045 | 60.8202 | 167.117 | 2.25 | 28.3512 | 81.3307 | 148.433 | 65.9702 | 315.55 |
| 200000 | 50 | 1 | 32 | OK | 46.7509 | | | 60.29 | 2.25 | | | 204.184 | 90.7484 | 264.474 |
| 200000 | 50 | 1 | 50 | OK | 54.2404 | 34.9716 | 61.4407 | 168.401 | 2.25 | 24.0418 | 69.447 | 126.962 | 56.4276 | 295.363 |
| 200000 | 50 | 1 | 50 | OK | 36.6163 | | | 50.102 | 2.25 | | | 125.538 | 55.7947 | 175.64 |
| 200000 | 100 | 1 | 1 | OK | 109.522 | 153.443 | 144.334 | 440.031 | 0.75 | 17.5976 | 37.9598 | 79.031 | 79.031 | 519.062 |
| 200000 | 100 | 1 | 1 | OK | 927.946 | | | 952.059 | 0.75 | | | 1467.59 | 1467.59 | 2419.65 |
| 200000 | 100 | 1 | 4 | OK | 107.419 | 144.778 | 141.901 | 426.66 | 1.75 | 28.6168 | 67.2058 | 135.72 | 77.5543 | 562.38 |
| 200000 | 100 | 1 | 4 | OK | 388.215 | | | 412.689 | 1.75 | | | 949.015 | 542.294 | 1361.7 |
| 200000 | 100 | 1 | 8 | OK | 106.037 | 139.491 | 132.491 | 410.79 | 1.75 | 27.0273 | 66.326 | 133.06 | 76.0343 | 543.85 |
| 200000 | 100 | 1 | 8 | OK | 248.225 | | | 272.675 | 1.75 | | | 491.963 | 281.122 | 764.638 |
| 200000 | 100 | 1 | 16 | OK | 106.17 | 135.52 | 129.781 | 404.154 | 1.75 | 25.738 | 65.9356 | 131.146 | 74.9406 | 535.3 |
| 200000 | 100 | 1 | 16 | OK | 175.834 | | | 200.396 | 1.75 | | | 270.104 | 154.345 | 470.5 |
| 200000 | 100 | 1 | 20 | OK | 105.866 | 133.929 | 129.252 | 401.61 | 1.75 | 25.4151 | 65.7295 | 130.53 | 74.5886 | 532.14 |
| 200000 | 100 | 1 | 20 | OK | 161.845 | | | 186.291 | 1.75 | | | 224.224 | 128.128 | 410.515 |
| 200000 | 100 | 1 | 32 | OK | 104.571 | 94.4815 | 122.67 | 354.542 | 1.75 | 24.2519 | 63.4156 | 126.799 | 72.4566 | 481.341 |
| 200000 | 100 | 1 | 32 | OK | 141.352 | | | 165.819 | 1.75 | | | 157.016 | 89.7234 | 322.835 |
| 200000 | 100 | 1 | 50 | OK | 106.64 | 125.299 | 122.341 | 386.786 | 2.25 | 29.3997 | 78.2467 | 154.65 | 68.7333 | 541.436 |
| 200000 | 100 | 1 | 50 | OK | 132.602 | | | 157.166 | 2.25 | | | 142.253 | 63.2236 | 299.419 |
| 200000 | 200 | 1 | 1 | OK | 316.416 | 416.579 | 369.588 | 1195.87 | 0.75 | 27.9105 | 54.3955 | 120.302 | 120.302 | 1316.17 |
| 200000 | 200 | 1 | 1 | OK | 1189.06 | | | 1236.42 | 0.75 | | | 1511.51 | 1511.51 | 2747.93 |
| 200000 | 200 | 1 | 4 | OK | 310.789 | 388.09 | 349.359 | 1140.93 | 1.75 | 44.1253 | 95.493 | 205.398 | 117.37 | 1346.32 |
| 200000 | 200 | 1 | 4 | OK | 596.328 | | | 644.058 | 1.75 | | | 988.48 | 564.846 | 1632.54 |
| 200000 | 200 | 1 | 8 | OK | 309.33 | 371.548 | 338.447 | 1112.44 | 1.75 | 41.4088 | 93.3123 | 200.678 | 114.673 | 1313.11 |
| 200000 | 200 | 1 | 8 | OK | 444.299 | | | 492.24 | 1.75 | | | 530.237 | 302.993 | 1022.48 |
| 200000 | 200 | 1 | 16 | OK | 315.95 | 232.071 | 329.196 | 971.045 | 1.75 | 39.1866 | 91.047 | 195.434 | 111.677 | 1166.48 |
| 200000 | 200 | 1 | 16 | OK | 379.238 | | | 426.832 | 1.75 | | | 302.135 | 172.649 | 728.967 |
| 200000 | 200 | 1 | 20 | OK | 311.438 | 351.667 | 322.797 | 1078.15 | 1.75 | 38.2812 | 91.1568 | 194.612 | 111.207 | 1272.76 |
| 200000 | 200 | 1 | 20 | OK | 363.692 | | | 411.21 | 1.75 | | | 259.382 | 148.218 | 670.592 |
| 200000 | 200 | 1 | 32 | OK | 312.798 | 209.116 | 295.373 | 910.875 | 1.75 | 36.0304 | 84.8755 | 185.978 | 106.273 | 1096.85 |
| 200000 | 200 | 1 | 32 | OK | 345.538 | | | 393.077 | 1.75 | | | 191.362 | 109.35 | 584.439 |
| 200000 | 200 | 1 | 50 | OK | 321.419 | 318.497 | 293.871 | 1024.76 | 1.75 | 38.8344 | 85.5972 | 189.333 | 108.19 | 1214.09 |
| 200000 | 200 | 1 | 50 | OK | 324.044 | | | 371.903 | 1.75 | | | 153.03 | 87.4457 | 524.933 |
| 200000 | 500 | 1 | 1 | OK | 1688.47 | 1532.36 | 1858.95 | 5394.93 | 0.75 | 66.1727 | 111.705 | 259.951 | 259.951 | 5654.88 |
| 200000 | 500 | 1 | 1 | OK | 2632.74 | | | 2752.61 | 0.75 | | | 1736.32 | 1736.32 | 4488.93 |
| 200000 | 500 | 1 | 4 | OK | 1693.06 | 1431.29 | 1756.85 | 5194.32 | 1.75 | 104.654 | 194.251 | 443.348 | 253.342 | 5637.67 |
| 200000 | 500 | 1 | 4 | OK | 1840.45 | | | 1960.81 | 1.75 | | | 1156.48 | 660.847 | 3117.29 |
| 200000 | 500 | 1 | 8 | OK | 1693.84 | 1340.26 | 1659.41 | 5003.73 | 1.75 | 97.0821 | 186.393 | 427.996 | 244.569 | 5431.73 |
| 200000 | 500 | 1 | 8 | OK | 1682.68 | | | 1803.11 | 1.75 | | | 657.856 | 375.918 | 2460.96 |
| 200000 | 500 | 1 | 16 | OK | 1698.77 | 1216.63 | 1492.09 | 4712.89 | 1.75 | 89.458 | 173.965 | 407.338 | 232.765 | 5120.23 |
| 200000 | 500 | 1 | 16 | OK | 1583.4 | | | 1704.06 | 1.75 | | | 426.946 | 243.969 | 2131.01 |
| 200000 | 500 | 1 | 20 | OK | 1726.41 | 1184.71 | 1468.31 | 4683.48 | 1.75 | 90.5597 | 170.954 | 405.638 | 231.793 | 5089.12 |
| 200000 | 500 | 1 | 20 | OK | 1587.73 | | | 1708.26 | 1.75 | | | 376.076 | 214.901 | 2084.33 |
| 200000 | 500 | 1 | 32 | NConv | 1779.99 | 975.352 | 1364.94 | 4438.82 | 1000.25 | 41327.9 | 72073.3 | 176271 | 176.227 | 180710 |
| 200000 | 500 | 1 | 32 | OK | 1520.89 | | | 1641.52 | 1.75 | | | 305.708 | 174.69 | 1947.23 |
| 200000 | 500 | 1 | 50 | OK | 1814.07 | 958.638 | 1169.92 | 4232.49 | 1.75 | 82.5445 | 146.353 | 372.89 | 213.08 | 4605.38 |
| 200000 | 500 | 1 | 50 | OK | 1491.85 | | | 1612.17 | 1.75 | | | 266.121 | 152.069 | 1878.29 |
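
The timing columns in the rows above can be cross-checked with a small sketch. It relies on the stated relation that the total is the sum of T-PreP and T-Kry. It also relies on a pattern inferred from the numbers, not stated in the legend: T-KryPIt matches T-Kry divided by nItrs, and rows with nItrs below 1 report T-Kry unchanged. The function name and the tolerance are illustrative.

```python
def check_row(t_prep, n_itrs, t_kry, t_kry_pit, t_total, rel_tol=1e-3):
    """Return (total_ok, per_iteration_ok) for one results row.

    All times are in milliseconds. The tolerance allows for the rounding
    of the reported values.
    """
    # Stated in the legend: total = preprocessing + Krylov time
    total_ok = abs((t_prep + t_kry) - t_total) <= rel_tol * t_total

    # Inferred from the data (assumption): per-iteration time uses
    # max(nItrs, 1) as the divisor
    per_it = t_kry / max(n_itrs, 1.0)
    per_it_ok = abs(per_it - t_kry_pit) <= rel_tol * t_kry_pit
    return total_ok, per_it_ok


# First row (K = 10, NPrtns = 1) and third row (K = 10, NPrtns = 4)
first = check_row(76.519, 0.75, 53.527, 53.527, 130.046)
third = check_row(73.939, 2.75, 131.912, 47.968, 205.851)
```

Both rows satisfy the two checks within the rounding of the reported values.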