1-20-2016 Banded Results BCR Test



• Name: the name of the matrix (test)
• N: the dimension of the matrix (number of rows and columns)
• NNZ: number of non-zeros
• SPD: whether the matrix is specified by the user to be symmetric positive definite (values: 0 or 1)
• DB: indicate whether DB reordering is performed. Values: 0 or 1.
• K-DB: the half-bandwidth after DB reordering method (without any drop-off). If DB is specified not to be executed, then this reports the original half-bandwidth
• KnoDrp: the half-bandwidth after DB and CM reordering but before drop-off
• K: the half-bandwidth after reordering and drop-off
• FRate: fill-in rate. See NOTES below
• nuKf: non-uniform K factor. Indicates whether the K changes a lot from row to row. Values are between 0 and 1, with 0 indicating a perfectly uniform bandwidth over the entire matrix. See NOTES below
• Solves: indicates whether we managed to solve the problem or not. OK means solved fine, otherwise a reason is provided for failure
• Bstng: indicates whether we enable diagonal boosting when doing factorization. Values: 0 or 1
• SolAcc: infinity norm of the array storing the relative errors
• T-DB: time to run DB reordering for the matrix on the CPU
• T-CM: time to run CM reordering for the matrix on the CPU
• T-Drop: time to drop off off-diagonal elements to decrease bandwidth. Done on the CPU.
• T-Dtransf: data transfer from CPU to GPU
• T-Asmbl: after reordering and drop-off, copy the sparse matrix to banded matrix stored in GPU memory
• LU-M: LU method (complete, ILUT or ILUULT)
• Fill-in: the fill-in factor of ILUT (-1 indicates complete LU)
• NPrtns: the number of partitions used to solve the problem
• T-BC: time required to get off-diagonal right hand sides (Bs and Cs) from the banded matrix - done on the GPU
• T-LU: LU time
• GFlps-LU: LU GFLOPs
• T-SPK: time to solve for the spikes Vs and Ws - done on the GPU
• T-LUrdcd: time required to factorize the reduced matrices - done on the GPU
• T-PreP: the sum of all preprocessing times, see NOTES
• Kry-M: the method used in Krylov solving stage (can be BiCGStab2 (0), BiCGStab (1), or CG(2))
• nItrs: the number of Krylov-solve iterations to solve the problem
• T-Kry: time spent in the Krylov solver (on the GPU)
• Total: total time to solve the problem, sum of PreProc + T-Kry
• Pardiso: the time for the commercial tool "Pardiso" to solve the problem
• SlwD: the slowdown ratio of our solver compared to Pardiso (a value less than one means that we are faster than Pardiso. The value is shown in green if we run faster and shown in red if we run more than 5 times slower.)
• Fastest: the time when SaP runs fastest historically
• SpdUp: the speedup of this run compared to the historical fastest run (the value is shown in green if the speedup is more than 5% and shown in red if the slowdown is more than 5%)


NOTES:
1) nuKf = 1/(2KN)*[sum over i from 1 to N of (2K - K_{iLeft} - K_{iRight})], where K_{iLeft} is the row half-bandwidth to the left of the diagonal while K_{iRight} is the row half-bandwidth to the right of the diagonal.
2) FRate = the actual number of NNZ / ((2K+1)N).
3) All times reported are in miliseconds (1E-3 second)


N

K

d

NPrtns

Solves

T-LU

T-SwDef

T-MMDef

T-PreP

nItrs

T-SwInf

T-MVInf

T-Kry

T-KryPIt

T-Total

200000

10

1

1

OK

23.5409

12.3199

46.8256

97.097

0.75

10.9091

30.297

54.916

54.916

152.013

200000

10

1

1

OK

264.042

268.343

0.75

1205.36

1205.36

1473.7

200000

10

1

4

OK

18.7479

12.0145

33.6595

74.338

2.75

27.2533

77.3196

132.168

48.0611

206.506

200000

10

1

4

OK

66.5895

70.811

2.75

1154.64

419.87

1225.45

200000

10

1

8

OK

18.0972

11.8874

33.1109

72.913

2.75

26.8103

76.8312

130.971

47.6258

203.884

200000

10

1

8

OK

33.6487

37.819

2.75

593.676

215.882

631.495

200000

10

1

16

OK

17.4481

11.8513

32.8796

72.072

2.75

26.5944

77.5227

131.206

47.7113

203.278

200000

10

1

16

OK

17.2009

21.337

2.75

312.218

113.534

333.555

200000

10

1

20

OK

16.6575

11.6405

32.1309

70.289

3.25

29.8345

87.5913

147.408

45.3563

217.697

200000

10

1

20

OK

13.9548

18.127

3.25

294.553

90.6317

312.68

200000

10

1

32

OK

16.6361

11.5874

32.0618

70.164

3.25

33.8656

99.5987

166.584

51.2566

236.748

200000

10

1

32

OK

9.04678

13.223

3.25

222.019

68.3135

235.242

200000

10

1

50

OK

15.9171

11.5021

31.5167

68.756

3.25

33.3113

99.0649

165.074

50.792

233.83

200000

10

1

50

OK

6.10282

10.287

3.25

155.651

47.8926

165.938

200000

20

1

1

OK

28.5954

20.7991

41.9407

101.995

0.75

10.7879

32.3563

57.789

57.789

159.784

200000

20

1

1

OK

288.685

295.315

0.75

1282.78

1282.78

1578.09

200000

20

1

4

OK

27.2633

20.3312

41.3687

99.645

2.75

27.0207

84.0684

144.77

52.6436

244.415

200000

20

1

4

OK

72.3433

78.923

2.75

1212.23

440.811

1291.15

200000

20

1

8

OK

26.4665

19.9675

40.6673

97.715

2.75

26.3529

83.6605

143.253

52.092

240.968

200000

20

1

8

OK

36.6552

43.176

2.75

623.252

226.637

666.428

200000

20

1

16

OK

25.5553

19.3905

39.8349

95.343

2.75

25.465

82.3157

140.473

51.0811

235.816

200000

20

1

16

OK

19.8944

26.418

2.75

331.027

120.373

357.445

200000

20

1

20

OK

24.9844

19.4188

39.5519

94.511

2.75

25.0304

82.6227

140.291

51.0149

234.802

200000

20

1

20

OK

16.1953

22.757

2.75

271.909

98.876

294.666

200000

20

1

32

OK

24.8411

19.0522

39.462

93.91

2.75

24.9259

81.9365

139.236

50.6313

233.146

200000

20

1

32

OK

11.5806

18.09

2.75

185.827

67.5735

203.917

200000

20

1

50

OK

27.9744

18.8914

42.9806

100.502

2.75

24.4429

82.0364

142.281

51.7385

242.783

200000

20

1

50

OK

8.40461

14.914

2.75

131.06

47.6582

145.974

200000

50

1

1

OK

58.0237

40.9064

72.3931

189.358

0.75

13.3182

32.7946

63.652

63.652

253.01

200000

50

1

1

OK

1026.15

1040.23

0.75

1444.06

1444.06

2484.3

200000

50

1

4

OK

56.1404

38.8279

71.3165

184.296

2.25

27.161

71.0329

132.63

58.9467

316.926

200000

50

1

4

OK

258.766

272.444

2.25

1120.56

498.024

1393

200000

50

1

8

OK

55.1715

37.8706

70.2052

181.266

2.25

26.0986

70.4586

130.726

58.1004

311.992

200000

50

1

8

OK

130.681

144.316

2.25

579.606

257.603

723.922

200000

50

1

16

OK

54.2884

36.6911

63.5545

172.512

2.25

25.0163

69.7601

128.682

57.192

301.194

200000

50

1

16

OK

77.4865

91.109

2.25

309.25

137.444

400.359

200000

50

1

20

OK

54.5436

36.8165

63.6516

172.966

2.25

29.5838

82.8365

151.702

67.4231

324.668

200000

50

1

20

OK

63.6394

77.238

2.25

302.107

134.27

379.345

200000

50

1

32

OK

53.3134

35.1868

60.6972

167.018

2.25

28.3638

81.3298

148.4

65.9556

315.418

200000

50

1

32

OK

46.9161

60.472

2.25

204.388

90.8391

264.86

200000

50

1

50

OK

54.1571

34.9201

61.3243

168.266

2.25

23.988

69.3876

127.026

56.456

295.292

200000

50

1

50

OK

36.908

50.441

2.25

125.479

55.7684

175.92

200000

100

1

1

OK

110.558

154.845

145.895

444.517

0.75

17.762

38.3844

79.664

79.664

524.181

200000

100

1

1

OK

928.551

953.571

0.75

1469.89

1469.89

2423.46

200000

100

1

4

OK

107.2

144.703

141.491

426.469

1.75

28.5833

67.112

135.724

77.5566

562.193

200000

100

1

4

OK

383.443

408.205

1.75

938.884

536.505

1347.09

200000

100

1

8

OK

106.057

139.669

132.399

411.26

1.75

27.0421

66.3456

133.17

76.0971

544.43

200000

100

1

8

OK

248.764

273.459

1.75

493.355

281.917

766.814

200000

100

1

16

OK

104.919

134.24

128.273

400.55

1.75

25.5376

65.2004

130.123

74.356

530.673

200000

100

1

16

OK

173.004

197.713

1.75

268.804

153.602

466.517

200000

100

1

20

OK

105.783

134.139

129.142

401.96

1.75

25.421

65.7383

130.611

74.6349

532.571

200000

100

1

20

OK

161.879

186.534

1.75

224.411

128.235

410.945

200000

100

1

32

OK

104.83

94.465

122.826

355.241

1.75

24.3843

63.6116

127.425

72.8143

482.666

200000

100

1

32

OK

141.404

166.014

1.75

157.103

89.7731

323.117

200000

100

1

50

OK

106.383

125.237

122.344

386.88

2.25

29.3842

78.1682

154.717

68.7631

541.597

200000

100

1

50

OK

132.73

157.349

2.25

142.081

63.1471

299.43

200000

200

1

1

OK

313.893

416.677

369.662

1194.9

0.75

27.9336

54.3775

120.425

120.425

1315.32

200000

200

1

1

OK

1194.39

1242.82

0.75

1530.92

1530.92

2773.74

200000

200

1

4

OK

310.964

388.024

349.771

1143.15

1.75

44.1324

95.4381

206.749

118.142

1349.9

200000

200

1

4

OK

588.364

636.466

1.75

982.774

561.585

1619.24

200000

200

1

8

OK

313.436

376.04

342.586

1126.09

1.75

41.6869

94.373

201.585

115.191

1327.68

200000

200

1

8

OK

438.971

486.664

1.75

526.323

300.756

1012.99

200000

200

1

16

OK

315.653

231.643

329.174

971.522

1.75

39.1766

90.9358

195.242

111.567

1166.76

200000

200

1

16

OK

379.35

427.45

1.75

303.363

173.35

730.813

200000

200

1

20

OK

311.5

352.037

322.919

1080.03

1.75

38.2949

91.1745

194.662

111.235

1274.69

200000

200

1

20

OK

359.371

407.564

1.75

257.423

147.099

664.987

200000

200

1

32

OK

312.933

209.672

295.869

913.192

1.75

36.0314

84.9204

186.135

106.363

1099.33

200000

200

1

32

OK

346.018

393.865

1.75

191.191

109.252

585.056

200000

200

1

50

OK

317.348

315.724

290.053

1014.74

1.75

38.5028

84.6367

188.253

107.573

1202.99

200000

200

1

50

OK

327.736

375.755

1.75

153.906

87.9463

529.661

200000

500

1

1

OK

1686.43

1532.06

1859.55

5398.76

0.75

66.1874

111.743

259.91

259.91

5658.67

200000

500

1

1

OK

2663.95

2787.04

0.75

1760.24

1760.24

4547.28

200000

500

1

4

OK

1691.84

1427.84

1755.42

5193.44

1.75

104.598

193.96

443.658

253.519

5637.09

200000

500

1

4

OK

1840.94

1963.18

1.75

1157.29

661.309

3120.47

200000

500

1

8

OK

1693.94

1339.79

1658.18

5006.97

1.75

97.2726

186.452

428.681

244.961

5435.65

200000

500

1

8

OK

1706.47

1829.01

1.75

665.322

380.184

2494.34

200000

500

1

16

OK

1700.34

1215.19

1492.33

4717.92

1.75

89.5659

174.2

408.229

233.274

5126.15

200000

500

1

16

OK

1568.47

1690.76

1.75

424.424

242.528

2115.18

200000

500

1

20

OK

1725.46

1183

1466.56

4683.23

1.75

90.5371

170.707

405.515

231.723

5088.75

200000

500

1

20

OK

1567.68

1689.91

1.75

373.043

213.167

2062.95

200000

500

1

32

NConv

1778.92

974.686

1363.68

4440.04

1000.25

41316

72032.2

176376

176.332

180816

200000

500

1

32

OK

1520.26

1642.08

1.75

305.743

174.71

1947.82

200000

500

1

50

OK

1811.32

956.899

1167.86

4230.96

1.75

82.4043

146.134

373.084

213.191

4604.04

200000

500

1

50

OK

1492.46

1614.86

1.75

266.156

152.089

1881.02