1-16-2016 Banded Results BCR Test



• Name: the name of the matrix (test)
• N: the dimension of the matrix (number of rows and columns)
• NNZ: number of non-zeros
• SPD: whether the matrix is specified by the user to be symmetric positive definite (values: 0 or 1)
• DB: indicates whether DB reordering is performed (values: 0 or 1)
• K-DB: the half-bandwidth after DB reordering (without any drop-off). If DB reordering is disabled, this reports the original half-bandwidth
• KnoDrp: the half-bandwidth after DB and CM reordering but before drop-off
• K: the half-bandwidth after reordering and drop-off
• FRate: fill-in rate. See NOTES below
• nuKf: non-uniform K factor. Indicates whether the K changes a lot from row to row. Values are between 0 and 1, with 0 indicating a perfectly uniform bandwidth over the entire matrix. See NOTES below
• Solves: indicates whether we managed to solve the problem or not. OK means solved fine, otherwise a reason is provided for failure
• Bstng: indicates whether we enable diagonal boosting when doing factorization. Values: 0 or 1
• SolAcc: infinity norm of the array storing the relative errors
• T-DB: time to run DB reordering for the matrix on the CPU
• T-CM: time to run CM reordering for the matrix on the CPU
• T-Drop: time to drop off off-diagonal elements to decrease bandwidth. Done on the CPU.
• T-Dtransf: data transfer from CPU to GPU
• T-Asmbl: after reordering and drop-off, copy the sparse matrix to banded matrix stored in GPU memory
• LU-M: LU method (complete, ILUT or ILUULT)
• Fill-in: the fill-in factor of ILUT (-1 indicates complete LU)
• NPrtns: the number of partitions used to solve the problem
• T-BC: time required to get off-diagonal right hand sides (Bs and Cs) from the banded matrix - done on the GPU
• T-LU: LU time
• GFlps-LU: LU GFLOPs
• T-SPK: time to solve for the spikes Vs and Ws - done on the GPU
• T-LUrdcd: time required to factorize the reduced matrices - done on the GPU
• T-PreP: the sum of all preprocessing times, see NOTES
• Kry-M: the method used in Krylov solving stage (can be BiCGStab2 (0), BiCGStab (1), or CG(2))
• nItrs: the number of Krylov-solve iterations to solve the problem
• T-Kry: time spent in the Krylov solver (on the GPU)
• Total: total time to solve the problem, the sum of T-PreP and T-Kry
• Pardiso: the time for the commercial tool "Pardiso" to solve the problem
• SlwD: the slowdown ratio of our solver compared to Pardiso (a value less than one means that we are faster than Pardiso. The value is shown in green if we run faster and shown in red if we run more than 5 times slower.)
• Fastest: the fastest time SaP has achieved on this problem in past runs
• SpdUp: the speedup of this run compared to the historical fastest run (the value is shown in green if the speedup is more than 5% and shown in red if the slowdown is more than 5%)
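
The derived columns in the list above (Total and SlwD) can be sketched in a few lines of Python. This is a minimal illustration, not the code that produced these results: the function names are hypothetical, SlwD is assumed to be Total divided by the Pardiso time, and the timings in the usage lines are made up.

```python
def total_time(t_prep, t_kry):
    """Total: preprocessing time plus Krylov solve time (both in ms)."""
    return t_prep + t_kry


def slowdown(total, pardiso):
    """SlwD, assumed here to be our total time divided by Pardiso's time."""
    return total / pardiso


def slowdown_color(slwd):
    """Color rule from the legend: green if faster, red if more than 5x slower."""
    if slwd < 1.0:
        return "green"
    if slwd > 5.0:
        return "red"
    return None  # no highlighting in between


# Hypothetical timings in milliseconds, for illustration only.
total = total_time(t_prep=80.0, t_kry=40.0)
ratio = slowdown(total, pardiso=240.0)
print(total, ratio, slowdown_color(ratio))
```

A ratio below 1 means the solver beat Pardiso on that problem, so smaller SlwD values are better.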


NOTES:
1) nuKf = 1/(2KN)*[sum over i from 1 to N of (2K - K_{iLeft} - K_{iRight})], where K_{iLeft} is the row half-bandwidth to the left of the diagonal while K_{iRight} is the row half-bandwidth to the right of the diagonal.
2) FRate = the actual number of NNZ / ((2K+1)N).
3) All reported times are in milliseconds (1E-3 seconds).
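
Notes 1 and 2 can be checked on a small matrix. The sketch below is an illustration under assumptions, not the solver's own code: the helper names are hypothetical, the matrix is given as 0-based (row, column) pairs of its non-zeros, and K is taken to be the largest row half-bandwidth.

```python
def row_half_bandwidths(entries, n):
    """Per-row half-bandwidths to the left and right of the diagonal."""
    k_left = [0] * n
    k_right = [0] * n
    for i, j in entries:
        if j < i:
            k_left[i] = max(k_left[i], i - j)
        elif j > i:
            k_right[i] = max(k_right[i], j - i)
    return k_left, k_right


def nukf(k_left, k_right, k):
    """Non-uniform K factor: 1/(2KN) * sum_i (2K - K_iLeft - K_iRight)."""
    n = len(k_left)
    return sum(2 * k - l - r for l, r in zip(k_left, k_right)) / (2 * k * n)


def frate(nnz, k, n):
    """Fill-in rate: actual non-zeros divided by the (2K+1)N band slots."""
    return nnz / ((2 * k + 1) * n)


# Example: a 4x4 tridiagonal matrix (10 non-zeros, K = 1).
n = 4
entries = [(i, j) for i in range(n) for j in range(n) if abs(i - j) <= 1]
k_left, k_right = row_half_bandwidths(entries, n)
k = max(k_left + k_right)
print(nukf(k_left, k_right, k), frate(len(entries), k, n))
```

Under this literal reading of the formula, the first and last rows, whose bands are cut off by the matrix edge, also contribute to the sum, so even a tridiagonal matrix yields a small non-zero nuKf.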


| N | K | d | NPrtns | Solves | T-LU | T-SwDef | T-MMDef | T-PreP | nItrs | T-SwInf | T-MVInf | T-Kry | T-KryPIt | T-Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 200000 | 10 | 1 | 1 | OK | 20.1988 | 12.2462 | 34.3715 | 76.605 | 0.75 | 10.828 | 30.1177 | 53.443 | 53.443 | 130.048 |
| 200000 | 10 | 1 | 1 | OK | 263.989 | | | 268.002 | 0.75 | | | 1205.42 | 1205.42 | 1473.42 |
| 200000 | 10 | 1 | 4 | OK | 18.9416 | 12.171 | 33.8377 | 74.812 | 2.75 | 27.6958 | 78.6662 | 134.177 | 48.7916 | 208.989 |
| 200000 | 10 | 1 | 4 | OK | 66.6236 | | | 70.792 | 2.75 | | | 1154.11 | 419.678 | 1224.91 |
| 200000 | 10 | 1 | 8 | OK | 18.2182 | 12.0145 | 33.2673 | 73.308 | 2.75 | 27.2284 | 78.1892 | 133.002 | 48.3644 | 206.31 |
| 200000 | 10 | 1 | 8 | OK | 33.9668 | | | 38.111 | 2.75 | | | 597.516 | 217.279 | 635.627 |
| 200000 | 10 | 1 | 16 | OK | 17.4869 | 11.8486 | 32.9224 | 72.052 | 2.75 | 26.6095 | 77.5478 | 131.122 | 47.6807 | 203.174 |
| 200000 | 10 | 1 | 16 | OK | 17.1507 | | | 21.3 | 2.75 | | | 311.988 | 113.45 | 333.288 |
| 200000 | 10 | 1 | 20 | OK | 16.6191 | 11.6389 | 32.0302 | 70.036 | 3.25 | 29.8482 | 87.6031 | 147.229 | 45.3012 | 217.265 |
| 200000 | 10 | 1 | 20 | OK | 14.0461 | | | 18.21 | 3.25 | | | 296.314 | 91.1735 | 314.524 |
| 200000 | 10 | 1 | 32 | OK | 16.5893 | 11.5841 | 32.0038 | 69.947 | 3.25 | 33.8581 | 99.6027 | 166.55 | 51.2462 | 236.497 |
| 200000 | 10 | 1 | 32 | OK | 9.04013 | | | 13.261 | 3.25 | | | 221.578 | 68.1778 | 234.839 |
| 200000 | 10 | 1 | 50 | OK | 15.8986 | 11.4932 | 31.4694 | 68.573 | 3.25 | 33.2736 | 98.9716 | 164.746 | 50.6911 | 233.319 |
| 200000 | 10 | 1 | 50 | OK | 6.10733 | | | 10.321 | 3.25 | | | 155.507 | 47.8483 | 165.828 |
| 200000 | 20 | 1 | 1 | OK | 28.508 | 20.7564 | 41.9221 | 101.678 | 0.75 | 10.7706 | 32.3049 | 57.75 | 57.75 | 159.428 |
| 200000 | 20 | 1 | 1 | OK | 286.308 | | | 292.637 | 0.75 | | | 1267.69 | 1267.69 | 1560.33 |
| 200000 | 20 | 1 | 4 | OK | 27.0578 | 20.1234 | 41.1555 | 98.766 | 2.75 | 26.7544 | 82.9656 | 142.935 | 51.9764 | 241.701 |
| 200000 | 20 | 1 | 4 | OK | 72.636 | | | 79.077 | 2.75 | | | 1210.9 | 440.327 | 1289.98 |
| 200000 | 20 | 1 | 8 | OK | 26.5056 | 19.9587 | 40.6379 | 97.616 | 2.75 | 26.3327 | 83.6272 | 143.243 | 52.0884 | 240.859 |
| 200000 | 20 | 1 | 8 | OK | 36.9654 | | | 43.437 | 2.75 | | | 628.963 | 228.714 | 672.4 |
| 200000 | 20 | 1 | 16 | OK | 25.7761 | 19.5988 | 40.0579 | 95.962 | 2.75 | 25.7027 | 83.37 | 141.876 | 51.5913 | 237.838 |
| 200000 | 20 | 1 | 16 | OK | 20.1263 | | | 26.65 | 2.75 | | | 333.671 | 121.335 | 360.321 |
| 200000 | 20 | 1 | 20 | OK | 24.9072 | 19.2296 | 39.47 | 94.025 | 2.75 | 24.7686 | 81.5182 | 138.719 | 50.4433 | 232.744 |
| 200000 | 20 | 1 | 20 | OK | 16.1839 | | | 22.713 | 2.75 | | | 271.928 | 98.8829 | 294.641 |
| 200000 | 20 | 1 | 32 | OK | 28.2673 | 19.1691 | 48.5404 | 106.455 | 2.75 | 25.3333 | 82.7468 | 144.968 | 52.7156 | 251.423 |
| 200000 | 20 | 1 | 32 | OK | 11.558 | | | 18.059 | 2.75 | | | 185.528 | 67.4647 | 203.587 |
| 200000 | 20 | 1 | 50 | OK | 24.2621 | 19.0469 | 34.2086 | 87.995 | 2.75 | 24.3925 | 82.6502 | 139.294 | 50.6524 | 227.289 |
| 200000 | 20 | 1 | 50 | OK | 8.39536 | | | 14.859 | 2.75 | | | 131.311 | 47.7495 | 146.17 |
| 200000 | 50 | 1 | 1 | OK | 58.0792 | 40.8943 | 72.4042 | 189.282 | 0.75 | 13.3238 | 32.8045 | 63.548 | 63.548 | 252.83 |
| 200000 | 50 | 1 | 1 | OK | 1039.52 | | | 1052.58 | 0.75 | | | 1456.48 | 1456.48 | 2509.06 |
| 200000 | 50 | 1 | 4 | OK | 56.1936 | 38.9235 | 71.4524 | 184.402 | 2.25 | 27.2069 | 71.0794 | 132.581 | 58.9249 | 316.983 |
| 200000 | 50 | 1 | 4 | OK | 258.852 | | | 272.221 | 2.25 | | | 1119.74 | 497.664 | 1391.97 |
| 200000 | 50 | 1 | 8 | OK | 55.2036 | 38.0594 | 70.1875 | 181.147 | 2.25 | 26.1138 | 70.4325 | 130.551 | 58.0227 | 311.698 |
| 200000 | 50 | 1 | 8 | OK | 132.393 | | | 145.917 | 2.25 | | | 587.928 | 261.301 | 733.845 |
| 200000 | 50 | 1 | 16 | OK | 54.3586 | 36.7059 | 63.6818 | 172.569 | 2.25 | 25.0251 | 69.7769 | 128.552 | 57.1342 | 301.121 |
| 200000 | 50 | 1 | 16 | OK | 77.7356 | | | 91.211 | 2.25 | | | 308.339 | 137.04 | 399.55 |
| 200000 | 50 | 1 | 20 | OK | 54.4802 | 36.794 | 63.6831 | 172.722 | 2.25 | 29.5405 | 82.7402 | 151.198 | 67.1991 | 323.92 |
| 200000 | 50 | 1 | 20 | OK | 62.8756 | | | 76.345 | 2.25 | | | 299.418 | 133.075 | 375.763 |
| 200000 | 50 | 1 | 32 | OK | 53.4561 | 35.2542 | 60.6781 | 167.133 | 2.25 | 28.391 | 81.3842 | 148.459 | 65.9818 | 315.592 |
| 200000 | 50 | 1 | 32 | OK | 46.6379 | | | 60.166 | 2.25 | | | 204.309 | 90.804 | 264.475 |
| 200000 | 50 | 1 | 50 | OK | 54.3378 | 34.9302 | 61.3983 | 168.412 | 2.25 | 24.1813 | 69.6099 | 127.152 | 56.512 | 295.564 |
| 200000 | 50 | 1 | 50 | OK | 37.1709 | | | 50.737 | 2.25 | | | 125.574 | 55.8107 | 176.311 |
| 200000 | 100 | 1 | 1 | OK | 109.386 | 153.285 | 144.238 | 439.504 | 0.75 | 17.6183 | 37.9787 | 79.156 | 79.156 | 518.66 |
| 200000 | 100 | 1 | 1 | OK | 928.635 | | | 952.791 | 0.75 | | | 1468.49 | 1468.49 | 2421.28 |
| 200000 | 100 | 1 | 4 | OK | 107.174 | 144.631 | 141.449 | 425.822 | 1.75 | 28.5715 | 67.1252 | 135.544 | 77.4537 | 561.366 |
| 200000 | 100 | 1 | 4 | OK | 383.273 | | | 407.705 | 1.75 | | | 938.095 | 536.054 | 1345.8 |
| 200000 | 100 | 1 | 8 | OK | 106.06 | 139.454 | 132.501 | 410.723 | 1.75 | 27.0294 | 66.3107 | 133.094 | 76.0537 | 543.817 |
| 200000 | 100 | 1 | 8 | OK | 248.182 | | | 272.572 | 1.75 | | | 490.97 | 280.554 | 763.542 |
| 200000 | 100 | 1 | 16 | OK | 104.933 | 134.398 | 128.344 | 400.45 | 1.75 | 25.5232 | 65.1932 | 130.155 | 74.3743 | 530.605 |
| 200000 | 100 | 1 | 16 | OK | 173.215 | | | 197.626 | 1.75 | | | 268.826 | 153.615 | 466.452 |
| 200000 | 100 | 1 | 20 | OK | 105.809 | 134.143 | 129.13 | 401.6 | 1.75 | 25.4176 | 65.7543 | 130.566 | 74.6091 | 532.166 |
| 200000 | 100 | 1 | 20 | OK | 161.855 | | | 186.393 | 1.75 | | | 224.299 | 128.171 | 410.692 |
| 200000 | 100 | 1 | 32 | OK | 104.623 | 94.4487 | 122.631 | 354.425 | 1.75 | 24.2723 | 63.4225 | 126.774 | 72.4423 | 481.199 |
| 200000 | 100 | 1 | 32 | OK | 141.433 | | | 165.945 | 1.75 | | | 156.97 | 89.6971 | 322.915 |
| 200000 | 100 | 1 | 50 | OK | 108.146 | 124.968 | 122.417 | 387.918 | 2.25 | 29.4755 | 78.352 | 155.16 | 68.96 | 543.078 |
| 200000 | 100 | 1 | 50 | OK | 132.604 | | | 157.149 | 2.25 | | | 142.582 | 63.3698 | 299.731 |
| 200000 | 200 | 1 | 1 | OK | 314.255 | 416.946 | 369.368 | 1194.28 | 0.75 | 27.9638 | 54.4369 | 119.908 | 119.908 | 1314.19 |
| 200000 | 200 | 1 | 1 | OK | 1182.11 | | | 1229.39 | 0.75 | | | 1511.93 | 1511.93 | 2741.32 |
| 200000 | 200 | 1 | 4 | OK | 310.801 | 387.57 | 349.481 | 1141.11 | 1.75 | 44.1206 | 95.4194 | 205.399 | 117.371 | 1346.51 |
| 200000 | 200 | 1 | 4 | OK | 587.512 | | | 634.901 | 1.75 | | | 981.342 | 560.767 | 1616.24 |
| 200000 | 200 | 1 | 8 | OK | 309.076 | 371.744 | 338.193 | 1111.72 | 1.75 | 41.2912 | 93.2054 | 199.787 | 114.164 | 1311.51 |
| 200000 | 200 | 1 | 8 | OK | 439.757 | | | 487.324 | 1.75 | | | 526.582 | 300.904 | 1013.91 |
| 200000 | 200 | 1 | 16 | OK | 315.666 | 231.739 | 328.761 | 969.85 | 1.75 | 39.1964 | 90.9807 | 195.295 | 111.597 | 1165.14 |
| 200000 | 200 | 1 | 16 | OK | 379.313 | | | 426.645 | 1.75 | | | 302.197 | 172.684 | 728.842 |
| 200000 | 200 | 1 | 20 | OK | 311.865 | 351.919 | 323.447 | 1079.3 | 1.75 | 38.4795 | 91.2655 | 194.895 | 111.369 | 1274.19 |
| 200000 | 200 | 1 | 20 | OK | 359.566 | | | 406.894 | 1.75 | | | 257.271 | 147.012 | 664.165 |
| 200000 | 200 | 1 | 32 | OK | 316.633 | 212.078 | 299.909 | 922.335 | 1.75 | 36.3608 | 85.8309 | 187.067 | 106.895 | 1109.4 |
| 200000 | 200 | 1 | 32 | OK | 345.562 | | | 393.145 | 1.75 | | | 191.456 | 109.403 | 584.601 |
| 200000 | 200 | 1 | 50 | OK | 317.273 | 315.095 | 289.725 | 1012.76 | 1.75 | 38.4762 | 84.6081 | 187.935 | 107.391 | 1200.69 |
| 200000 | 200 | 1 | 50 | OK | 323.645 | | | 371.021 | 1.75 | | | 153.162 | 87.5211 | 524.183 |
| 200000 | 500 | 1 | 1 | OK | 1685.87 | 1532.85 | 1858.46 | 5391.85 | 0.75 | 66.1752 | 111.746 | 259.717 | 259.717 | 5651.57 |
| 200000 | 500 | 1 | 1 | OK | 2632.47 | | | 2752.57 | 0.75 | | | 1738.13 | 1738.13 | 4490.7 |
| 200000 | 500 | 1 | 4 | OK | 1711.17 | 1446.05 | 1778.96 | 5249.06 | 1.75 | 105.383 | 195.187 | 446.26 | 255.006 | 5695.32 |
| 200000 | 500 | 1 | 4 | OK | 1840.49 | | | 1960.6 | 1.75 | | | 1155.81 | 660.463 | 3116.41 |
| 200000 | 500 | 1 | 8 | OK | 1693.91 | 1339.28 | 1659.59 | 5003.48 | 1.75 | 97.1433 | 186.39 | 428.191 | 244.681 | 5431.67 |
| 200000 | 500 | 1 | 8 | OK | 1695.75 | | | 1816.58 | 1.75 | | | 664.281 | 379.589 | 2480.86 |
| 200000 | 500 | 1 | 16 | OK | 1699.05 | 1216.09 | 1492.61 | 4712.87 | 1.75 | 89.4976 | 174.019 | 407.931 | 233.103 | 5120.8 |
| 200000 | 500 | 1 | 16 | OK | 1563.1 | | | 1683.4 | 1.75 | | | 423.008 | 241.719 | 2106.41 |
| 200000 | 500 | 1 | 20 | OK | 1749.23 | 1198.82 | 1489.04 | 4740.83 | 1.75 | 91.4772 | 172.123 | 407.984 | 233.134 | 5148.82 |
| 200000 | 500 | 1 | 20 | OK | 1567.86 | | | 1688.36 | 1.75 | | | 373.913 | 213.665 | 2062.27 |
| 200000 | 500 | 1 | 32 | NConv | 1779.26 | 975.132 | 1364.6 | 4438.02 | 1000.25 | 41369.2 | 72104.7 | 176515 | 176.471 | 180953 |
| 200000 | 500 | 1 | 32 | OK | 1521.33 | | | 1641.96 | 1.75 | | | 305.766 | 174.723 | 1947.73 |
| 200000 | 500 | 1 | 50 | OK | 1813.51 | 959.48 | 1169.9 | 4232.38 | 1.75 | 82.4432 | 146.328 | 372.793 | 213.025 | 4605.17 |
| 200000 | 500 | 1 | 50 | OK | 1495.77 | | | 1616.54 | 1.75 | | | 266.168 | 152.096 | 1882.71 |