1-19-2016 Banded Results BCR Test



• Name: the name of the matrix (test)
• N: the dimension of the matrix (number of rows and columns)
• NNZ: number of non-zeros
• SPD: whether the matrix is specified by the user to be symmetric positive definite (values: 0 or 1)
• DB: indicate whether DB reordering is performed. Values: 0 or 1.
• K-DB: the half-bandwidth after DB reordering method (without any drop-off). If DB is specified not to be executed, then this reports the original half-bandwidth
• KnoDrp: the half-bandwidth after DB and CM reordering but before drop-off
• K: the half-bandwidth after reordering and drop-off
• FRate: fill-in rate. See NOTES below
• nuKf: non-uniform K factor. Indicates whether the K changes a lot from row to row. Values are between 0 and 1, with 0 indicating a perfectly uniform bandwidth over the entire matrix. See NOTES below
• Solves: indicates whether we managed to solve the problem or not. OK means solved fine, otherwise a reason is provided for failure
• Bstng: indicates whether we enable diagonal boosting when doing factorization. Values: 0 or 1
• SolAcc: infinity norm of the array storing the relative errors
• T-DB: time to run DB reordering for the matrix on the CPU
• T-CM: time to run CM reordering for the matrix on the CPU
• T-Drop: time to drop off off-diagonal elements to decrease bandwidth. Done on the CPU.
• T-Dtransf: data transfer from CPU to GPU
• T-Asmbl: after reordering and drop-off, copy the sparse matrix to banded matrix stored in GPU memory
• LU-M: LU method (complete, ILUT or ILUULT)
• Fill-in: the fill-in factor of ILUT (-1 indicates complete LU)
• NPrtns: the number of partitions used to solve the problem
• T-BC: time required to get off-diagonal right hand sides (Bs and Cs) from the banded matrix - done on the GPU
• T-LU: LU time
• GFlps-LU: LU GFLOPs
• T-SPK: time to solve for the spikes Vs and Ws - done on the GPU
• T-LUrdcd: time required to factorize the reduced matrices - done on the GPU
• T-PreP: the sum of all preprocessing times, see NOTES
• Kry-M: the method used in Krylov solving stage (can be BiCGStab2 (0), BiCGStab (1), or CG(2))
• nItrs: the number of Krylov-solve iterations to solve the problem
• T-Kry: time spent in the Krylov solver (on the GPU)
• Total: total time to solve the problem, sum of PreProc + T-Kry
• Pardiso: the time for the commercial tool "Pardiso" to solve the problem
• SlwD: the slowdown ratio of our solver compared to Pardiso (a value less than one means that we are faster than Pardiso. The value is shown in green if we run faster and shown in red if we run more than 5 times slower.)
• Fastest: the time when SaP runs fastest historically
• SpdUp: the speedup of this run compared to the historical fastest run (the value is shown in green if the speedup is more than 5% and shown in red if the slowdown is more than 5%)


NOTES:
1) nuKf = 1/(2KN)*[sum over i from 1 to N of (2K - K_{iLeft} - K_{iRight})], where K_{iLeft} is the row half-bandwidth to the left of the diagonal while K_{iRight} is the row half-bandwidth to the right of the diagonal.
2) FRate = the actual number of NNZ / ((2K+1)N).
3) All times reported are in miliseconds (1E-3 second)


N

K

d

NPrtns

Solves

T-LU

T-SwDef

T-MMDef

T-PreP

nItrs

T-SwInf

T-MVInf

T-Kry

T-KryPIt

T-Total

200000

10

1

1

OK

20.1886

12.241

34.4159

76.672

0.75

10.8273

30.1116

53.4

53.4

130.072

200000

10

1

1

OK

266.845

270.906

0.75

1213.98

1213.98

1484.88

200000

10

1

4

OK

18.8657

12.1149

33.8215

74.625

2.75

27.5029

78.356

133.584

48.576

208.209

200000

10

1

4

OK

66.3157

70.429

2.75

1152.76

419.184

1223.18

200000

10

1

8

OK

18.0642

11.8712

33.2569

72.964

2.75

26.8152

76.8682

130.925

47.6091

203.889

200000

10

1

8

OK

33.6288

37.768

2.75

592.596

215.489

630.364

200000

10

1

16

OK

17.4995

11.8465

32.9118

72.049

2.75

26.6085

77.5487

131.382

47.7753

203.431

200000

10

1

16

OK

17.3913

21.571

2.75

318.686

115.886

340.257

200000

10

1

20

OK

16.6357

11.6361

32.2286

70.299

3.25

29.8329

87.5701

147.181

45.2865

217.48

200000

10

1

20

OK

13.9186

18.1

3.25

294.213

90.5271

312.313

200000

10

1

32

OK

16.8318

11.7479

32.3378

70.793

3.25

34.4846

101.443

169.642

52.1975

240.435

200000

10

1

32

OK

9.03398

13.185

3.25

221.773

68.2378

234.958

200000

10

1

50

OK

16.0652

11.6234

31.7932

69.233

3.25

33.7042

100.525

167.261

51.4649

236.494

200000

10

1

50

OK

6.15901

10.303

3.25

156.638

48.1963

166.941

200000

20

1

1

OK

28.6018

20.7603

42.0767

101.953

0.75

10.9099

32.5881

60.678

60.678

162.631

200000

20

1

1

OK

287.008

293.373

0.75

1270.98

1270.98

1564.35

200000

20

1

4

OK

27.0702

20.1192

41.1866

98.868

2.75

26.7782

83.0099

143.101

52.0367

241.969

200000

20

1

4

OK

72.2375

78.695

2.75

1208.56

439.476

1287.25

200000

20

1

8

OK

26.3154

19.7537

40.3166

96.803

2.75

26.0926

82.5741

141.622

51.4989

238.425

200000

20

1

8

OK

37.0909

43.58

2.75

629.04

228.742

672.62

200000

20

1

16

OK

28.1593

19.3872

39.7773

97.756

2.75

25.458

82.3382

140.324

51.0269

238.08

200000

20

1

16

OK

19.9168

26.374

2.75

330.451

120.164

356.825

200000

20

1

20

OK

24.9895

19.425

39.7382

94.63

2.75

25.0192

82.6152

140.112

50.9498

234.742

200000

20

1

20

OK

16.2093

22.699

2.75

272.16

98.9673

294.859

200000

20

1

32

OK

24.896

19.0616

39.357

93.72

2.75

24.9184

81.9217

139.041

50.5604

232.761

200000

20

1

32

OK

11.4873

17.972

2.75

184.053

66.9284

202.025

200000

20

1

50

OK

24.0609

18.8312

34.0332

87.367

2.75

24.1772

81.5755

137.581

50.0295

224.948

200000

20

1

50

OK

8.42723

14.918

2.75

131.273

47.7356

146.191

200000

50

1

1

OK

57.9827

40.857

72.3666

189.012

0.75

13.3076

32.7841

63.539

63.539

252.551

200000

50

1

1

OK

1025.3

1038.4

0.75

1441.72

1441.72

2480.12

200000

50

1

4

OK

56.2927

38.9091

71.5598

184.511

2.25

27.4359

71.286

133.073

59.1436

317.584

200000

50

1

4

OK

258.855

272.266

2.25

1118.81

497.248

1391.07

200000

50

1

8

OK

55.2386

38.0425

70.3942

181.478

2.25

26.1108

70.4532

130.616

58.0516

312.094

200000

50

1

8

OK

130.655

144.016

2.25

579.966

257.763

723.982

200000

50

1

16

OK

54.3079

36.6731

63.5915

172.314

2.25

25.0124

69.7348

128.518

57.1191

300.832

200000

50

1

16

OK

77.7438

91.242

2.25

308.488

137.106

399.73

200000

50

1

20

OK

54.5801

36.8492

63.7848

172.985

2.25

29.5916

82.8662

151.478

67.3236

324.463

200000

50

1

20

OK

62.6096

76.136

2.25

299.519

133.12

375.655

200000

50

1

32

OK

53.9894

35.5739

61.3352

168.687

2.25

28.6706

82.4572

149.841

66.596

318.528

200000

50

1

32

OK

46.3989

59.879

2.25

204.326

90.8116

264.205

200000

50

1

50

OK

55.9698

34.8786

61.2322

169.845

2.25

23.9803

69.3812

126.857

56.3809

296.702

200000

50

1

50

OK

36.5884

50.133

2.25

125.374

55.7218

175.507

200000

100

1

1

OK

109.422

153.394

144.211

439.791

0.75

17.6179

37.9765

79.018

79.018

518.809

200000

100

1

1

OK

937.893

962.021

0.75

1469.06

1469.06

2431.08

200000

100

1

4

OK

107.267

144.902

141.567

426.47

1.75

28.6279

67.1791

135.551

77.4577

562.021

200000

100

1

4

OK

383.178

407.562

1.75

938.443

536.253

1346.01

200000

100

1

8

OK

106.091

139.61

132.367

410.8

1.75

27.029

66.3267

132.934

75.9623

543.734

200000

100

1

8

OK

248.358

272.738

1.75

491.797

281.027

764.535

200000

100

1

16

OK

106.109

135.559

129.849

404.203

1.75

25.7405

65.9404

131.011

74.8634

535.214

200000

100

1

16

OK

173.03

197.437

1.75

268.325

153.329

465.762

200000

100

1

20

OK

106.945

135.283

130.749

405.668

1.75

25.6432

66.482

131.44

75.1086

537.108

200000

100

1

20

OK

161.923

186.387

1.75

224.391

128.223

410.778

200000

100

1

32

OK

105.476

94.4311

122.699

355.743

1.75

24.3171

63.4802

127.153

72.6589

482.896

200000

100

1

32

OK

143.264

167.877

1.75

157.707

90.1183

325.584

200000

100

1

50

OK

106.472

125.112

122.359

386.388

2.25

29.4024

78.215

154.67

68.7422

541.058

200000

100

1

50

OK

131.13

155.614

2.25

141.733

62.9924

297.347

200000

200

1

1

OK

317.813

420.762

374.293

1205.68

0.75

28.165

55.0337

120.888

120.888

1326.57

200000

200

1

1

OK

1183.08

1229.95

0.75

1512.91

1512.91

2742.86

200000

200

1

4

OK

310.836

387.498

349.293

1141.15

1.75

44.1359

95.4092

205.26

117.291

1346.41

200000

200

1

4

OK

588.575

636.037

1.75

981.908

561.09

1617.95

200000

200

1

8

OK

309.088

371.396

337.915

1111.64

1.75

41.2775

93.1506

199.979

114.274

1311.62

200000

200

1

8

OK

438.818

486.246

1.75

525.975

300.557

1012.22

200000

200

1

16

OK

315.63

231.848

328.912

970.528

1.75

39.1817

90.9299

195.061

111.463

1165.59

200000

200

1

16

OK

385.611

433.161

1.75

304.509

174.005

737.67

200000

200

1

20

OK

315.291

355.503

327.128

1090.25

1.75

38.7732

92.1071

196.184

112.105

1286.43

200000

200

1

20

OK

359.798

407.392

1.75

257.314

147.037

664.706

200000

200

1

32

OK

312.705

209.401

295.68

911.16

1.75

36.0423

84.8661

185.454

105.974

1096.61

200000

200

1

32

OK

345.575

393.422

1.75

191.336

109.335

584.758

200000

200

1

50

OK

317.205

314.968

289.518

1012.55

1.75

38.5009

84.5463

188.037

107.45

1200.59

200000

200

1

50

OK

323.775

371.06

1.75

152.877

87.3583

523.937

200000

500

1

1

OK

1686.12

1532.38

1857.97

5391.01

0.75

66.1846

111.709

259.948

259.948

5650.96

200000

500

1

1

OK

2663.19

2783.63

0.75

1756.98

1756.98

4540.61

200000

500

1

4

OK

1690.88

1429.26

1753.21

5186.1

1.75

104.463

193.883

443.017

253.153

5629.12

200000

500

1

4

OK

1843.33

1963.77

1.75

1157.42

661.381

3121.19

200000

500

1

8

OK

1693.54

1340.09

1658.54

5001.78

1.75

97.2812

186.432

427.507

244.29

5429.29

200000

500

1

8

OK

1672.54

1792.78

1.75

657.672

375.813

2450.45

200000

500

1

16

OK

1699.11

1215.51

1493.01

4713.31

1.75

89.427

174.009

407.622

232.927

5120.93

200000

500

1

16

OK

1565.77

1685.97

1.75

425.551

243.172

2111.52

200000

500

1

20

OK

1727.52

1184.5

1468.46

4684.53

1.75

90.5446

170.884

405.367

231.638

5089.9

200000

500

1

20

OK

1567.96

1688.34

1.75

373.513

213.436

2061.86

200000

500

1

32

NConv

1779.06

974.463

1363.97

4436.5

1000.25

41309.5

72008.2

176170

176.126

180606

200000

500

1

32

OK

1519.95

1640.55

1.75

305.189

174.394

1945.74

200000

500

1

50

OK

1811.09

957.584

1169.45

4228.34

1.75

82.3389

146.21

372.599

212.914

4600.94

200000

500

1

50

OK

1492.74

1612.96

1.75

265.687

151.821

1878.65