Skip to main content

Table 5 CS-4 running times (milliseconds) of MPI implementation

From: Solving the maximum subsequence sum and related problems using BSP/CGM model and multi-GPU CUDA

n

N16:P1

N16:P2

N16:P4

N16:P8

N32:P1

N32:P2

N32:P4

N32:P8

220

10.730

11.403

9.788

91.368

11.597

19.765

21.263

119.741

221

21.403

15.286

12.880

83.876

16.562

21.004

18.488

79.335

222

99.957

23.938

18.761

16.064

24.395

25.461

21.225

23.279

223

82.786

40.277

29.675

21.406

45.941

33.450

26.846

28.399

224

175.851

75.340

51.073

33.305

82.706

52.493

38.760

33.242

225

357.882

139.970

90.844

53.800

146.140

93.536

59.839

46.379

226

748.831

263.430

161.765

100.714

263.629

164.268

93.531

66.719

227

1546.020

559.685

307.145

171.782

555.407

288.917

170.073

111.906

228

1331.303

2356.562

314.681

854.059

426.578

317.116

203.811

229

24,301.072

4174.692

17,766.883