Changeset 3266 in Sophya for trunk/SophyaLib
- Timestamp:
- Jun 12, 2007, 7:22:07 PM (18 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
trunk/SophyaLib/Manual/perfmachine.txt
r3259 r3266 75 75 ---------------------------------------------------------------------------------------------- 76 76 (a)xeon-lx-2.4GHz | 1290 | 53 53 55 | 338 340 320 77 ( a)xeon-lx-2.8GHzicc | 2040 | 85 80 83 | 914 409 91477 (b)xeon-lx-2.8GHzicc | 2040 | 85 80 83 | 914 409 914 78 78 (c)amd-lx | 1560 | 65 77 68 | 666 314 686 79 79 (cc)amd2-lx | | … … 87 87 (j)xeon-osx 88 88 89 (p)ibm-aix-regatta | 3100 | 130 55 133 | 730 115 1750 90 (p)ibm-aix-meso | 3600 | 150 75 150 | 1480 203 3600 89 (p)ibm-aix-regatta | 3100 | 130 55 133 | 730 115 1750 (32 bits) 90 (p)ibm-aix-meso | 5700 | 240 70 320 | 1500 220 3400 (32 bits) 91 92 (s)sgi-magique | 336 | 14 7 15 | 340 40 460 (32 bits) 91 93 ---------------------------------------------------------------------------------------------- 92 94 … … 99 101 -> MDBLOPS (puissance de calcul sur float) 100 102 (3) int, cpupowerI 0 3000000 5 / cpupowerI 2 101 -> MINTOPS (puissance de calcul sur int )103 -> MINTOPS (puissance de calcul sur int=4 bytes) 102 104 (4) long (ou long long (*)) cpupowerL 0 3000000 5 / cpupowerL 2 103 -> MLONOPS (puissance de calcul sur long )105 -> MLONOPS (puissance de calcul sur long=8 bytes) 104 106 ---------------------------------------------------------------------------------------------- 105 107 MFLOPS | (1)MFLOPS (2)MDBLOPS (3)MINTOPS (4)MLONOPS … … 119 121 120 122 (p)ibm-aix-regatta | 121 (p)ibm-aix-meso | 250/628 160/1260 240/1120 123 (p)ibm-aix-meso | 250/1150 250/1500 250/1200 50/200 (32 bits) 124 | 280/1500 250/1600 250/1100 210/1000 (64 bits -q64) 125 126 (s)sgi-magique | 122 127 ---------------------------------------------------------------------------------------------- 123 128 124 129 X.3/ Comparaison avec SPEC 125 csh> cpupower 2130 csh> cpupower 0 / cpupower 2 126 131 ---------------------------------------------------------------------- 127 132 MFLOPS(1) SPECfp SPECint 128 133 ---------------------------------------------------------------------- 129 (b)xeon-lx-2.8GHz 9001400 1400130 (c)amd-lx 6901600 1300134 (b)xeon-lx-2.8GHz 166/900 1400 1400 135 (c)amd-lx 125/690 1600 1300 131 136 (cc)amd2-lx 675 1600 1300 132 137 133 (e)osf-xp1000 648 500 400 134 (f)superosf 842 1100 700 135 136 (f)G4-osx-1.25GHz 92/380 (-g)/(-O2 -g) 137 (i)core-osx-1.83GHz 310/880 1400 1500 (-g)/(-O2 -g) 138 (j)xeon-osx 2600 2900 - 139 140 (p)ibm-aix-regatta 730/1750 1050 700 (-O -g)/(-O3) 141 (p)ibm-aix-meso 1250/3600 (-O -g)/(-O3) 138 (e)osf-xp1000 32/650 500 400 139 (f)superosf 842 1100 700 140 141 (i)core-osx-1.83GHz 110/850 1400 1500 142 (j)xeon-osx 2600 2900 - 143 144 (p)ibm-aix-regatta 130/700 1050 700 142 145 ---------------------------------------------------------------------- 143 146 … … 187 190 B/ Calcul brut (Tableaux de SOPHYA) avec / sans threads 188 191 -------------------------------------------------------- 189 190 B.1/ Version corrige de zthr.cc (apres 23/05/07) 192 B.1.a/ Calcul sur vecteur 10 * V2 ~= DLO4 (V1) 193 ~ 10 x 10 x 9. 10^6 operations double sur 2 x 9 10^6 double 194 900 M.Ops r_8 / ~ 1500 MO 195 196 (1) time cpupower 0 # compile avec -O (/ -O -g) 197 (2) time zthr arrdl 1 3000 1 thread 198 (3) time zthr arrdl 2 3000 2 thread 199 (4) time zthr arrdl 4 3000 4 thread 200 (5) time zthr arrdl 6 3000 6 thread 201 (6) time zthr arrdl 8 3000 8 thread 202 203 ----------------------------------------------------------------------------------- 204 (1)MFLOPS (2)CPU/Elap/% (3)CPU/Elap/% (4)CPU/Elap/% 205 ----------------------------------------------------------------------------------- 206 (a)xeon-lx-2.4GHz 53 207 (b)xeon-lx-2.8GHz 65 2.6/2.6/100% 5.3/2.9/180% 14.3/4.86/310% 208 (5) 23/7.4/314% 209 (c)amd-lx 95 210 211 212 (e')osf-cool 32 5.7/5.8/98% 11.1/11.3/98% 22.3/22.5/98% 213 (f)superosf 214 215 (g)G5-osx-2GHz 88 2.5/2.6/99% 5.9/3.38/184% 11/6.45/173% [-O2 -g] 216 (h)G4-osx-1.25GHz 25 6.6/7/95% 13.4/13.8/97% [-O2 -g] 217 (i)core-osx-1.83GHz 107 2.1/2.1/98% 4.3/2.9/150% 8.3/30/31% [-O2 -g] 218 (j)xeon-osx 219 220 (p)ibm-aix-regatta 130 221 (q)ibm-aix-meso 150 0.8/1/81% 2./2./100% 3.6/3/122% [-O3] 222 (5) 5.4/4/140% (6) 7.7/4/192% 223 224 (s)sgi-magique 7 78/78/99% 167/95/175% 339/96/352% [-O -g: NON-OPT] 225 14 16.4/16.5/99% 33.8/22.4/150% 79/32/250% [-O -g2 OPT] 226 ----------------------------------------------------------------------------------- 227 228 B.1.b/ Calcul sur vecteur V2 = Sin(V1) + Cos(V1) 229 ~ 50 x 9. 10^6 operations double sur 2 x 9 10^6 double, mem ~ 150 MO 230 ~500 M.Ops r_8 / ~ 600 MO I/O 231 232 (1) time cpupower 0 # compile avec -O (/ -O -g) 233 (2) time zthr arrmf 1 3000 1 thread 234 (3) time zthr arrmf 2 3000 2 thread 235 (4) time zthr arrmf 4 3000 4 thread 236 (5) time zthr arrmf 6 3000 6 thread 237 (6) time zthr arrmf 8 3000 8 thread 238 239 ----------------------------------------------------------------------------------- 240 (1)MFLOPS (2)CPU/Elap/% (3)CPU/Elap/% (4)CPU/Elap/% 241 ----------------------------------------------------------------------------------- 242 (a)xeon-lx-2.4GHz 53 243 (b)xeon-lx-2.8GHz 65 1.7/1.7/100% 3.5/2.1/173% 9.8/3.6/275% 244 (c)amd-lx 95 245 246 (e')osf-cool 32 4.2/4.3/98% 8.2/8.4/98% 16.1/16.2/98% 247 (f)superosf 248 249 (g)G5-osx-2GHz 88 2.3/2.3/100% 5/3/165% 9.6/5.8/167% [-O2 -g] 250 (h)G4-osx-1.25GHz 25 4.5/4.8/95% 10.9/14.6/72% [-O2 -g] 251 (i)core-osx-1.83GHz 107 2.3/2.3/98% 4.8/3.1/158% [-O2 -g] 252 (j)xeon-osx 253 254 (p)ibm-aix-regatta 130 255 (q)ibm-aix-meso 150 1.7/2/87% 2.6/3/86% 5.4/5.4/100% [-O3] 256 (5) 9/6/150% (6) 11.5/15/76% 257 258 (s)sgi-magique 7 11.5/11.7/99% 24/17/140% 51.5/18.4/280% [-O -g NON-OPT] 259 14 6.5/6.6/99% 13.3/12/110% 34.5/17.3/200% [-O -g3 OPT] 260 ----------------------------------------------------------------------------------- 261 262 263 B.1.c/ Version corrige de zthr.cc (apres 23/05/07) 191 264 arr = (c1*a1) + (c2*a2) 192 265 ~ 3 x 4. 10^6 operations int_4 sur 3 x 4 10^6 int_4 … … 213 286 214 287 215 (d)osf-asc216 288 217 289 (e')osf-cool 32 0.43/1.2/35% 0.6/1.33/44% 1.1/1.3/82% [-O -g] … … 234 306 (5) 2.4/1.2/193% (6) 4.25/1.6/265% 235 307 236 237 (s)sgi-magique 460 238 ----------------------------------------------------------------------------------- 239 240 B.1.bis/ ancienne version de zthr (avant 23/05/07) 308 ----------------------------------------------------------------------------------- 309 310 B.1.x/ ancienne version de zthr (avant 23/05/07) 241 311 On faisait 2 multiplications par ctye suivi d'un produit matriciel ! 242 312 arr = c1*a1*c2*a2 ( ~ 3 10^6 op. double) … … 367 437 (6) 29.9/6.74/442% 368 438 369 (s)sgi-magique 4 60 49/49/99% 101/56/181%439 (s)sgi-magique 400 44/44.3/99% 96.5/55/176% 370 440 371 441 ----------------------------------------------------------------------------------- … … 432 502 433 503 (q)ibm-aix-meso 0.43 0.27 0.52 1.12/0.75 0.35 504 505 (s)sgi-magique 2.45 1.9 5.65 7.45/6.3 2.8 (-O -g3) 434 506 ----------------------------------------------------------------------------------- 435 507 … … 505 577 (q)ibm-aix-meso 0.55/~100% 506 578 579 (s)sgi-magique 5.3/~90% 507 580 -------------------------------------------------------------------------------------------- 508 581
Note:
See TracChangeset
for help on using the changeset viewer.