MemJobs

2017-03-22T09:54:33Z

Edith Knoops:

MemJobs

2017-03-22T09:39:00Z

Edith Knoops:

MemJobs

2017-03-22T09:36:30Z

Edith Knoops: /* ATLAS */

2009-02-23T09:56:42Z

2009-01-30T11:20:33Z

Edith Knoops: /* Comments and Errors follow-up */

30.01.09

== Comments and Errors follow-up ==
*http://gangarobot.cern.ch/st/test_124/
*http://gangarobot.cern.ch/st/test_125/
'''Note that ATLAS Production was ON on the FR-Cloud on January 29'''

* IN2P3-LPC_MCDISK: f(w) - Errors due to the load induced by the MC production running at that time. The ST test jobs (2 x 50 jobs added) were then aborted, with the following reasons logged by the WMS:
** Got a job held event, reason: Unspecified gridmanager error
** Job got an error while in the CondorG queue.
*: The submission to the batch system failed because the '''maximum number of jobs accepted in the queue by the site was reached''': the atlas queue has max_queuable = 200 in the batch system (attribute 'GlueCEPolicyMaxTotalJobs' on the queue).
 Jan 29 23:54:46 clrlcgce03 gridinfo: [25608-30993] Job 1233269583: lcgpbs:internal_ FAILED during submission to batch system lcgpbs
 01/29/2009 23:55:07;0080;PBS_Server;Req;req_reject;Reject reply code=15046(Maximum number of jobs already in queue), aux=0..

*IN2P3-CPPM_MCDISK: The same problem as in the previous test: jobs run forever with the error "send2dpm: DP000 - disk pool manager not running on marwn04.in2p3.fr". This happened for 13 jobs, which all started running at nearly the same time (Thu Jan 29 22:37:53) and went into error around Jan 30 00:21. I have put the stdout and stderr of two of these jobs here:
http://marwww.in2p3.fr/~knoops/752629.marce01.in2p3.fr/
http://marwww.in2p3.fr/~knoops/752631.marce01.in2p3.fr/

The load on the local DPM server was around 9 at that time.
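
The PBS server rejection quoted above for IN2P3-LPC_MCDISK can be picked out of a log automatically. The following is only a minimal sketch: the regular expression is inferred from the single log line shown on this page, not from any PBS documentation, and the function name <code>parse_reject</code> is hypothetical.

```python
import re

# Pattern inferred from the one PBS_Server line quoted on this page (assumption).
REJECT_RE = re.compile(
    r"req_reject;Reject reply code=(?P<code>\d+)"
    r"\((?P<reason>[^)]*)\), aux=(?P<aux>\d+)"
)


def parse_reject(line):
    """Return the reject code, reason and aux value of a req_reject line, or None."""
    match = REJECT_RE.search(line)
    if match is None:
        return None
    return {
        "code": int(match.group("code")),
        "reason": match.group("reason"),
        "aux": int(match.group("aux")),
    }


# The log line from the report, joined back onto a single line.
line = (
    "01/29/2009 23:55:07;0080;PBS_Server;Req;req_reject;"
    "Reject reply code=15046(Maximum number of jobs already in queue), aux=0.."
)
info = parse_reject(line)
print(info)
```

Counting such entries per hour would show how long the atlas queue stayed at its max_queuable limit while production was running.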

Atlas:Analysis ST 2009 Errors

2009-01-30T11:18:22Z

Edith Knoops: /* Comments and Errors follow-up */


CPU-Benches

2008-09-16T13:19:11Z

Edith Knoops:

French sites can get access to the SPEC CPU2000 suite (a license is available for the project).

CPU performance tests were carried out at LAPP (Eric Fede) and at Subatech (Jean-Michel Barbet) with
the SPEC CPU2000 suite, following the recommended methodology: http://hepix.caspur.it/processors/
Tests from CPPM were added later (Edith Knoops).

----

Tests carried out by Jean-Michel:

Scientific-Linux V4.3 i386, gcc v3.4.5

The other test conditions are available on request. Note that the tests were run on machines in their operational grid configuration (with all daemons running).

{| class="wikitable" style="text-align:center" border="1" cellpadding="5" cellspacing="0"
|+
|-
! style="background:#efefef;" | Machine !! CPU !! Cores !! RAM !! CERN KSI2K/core [1] !! FZK KSI2K/core [3] !! Corrected CERN KSI2K/core [2]
|-
| Dell 5160 || Woodcrest 3.00GHz || 4 || 8 GB || 1409 || 1830 || 2113
|-
| IBM || Clovertown 2.33GHz || 8 || 16 GB || 979 || 1875 || 1468
|-
| Dell Optiplex || Pentium4 3.20GHz || 1 || 2 GB || 872 || 1128 || 1308
|-
|}

Dell PowerEdge 1955 Woodcrest 5160 : http://www.spec.org/osg/cpu2000/results/res2006q3/cpu2000-20060626-06298.html

IBM 3550 Woodcrest 5160 : http://www.spec.org/osg/cpu2000/results/res2006q3/cpu2000-20060623-06219.html

IBM 3350 Clovertown E5345 : http://www.spec.org/osg/cpu2000/results/res2006q4/cpu2000-20061113-07918.html

----

Tests carried out at LAPP (Eric):

Scientific-Linux V3.08 i386, gcc v3.4.3

Note that the tests were run on machines with all non-essential services disabled.

{| class="wikitable" style="text-align:center" border="1" cellpadding="5" cellspacing="0"
|+
|-
! style="background:#efefef;" | Machine !! CPU !! Cores !! RAM !! CERN KSI2K/core [1] !! FZK KSI2K/core [3] !! Corrected CERN KSI2K/core [2]
|-
| HP BL 460c || Woodcrest 2.66GHz || 4 || 8 GB || 1367 || 1665 || 2050
|-
|}

Scientific-Linux V4.5 x86_64, gcc v3.4

Note that the tests were run on machines with all non-essential services disabled. Even "simple" tuning at the BIOS level produces differences of more than 10% in the benchmark results.

{| class="wikitable" style="text-align:center" border="1" cellpadding="5" cellspacing="0"
|+
|-
! style="background:#efefef;" | Machine !! CPU !! Cores !! RAM !! CERN KSI2K/core [1] !! FZK KSI2K/core [3] !! Corrected CERN KSI2K/core [2]
|-
| HP BL 460c || Woodcrest 2.66GHz || 4 || 8 GB || 1485 || 1592 || 2227
|-
| HP BL 460c || Clovertown 2.33GHz || 8 || 16 GB || 1225 || x || 1837
|-
| DELL 1950 || Intel 5335 2GHz || 8 || 8 GB || 1060 || 1139 || 1590
|-
|}

[1]: Average of 3 successive runs with the CERN optimizations, each run starting one CPU2000 benchmark per CPU core.

[2]: Correction: the average value above + 50%. This is the value to publish through the grid information system.

[3]: A single run with the FZK optimizations, given for information only.
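
Notes [1] and [2] amount to a small calculation: average the three CERN runs, then add 50%. The sketch below illustrates it under stated assumptions: the function name <code>corrected_ksi2k</code> is hypothetical, and the three per-run scores are invented so that they average to 1060 (the DELL 1950 value in the table), since the individual runs are not published on this page.

```python
from statistics import mean

# Note [2]: the published value is the average value + 50%.
CORRECTION_FACTOR = 1.5


def corrected_ksi2k(runs):
    """Return (average, corrected) KSI2K/core for a list of per-run scores."""
    average = mean(runs)
    return average, average * CORRECTION_FACTOR


# Hypothetical per-run scores (assumption), chosen only to average to 1060.
average, corrected = corrected_ksi2k([1050, 1060, 1070])
print(average, corrected)
```

With an average of 1060 this gives 1590, matching the corrected column for that machine. Some other rows differ from a plain 1.5 factor by a unit or two, presumably because the published averages were rounded before the tables were written.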
----

Tests carried out at CPPM (Edith):

Scientific-Linux 4.6 x86_64, gcc v3.4.6

Note that the tests were run on machines with all non-essential services disabled.

{| class="wikitable" style="text-align:center" border="1" cellpadding="5" cellspacing="0"
|+
|-
! style="background:#efefef;" | Machine !! CPU !! Cores !! RAM !! CERN KSI2K/core !! FZK KSI2K/core !! CERN32 KSI2K/core !! Corrected CERN KSI2K/core
|-
| HP DL145 || Opteron 250 2.4GHz || 2 || 4 GB || 1149 || 1278 || 972 || 1725
|-
| SUN || Opteron 250 2.4GHz || 2 || 4 GB || 1173 || 1298 || 988 || 1760
|-
| HP DL145G2 || Opteron 275 2.2GHz || 4 || 6 GB || 981 || 1087 || 857 || 1471
|-
| DELL || Opteron 2218 2.6GHz || 4 || 8 GB || 1199 || 1073 || 1045 || 1798
|-
| DELL || Xeon E5420 2.5GHz || 8 || 16 GB || 1418 || 1528 || x || 2128
|-
|}