[Eeglablist] Running AMICA in a super-computer
Jason Palmer
japalmer29 at gmail.com
Tue Jul 25 12:14:37 PDT 2017
Hi Tyler,
Are you running this on the SCCN cluster or on your own cluster? Could you
tell me what command you are using to run AMICA in MATLAB and, if it is your
own cluster setup, which mpirun or mpiexec command is being run, whether the
scheduler is SGE or Torque, and which qsub script, if any, is being used?
Could you also send the first lines of the stdout generated by AMICA, where
it identifies the processes and their respective nodes?
I wonder if you are actually running multiple (overlapping) copies of the
same run on several nodes, as can happen with the mpirun command in some
environment setups, instead of having the processes run in concert and
communicate with each other.
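One rough way to look for overlapping copies is to check whether the same iteration number is reported more than once in a single captured stdout file. The sketch below assumes that all copies write to the same stdout stream and that iteration lines look like the "iter N lrate = ..." lines quoted further down; the function name and sample text are made up for illustration. If each copy writes to its own file, this check will not show anything.

```python
import re
from collections import Counter

# Matches progress lines such as "iter 258 lrate = 1.0000000000 LL = ..."
ITER_RE = re.compile(r"^\s*iter\s+(\d+)\s+lrate")


def repeated_iterations(log_text):
    """Return {iteration: count} for iterations reported more than once."""
    counts = Counter()
    for line in log_text.splitlines():
        match = ITER_RE.match(line)
        if match:
            counts[int(match.group(1))] += 1
    return {it: n for it, n in counts.items() if n > 1}


# Hypothetical excerpt in which iteration 258 is printed twice.
sample_stdout = """\
iter 258 lrate = 1.0000000000 LL = -0.0238247322 nd = 0.0001777274
iter 258 lrate = 1.0000000000 LL = -0.0238247322 nd = 0.0001777274
iter 259 lrate = 1.0000000000 LL = -0.0238213053 nd = 0.0001768634
"""

duplicates = repeated_iterations(sample_stdout)
print(duplicates)
```

An empty result does not prove that only one copy is running; the process and node lines at the top of the stdout remain the more direct evidence.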
Best,
Jason
From: eeglablist [mailto:eeglablist-bounces at sccn.ucsd.edu] On Behalf Of
Tyler Grummett
Sent: Monday, July 24, 2017 9:00 AM
To: EEGLABLIST
Subject: [Eeglablist] Running AMICA in a super-computer
Dear eeglab,
Recently I have been facing a problem when running multiple AMICA jobs on a
supercomputer at once, where I get the following error:
...
iter 258 lrate = 1.0000000000 LL = -0.0238247322 nd = 0.0001777274, D = 0.89129E-01 0.89129E-01 ( 33.34 s, 182.8 h)
iter 259 lrate = 1.0000000000 LL = -0.0238213053 nd = 0.0001768634, D = 0.89103E-01 0.89103E-01 ( 34.18 s, 187.4 h)
iter 260 lrate = 1.0000000000 LL = -0.0238178712 nd = 0.0001762253, D = 0.89076E-01 0.89076E-01 ( 33.08 s, 181.4 h)
forrtl: Permission denied
forrtl: severe (28): CLOSE error, unit 19, file "Unknown"
Image PC Routine Line Source
amica15ub 00000000010D4003 Unknown Unknown Unknown
amica15ub 00000000010D1A6D Unknown Unknown Unknown
amica15ub 0000000000441780 Unknown Unknown Unknown
amica15ub 0000000000419E08 Unknown Unknown Unknown
amica15ub 00000000004021DE Unknown Unknown Unknown
amica15ub 000000000118C1A4 Unknown Unknown Unknown
amica15ub 00000000004020C1 Unknown Unknown Unknown
However, this error doesn't generate an error in MATLAB, so MATLAB tries to
proceed and uses the ICA weights from the most recent iteration. Obviously
this isn't ideal, as AMICA requires a lot of iterations to get good ICs.
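Since the crash is not reported back to MATLAB, one possible safeguard is to inspect the captured AMICA stdout before trusting the weights. The following is a minimal sketch, assuming the stdout has been saved as text and that progress and error lines look like the ones quoted above; the function name and the expected iteration count of 2000 are hypothetical and should be replaced with whatever the job was configured to run.

```python
import re

# Progress lines such as "iter 260 lrate = 1.0000000000 LL = ..."
ITER_RE = re.compile(r"^\s*iter\s+(\d+)\s+lrate")
# Fortran runtime errors such as "forrtl: severe (28): CLOSE error, ..."
FATAL_RE = re.compile(r"^\s*forrtl:\s+severe\s+\((\d+)\)")


def check_amica_log(log_text, expected_iters):
    """Return (ok, last_iter, fatal_lines) for a captured AMICA stdout."""
    last_iter = 0
    fatal_lines = []
    for line in log_text.splitlines():
        match = ITER_RE.match(line)
        if match:
            last_iter = int(match.group(1))
        elif FATAL_RE.match(line):
            fatal_lines.append(line.strip())
    # The run counts as usable only if no fatal error was logged and the
    # last reported iteration reached the expected count.
    ok = not fatal_lines and last_iter >= expected_iters
    return ok, last_iter, fatal_lines


sample_log = """\
iter 259 lrate = 1.0000000000 LL = -0.0238213053 nd = 0.0001768634
iter 260 lrate = 1.0000000000 LL = -0.0238178712 nd = 0.0001762253
forrtl: Permission denied
forrtl: severe (28): CLOSE error, unit 19, file "Unknown"
"""

# expected_iters=2000 is an illustrative value, not taken from the run above.
ok, last_iter, fatal_lines = check_amica_log(sample_log, expected_iters=2000)
print(ok, last_iter, fatal_lines)
```

A run that converges early would also stop short of the expected count, so a stricter or looser rule may be needed depending on the stopping criteria in use.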
Does anyone have any idea how to fix this or find a workaround? I can
confirm that no two jobs use the same temporary file, temporary output
folder, or any of the functions that are run in AMICA. They are all copied
to their own folder and run independently. At least that's what I hope is
happening.
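One way to make the per-job isolation explicit is to create the working folder with a name that cannot collide, even if two jobs are given the same label. This is only a sketch under assumptions: the function name, the "amica_" prefix, and the base directory are hypothetical, and the job label could come from a scheduler variable such as JOB_ID under SGE or PBS_JOBID under Torque.

```python
import os
import tempfile


def make_job_dir(base_dir, job_label):
    """Create and return a fresh working directory for one AMICA job."""
    os.makedirs(base_dir, exist_ok=True)
    # mkdtemp appends a random suffix, so two jobs with the same label
    # still receive different directories.
    return tempfile.mkdtemp(prefix="amica_{}_".format(job_label), dir=base_dir)


# Hypothetical base directory used only for this demonstration.
demo_base = os.path.join(tempfile.gettempdir(), "amica_demo_jobs")
first_dir = make_job_dir(demo_base, "job1")
second_dir = make_job_dir(demo_base, "job1")
print(first_dir)
print(second_dir)
```

This does not by itself explain the "Permission denied" error on CLOSE; it only rules out two jobs sharing an output folder.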
I'm running out of things to try.
Regards,
Tyler
*************************
Tyler Grummett ( BBSc, BSc(Hons I))
PhD Candidate
Brain Signals Laboratory
Flinders University
Rm 5A301
Ext 66125