Replies: 1 comment 5 replies
-
Often on a large parallel machine they have a queuing system where you provide a shell script that is run on every node to launch the job, in which case the shell script can activate the environment you want. Alternatively, you might be able to set up your shell You should ask your sysadmin how to do this on your cluster system. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello again friends from the MEEP community,
I have been using parallel meep for a couple of months on my local computer. Now, I have access to more computational resources but these resources are clustered in different nodes.
I have set some basic configuration for sharing the same simulation file between my two nodes and I have the same mpi based meep implementation installed in both of my nodes. (I have also configured the ssh connection between both nodes in such a way that I can connect between both machines in a password-less manner)
The issue is that I have installed the parallel meep python implementation through the conda package manager. As you know, in order to use then this implementation it is needed to activate the conda environment containing the meep parallel python implementation in connection with mpich (with access to mpirun).
Is it possible to run a parallel simulation in this manner using my two nodes? How do I guarantee that the proper conda environment will be started at my "remote"node once I call mpirun from my "localhost" computer?
I have tried running some test under this configuration by using:
mpirun -np 32 -hosts worker,localhost python3.11 dip_flat_GaAs_pos_sweep_ext_eff_NA.py
But I get the following error:
[proxy:0:0@comp-power-2] HYDU_sock_connect (utils/sock/sock.c:141): unable to connect from "comp-power-2" to "localhost" (Connection refused)
comp-power-2 is my "remote"(worker) node and localhost is the computer from where I am running the mpirun command.
If you have any reference on how to properly implement such a simulation in this case (using the conda version of parallel meep), I would appreciate your help and feedback.
Beta Was this translation helpful? Give feedback.
All reactions