Troubleshooting
This section describes some common problems developers have run into and some suggested solutions.
Solver errors when configuring conda environment
When setting up polaris conda environment, spack environment, compilers and system modules, by calling:
./configure_polaris_env.sh ...
you may run into an error like:
Encountered problem while solving:
- nothing provides geos 3.5.* needed by cartopy-0.14.3-np110py27_4
...
subprocess.CalledProcessError: ...
Details of the error may vary but the message indicates in some way that there
was a problem solving for the requested combination of packages. This likely
indicates that you have an existing polaris development environment
(dev_polaris_*
) that can’t be updated to be compatible with the new
set of development packages given in:
deploy_tmp/build*/spec-file*.txt
The solution should be to recreate the environment rather than trying to update it:
./configure_polaris_env.sh --recreate ...
The --recreate
flag will first delete the existing dev_polaris_*
conda
environment before creating it again with the new set of packages required for
developing with the requested compiler and MPI type.
Proxy on LANL Macs
If you are on a LANL Mac using the VPN and you see errors like:
An HTTP error occurred when trying to retrieve this URL.
this likely indicates that you need to set some proxy-related variables.
From a LANL internal site:
Bash, and other shells do not honor the system wide proxy settings on the Mac. There are a couple of ways for a person to manage these settings.
First, would be to the following will work in one’s
~/.profile
- that way every time a shell is spawned with your username, these settings will be enabled:proxy_enable() { export ALL_PROXY=proxyout.lanl.gov:8080 export all_proxy=proxyout.lanl.gov:8080 export NO_PROXY=localhost,127.0.0.1,*.lanl.gov,lanl.gov export no_proxy=localhost,127.0.0.1,*.lanl.gov,lanl.gov } proxy_disable() { unset ALL_PROXY unset all_proxy unset NO_PROXY unset no_proxy } proxy_enable
If you follow this approach, you would then call proxy_disable
anytime you
want to turn off the proxy (e.g. if the VPN is not running).
gethostbyname failed
If you see errors like the following (particularly on LANL Macs):
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(159)..............:
MPID_Init(164).....................: channel initialization failed
MPIDI_CH3_Init(95).................:
MPID_nem_init(314).................:
MPID_nem_tcp_init(173).............:
MPID_nem_tcp_get_business_card(395):
GetSockInterfaceAddr(369)..........: gethostbyname failed, pn2034311.lanl.gov (errno 0)
this likely indicates that MPI is having a problem finding the local host.
The solution is to set the following config option in the parallel
section
of your user config file:
# The parallel section describes options related to running tasks in parallel
[parallel]
# whether to use mpirun or srun to run the model
parallel_executable = mpirun -host localhost
The example config file has been updated to include this flag.