I got an error message when bootstrap saltmaster using aws template. I think the errors were resulted from the following lines of code when it was trying to install Anaconda parcels:
def install_parcel(cloudera_manager, cluster, product, parcel_repo, parcel_version):
for _ in xrange(120):
try:
parcel = cluster.get_parcel(product, parcel_version)
break
except Exception:
logging.info("failed to get_parcel %s", parcel_version)
time.sleep(5)
logging.info("Got %s Parcel %s : Current State %s", product, parcel_version, parcel.stage)
error messages:
Stack is: CREATE_IN_PROGRESS
Bootstrapping saltmaster. Expect this to take a few minutes, check the debug log for progress (logs/pnda-cli.1492894564.52.log).
Bootstrapping other instances. Expect this to take a few minutes, check the debug log for progress (logs/pnda-cli.1492894564.52.log).
Running salt to install software. Expect this to take 45 minutes or more, check the debug log for progress (logs/pnda-cli.1492894564.52.log).
10.0.1.43 Failed: 1
Traceback (most recent call last):
File "pnda-cli.py", line 820, in <module>
main()
File "pnda-cli.py", line 810, in main
console_dns = create(template_data, pnda_cluster, flavor, keyname, no_config_check, branch)
File "pnda-cli.py", line 433, in create
cluster, saltmaster['private_ip_address'])
File "pnda-cli.py", line 214, in ssh
ret_val = subprocess_to_log.call(parts, LOG, host, scan_for_errors=[r'lost connection', r'\s*Failed:\s*[1-9].*'])
File "/Users/syu/mariner/pnda/release/3.4/cli/subprocess_to_log.py", line 29, in call
fetch_child_output()
File "/Users/syu/mariner/pnda/release/3.4/cli/subprocess_to_log.py", line 26, in fetch_child_output
raise Exception(msg_with_id)
Exception: 10.0.1.43 Failed: 1
Total execution time: 0:29:43.318795
pnda-cli logs
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 ----------
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 ID: cdh-execute_cloudera_installation_script
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 Function: cmd.run
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 Name: /tmp/pnda-install/cdh.setup_hadoop/venv/bin/python /tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 Result: False
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 Comment: Command "/tmp/pnda-install/cdh.setup_hadoop/venv/bin/python /tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py" run
2017-04-23 18:50:13,025 - INFO - 10.0.1.149 Started: 00:34:33.758792
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 Duration: 939069.718 ms
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 Changes:
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 ----------
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 pid:
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 28228
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 retcode:
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 1
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 stderr:
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 Traceback (most recent call last):
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 File "/tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py", line 52, in <module>
2017-04-23 18:50:13,026 - INFO - 10.0.1.149 anaconda_repo='https://repo.continuum.io/pkgs/misc/parcels/', anaconda_version='4.0.0')
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 File "/tmp/pnda-install/cdh.setup_hadoop/cm_setup.py", line 714, in setup_hadoop
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 install_parcel(cloudera_manager, cluster, 'Anaconda', anaconda_repo, anaconda_version)
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 File "/tmp/pnda-install/cdh.setup_hadoop/cm_setup.py", line 227, in install_parcel
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 logging.info("Got %s Parcel %s : Current State %s", product, parcel_version, parcel.stage)
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 UnboundLocalError: local variable 'parcel' referenced before assignment
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 stdout:
2017-04-23 18:50:13,027 - INFO - 10.0.1.149
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 Summary for pnda-horizon2-cdh-edge
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 ------------
2017-04-23 18:50:13,027 - INFO - 10.0.1.149 Succeeded: 7 (changed=7)
2017-04-23 18:50:13,028 - INFO - 10.0.1.149 Failed: 1
2017-04-23 18:50:13,028 - ERROR - 10.0.1.149 Failed: