Questions & Answers

failed in install anaconda parcels and bootstrap saltmaster using AWS template

0 votes
asked Apr 24 by yusong0926 (240 points)
edited Apr 24 by yusong0926

I got an error message when bootstrap saltmaster using aws template. I think the errors were resulted from the following lines of code when it was trying to install Anaconda parcels:

def install_parcel(cloudera_manager, cluster, product, parcel_repo, parcel_version):

    for _ in xrange(120):

        try:

            parcel = cluster.get_parcel(product, parcel_version)

            break

        except Exception:

            logging.info("failed to get_parcel %s", parcel_version)

            time.sleep(5)

    logging.info("Got %s Parcel %s : Current State %s", product, parcel_version, parcel.stage)

error messages:

Stack is: CREATE_IN_PROGRESS

Bootstrapping saltmaster. Expect this to take a few minutes, check the debug log for progress (logs/pnda-cli.1492894564.52.log).

Bootstrapping other instances. Expect this to take a few minutes, check the debug log for progress (logs/pnda-cli.1492894564.52.log).

Running salt to install software. Expect this to take 45 minutes or more, check the debug log for progress (logs/pnda-cli.1492894564.52.log).

10.0.1.43     Failed:    1

Traceback (most recent call last):

  File "pnda-cli.py", line 820, in <module>

    main()

  File "pnda-cli.py", line 810, in main

    console_dns = create(template_data, pnda_cluster, flavor, keyname, no_config_check, branch)

  File "pnda-cli.py", line 433, in create

    cluster, saltmaster['private_ip_address'])

  File "pnda-cli.py", line 214, in ssh

    ret_val = subprocess_to_log.call(parts, LOG, host, scan_for_errors=[r'lost connection', r'\s*Failed:\s*[1-9].*'])

  File "/Users/syu/mariner/pnda/release/3.4/cli/subprocess_to_log.py", line 29, in call

    fetch_child_output()

  File "/Users/syu/mariner/pnda/release/3.4/cli/subprocess_to_log.py", line 26, in fetch_child_output

    raise Exception(msg_with_id)

Exception: 10.0.1.43     Failed:    1

Total execution time: 0:29:43.318795

pnda-cli logs

2017-04-23 18:50:13,025 - INFO - 10.0.1.149     ----------

2017-04-23 18:50:13,025 - INFO - 10.0.1.149               ID: cdh-execute_cloudera_installation_script

2017-04-23 18:50:13,025 - INFO - 10.0.1.149         Function: cmd.run

2017-04-23 18:50:13,025 - INFO - 10.0.1.149             Name: /tmp/pnda-install/cdh.setup_hadoop/venv/bin/python /tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py

2017-04-23 18:50:13,025 - INFO - 10.0.1.149           Result: False

2017-04-23 18:50:13,025 - INFO - 10.0.1.149          Comment: Command "/tmp/pnda-install/cdh.setup_hadoop/venv/bin/python /tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py" run

2017-04-23 18:50:13,025 - INFO - 10.0.1.149          Started: 00:34:33.758792

2017-04-23 18:50:13,026 - INFO - 10.0.1.149         Duration: 939069.718 ms

2017-04-23 18:50:13,026 - INFO - 10.0.1.149          Changes:

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                   ----------

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                   pid:

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                       28228

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                   retcode:

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                       1

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                   stderr:

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                       Traceback (most recent call last):

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                         File "/tmp/pnda-install/cdh.setup_hadoop/cloudera_config.py", line 52, in <module>

2017-04-23 18:50:13,026 - INFO - 10.0.1.149                           anaconda_repo='https://repo.continuum.io/pkgs/misc/parcels/', anaconda_version='4.0.0')

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                         File "/tmp/pnda-install/cdh.setup_hadoop/cm_setup.py", line 714, in setup_hadoop

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                           install_parcel(cloudera_manager, cluster, 'Anaconda', anaconda_repo, anaconda_version)

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                         File "/tmp/pnda-install/cdh.setup_hadoop/cm_setup.py", line 227, in install_parcel

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                           logging.info("Got %s Parcel %s : Current State %s", product, parcel_version, parcel.stage)

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                       UnboundLocalError: local variable 'parcel' referenced before assignment

2017-04-23 18:50:13,027 - INFO - 10.0.1.149                   stdout:

2017-04-23 18:50:13,027 - INFO - 10.0.1.149

2017-04-23 18:50:13,027 - INFO - 10.0.1.149     Summary for pnda-horizon2-cdh-edge

2017-04-23 18:50:13,027 - INFO - 10.0.1.149     ------------

2017-04-23 18:50:13,027 - INFO - 10.0.1.149     Succeeded: 7 (changed=7)

2017-04-23 18:50:13,028 - INFO - 10.0.1.149     Failed:    1

2017-04-23 18:50:13,028 - ERROR - 10.0.1.149     Failed:  

1 Answer

0 votes
answered Apr 24 by James Clarke (1,630 points)

The version of Anaconda at the location we specify by default (https://repo.continuum.io/pkgs/misc/parcels/) has changed from 4.0.0 to 4.1.1. You can fix this by setting anaconda: parcel_repo to https://repo.continuum.io/pkgs/misc/parcels/archive in the salt pillar (platform-salt/pillar/services.sls).

...