DLWorkspace

Deploy a DL workspace cluster on Ubuntu.

This document describes how to deploy a DL workspace cluster on a single Ubuntu node. The target deployment node can be either your local machine or a remote node.

  1. [Run Once] Set up the development environment.

  2. Please make sure that the deployment node satisfies the following:
    • It runs Ubuntu, preferably 16.04.
    • It has a “core” account that you can ssh into.
    • The “core” account can use sudo to gain root privileges. Please follow the instructions to set up sudo without a password.
    • Please turn off the apache2 server if it is running:
      sudo systemctl stop apache2
      sudo systemctl disable apache2
      

      DL workspace runs WebUI on port 80, which conflicts with apache2.
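
The requirements in step 2 can be checked before deployment with a small preflight script. The following is only a sketch, assuming it is run as the “core” user on the deployment node; the function names (`is_ubuntu`, `os_version`, `has_passwordless_sudo`, `apache2_is_active`, `preflight`) are hypothetical helpers and are not part of the DL workspace scripts.

```shell
#!/bin/sh
# Preflight sketch for the deployment node (hypothetical helpers,
# not part of DL workspace). Run as the "core" user.

# Succeeds if the os-release file (default: /etc/os-release) reports Ubuntu.
is_ubuntu() {
    ( . "${1:-/etc/os-release}" && [ "$ID" = "ubuntu" ] ) 2>/dev/null
}

# Prints the VERSION_ID from the os-release file, e.g. 16.04.
os_version() {
    ( . "${1:-/etc/os-release}" && printf '%s\n' "$VERSION_ID" ) 2>/dev/null
}

# Succeeds if sudo works without prompting for a password.
# Note: a cached sudo timestamp also makes this check pass.
has_passwordless_sudo() {
    sudo -n true 2>/dev/null
}

# Succeeds if the apache2 service is currently active.
apache2_is_active() {
    systemctl is-active --quiet apache2 2>/dev/null
}

# Prints one status line per requirement; never aborts the shell.
preflight() {
    if is_ubuntu; then
        echo "OK:   Ubuntu $(os_version) detected (16.04 is preferred)"
    else
        echo "WARN: this node does not look like Ubuntu"
    fi
    if has_passwordless_sudo; then
        echo "OK:   sudo works without a password"
    else
        echo "WARN: sudo asks for a password or is not available"
    fi
    if apache2_is_active; then
        echo "WARN: apache2 is running and conflicts with the WebUI on port 80"
    else
        echo "OK:   apache2 is not running"
    fi
}

preflight
```

The script only reports its findings; it does not change the node, so it is safe to run repeatedly while fixing each warning.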

  3. Configure the cluster. You will need the following at a minimum:

    • cluster_name
    • An existing SQL or Azure SQL database, or you can create one using scripts. Please note that Azure SQL charges may apply.
    • Authentication
    • Any shared file system. For single node, you can use a local drive for jobs.
  4. Describe the servers used in the cluster. Please write the following entries in config.yaml:
  ssh_cert: ~/.ssh/id_rsa 

  machines:
    <<hostname_of_the_machine>>:
      role: infrastructure

  etcd_node_num: 1

If you use a domain, please add:

  network:
    domain: <<domain_of_the_machine>>

Please don’t use domain: “”. With an empty domain, the script appends a “.” to the hostname, which causes the scripts to fail.
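
The empty-domain pitfall is easy to catch before running the deployment. The following is a minimal sketch, assuming the configuration lives in config.yaml in the current directory (or at the path given as the first argument); `has_empty_domain` is a hypothetical helper, not part of the DL workspace scripts, and it only looks for a quoted empty string after `domain:`.

```shell
#!/bin/sh
# Hypothetical check, not part of the DL workspace scripts.

# Succeeds if the file contains a "domain:" key whose value is an
# empty quoted string, written either as "" or as ''.
has_empty_domain() {
    grep -Eq "^[[:space:]]*domain:[[:space:]]*(\"\"|'')[[:space:]]*\$" "$1"
}

config="${1:-config.yaml}"
if [ -f "$config" ] && has_empty_domain "$config"; then
    echo "WARN: $config sets an empty domain; remove the entry instead" >&2
fi
```

The check is a plain text match rather than a YAML parse, so it does not handle trailing comments or other unusual layouts.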

  5. Run deployment script block:
      ./deploy.py --verbose scriptblocks ubuntu_uncordon