This tutorial shows you how to deploy Sourcegraph via Docker Compose to a single node running on Google Cloud.
NOTE: Trying to decide how to deploy Sourcegraph? See our recommendations for how to chose a deployment type that suits your needs.
- Open your Google Cloud console to create a new VM instance and click Create Instance
- Choose an appropriate machine type (use the resource estimator to find a good starting point for your deployment).
- Under the "Boot Disk" options, select the following:
- Operating System: Ubuntu
- Version: Ubuntu 18.04 LTS
- Boot disk type: SSD persistent disk
- Check the boxes for Allow HTTP traffic and Allow HTTPS traffic in the Firewall section
- Open the Management, disks, networking, and SSH keys dropdown section
WARNING: To configure your Sourcegraph instance, you must create and use a fork of the reference repository - refer to Configuring Sourcegraph with Docker Compose for more details. Then update the following variables in the script below:
DEPLOY_SOURCEGRAPH_DOCKER_FORK_CLONE_URL: Your fork's git clone URLDEPLOY_SOURCEGRAPH_DOCKER_FORK_REVISION: The git revision containing your fork's customizations to the base Sourcegraph Docker Compose YAML. Most likely,DEPLOY_SOURCEGRAPH_DOCKER_FORK_REVISION='release'if you followed our branching recommendations in the Configuration guide
- Under the Management section, add the following in the Startup script field:
#!/usr/bin/env bash
set -euxo pipefail
PERSISTENT_DISK_DEVICE_NAME='/dev/sdb'
DOCKER_DATA_ROOT='/mnt/docker-data'
DOCKER_COMPOSE_VERSION='1.25.3'
DEPLOY_SOURCEGRAPH_DOCKER_CHECKOUT='/root/deploy-sourcegraph-docker'
# 🚨 Update these variables with the correct values from your fork!
DEPLOY_SOURCEGRAPH_DOCKER_FORK_CLONE_URL='https://github.com/sourcegraph/deploy-sourcegraph-docker.git'
DEPLOY_SOURCEGRAPH_DOCKER_FORK_REVISION='v3.30.1'
# Install git
sudo apt-get update -y
sudo apt-get install -y git
# Clone Docker Compose definition
git clone "${DEPLOY_SOURCEGRAPH_DOCKER_FORK_CLONE_URL}" "${DEPLOY_SOURCEGRAPH_DOCKER_CHECKOUT}"
cd "${DEPLOY_SOURCEGRAPH_DOCKER_CHECKOUT}"
git checkout "${DEPLOY_SOURCEGRAPH_DOCKER_FORK_REVISION}"
# Format (if necessary) and mount GCP persistent disk
device_fs=$(sudo lsblk "${PERSISTENT_DISK_DEVICE_NAME}" --noheadings --output fsType)
if [ "${device_fs}" == "" ] ## only format the volume if it isn't already formatted
then
sudo mkfs.ext4 -m 0 -E lazy_itable_init=0,lazy_journal_init=0,discard "${PERSISTENT_DISK_DEVICE_NAME}"
fi
sudo mkdir -p "${DOCKER_DATA_ROOT}"
sudo mount -o discard,defaults "${PERSISTENT_DISK_DEVICE_NAME}" "${DOCKER_DATA_ROOT}"
# Mount GCP disk on reboots
DISK_UUID=$(sudo blkid -s UUID -o value "${PERSISTENT_DISK_DEVICE_NAME}")
sudo echo "UUID=${DISK_UUID} ${DOCKER_DATA_ROOT} ext4 discard,defaults,nofail 0 2" >> '/etc/fstab'
umount "${DOCKER_DATA_ROOT}"
mount -a
# Install Docker
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -
sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable"
sudo apt-get update -y
apt-cache policy docker-ce
apt-get install -y docker-ce docker-ce-cli containerd.io
# Install jq for scripting
sudo apt-get update -y
sudo apt-get install -y jq
# Edit Docker storage directory to mounted volume
DOCKER_DAEMON_CONFIG_FILE='/etc/docker/daemon.json'
## initialize the config file with empty json if it doesn't exist
if [ ! -f "${DOCKER_DAEMON_CONFIG_FILE}" ]
then
mkdir -p $(dirname "${DOCKER_DAEMON_CONFIG_FILE}")
echo '{}' > "${DOCKER_DAEMON_CONFIG_FILE}"
fi
## update Docker's 'data-root' to point to our mounted disk
tmp_config=$(mktemp)
trap "rm -f ${tmp_config}" EXIT
sudo cat "${DOCKER_DAEMON_CONFIG_FILE}" | sudo jq --arg DATA_ROOT "${DOCKER_DATA_ROOT}" '.["data-root"]=$DATA_ROOT' > "${tmp_config}"
sudo cat "${tmp_config}" > "${DOCKER_DAEMON_CONFIG_FILE}"
## finally, restart Docker daemon to pick up our changes
sudo systemctl restart --now docker
# Install Docker Compose
curl -L "https://github.com/docker/compose/releases/download/${DOCKER_COMPOSE_VERSION}/docker-compose-$(uname -s)-$(uname -m)" -o /usr/local/bin/docker-compose
chmod +x /usr/local/bin/docker-compose
curl -L "https://raw.githubusercontent.com/docker/compose/${DOCKER_COMPOSE_VERSION}/contrib/completion/bash/docker-compose" -o /etc/bash_completion.d/docker-compose
# Run Sourcegraph. Restart the containers upon reboot.
cd "${DEPLOY_SOURCEGRAPH_DOCKER_CHECKOUT}"/docker-compose
docker-compose up -d-
Under the Disks section, click Add new disk and add a disk (for storing Docker data) with the following settings:
- Type: SSD Persistent Disk
- Description: "Disk for storing Docker data for Sourcegraph" (or something similarly descriptive)
- (optional, recommended) Snapshot schedule: The most straightfoward way of automatically backing Sourcegraph's data is to set up a snapshot schedule for this disk. We strongly recommend that you take the time to do so here.
- Mode: Read/write
- Deletion rule: Keep disk
- Size:
250GB minimum (As a rule of thumb, Sourcegraph needs at least as much space as all your repositories combined take up. Allocating as much disk space as you can upfront helps you avoid resizing this disk later on.)
-
Create your VM, then navigate to its public IP address.
-
If you have configured a DNS entry for the IP, configure
externalURLto reflect that. -
You may have to wait a minute or two for the instance to finish initializing before Sourcegraph becomes accessible. You can monitor the status by SSHing into the instance and running the following diagnostic commands:
# Follow the status of the user data script you provided earlier
tail -c +0 -f /var/log/syslog | grep startup-script
# (Once the user data script completes) monitor the health of the "sourcegraph-frontend" container
docker ps --filter="name=sourcegraph-frontend-0"To update to the most recent version of Sourcegraph (X.Y.Z), SSH into your instance and run the following:
cd /root/deploy-sourcegraph-docker/docker-composeThen follow the upgrade guide.
The Sourcegraph Docker Compose definition uses Docker volumes to store its data. The script above configures Docker to store all Docker data on the additional persistent disk that was attached to the instance (mounted at /mnt/docker-data - the volumes themselves are stored under /mnt/docker-data/volumes) There are a few different ways to backup this data:
-
(recommended) The most straightfoward method to backup this data is to snapshot the entire
/mnt/docker-datapersistsent disk on an automatic, scheduled basis. The directions above tell you how to set up this schedule when the instance is first created, but you can also create the schedule afterwards. -
Using an external Postgres instance (see below) lets a service such as Cloud SQL for PostgreSQL take care of backing up all of Sourcegraph's user data for you. If the VM instance running Sourcegraph ever dies or is destroyed, creating a fresh instance that's connected to that external Postgres will leave Sourcegraph in the same state that it was before.