Glean Cloud-prem (Self-hosted): GCP Preparation Guide

This page will cover the steps required by your GCP admins to prepare a GCP project that is ready for your Glean build.

Written by Dan Iacono
Updated over a month ago

Overview

Glean provides our customers the ability to deploy Glean software inside their own Google Cloud Platform (GCP) project. This deployment requires your GCP admin to:

  1. Create a new GCP project.

  2. Associate a valid billing account.

  3. Enable applicable GCP APIs.

  4. Request the required quota increases from GCP.

  5. Create a Service Account with a Project Owner role and associate a JSON account key.

  6. Notify Glean of the GCP zone selected, the Project Name, Project ID, and Project Number.

After completing the above, Glean's systems will automatically build and deploy the required compute, workflows, and software into your GCP project.

At this stage, Glean will advise you that your tenant is ready, allowing your admins to proceed with the setup process, such as configuring Single Sign-On (SSO) and connecting your data sources.

Preferred GCP Regions

To ensure our customers enjoy the highest quality of machine learning performance, reduced latency, and adherence to regional data residency laws, Glean advises deploying our services in one of the following recommended GCP regions:

  • Iowa, USA (us-central1)

  • Taiwan, APAC (asia-east1)

  • The Netherlands, EMEA (europe-west4)

These regions are equipped with Tensor Processing Units (TPUs), which are specifically designed to accelerate machine learning tasks. Additionally, selecting these regions can lead to cost savings and help maintain compliance with various regulations.

Glean does support additional GCP regions, but these alternative locations use Graphics Processing Units (GPUs) rather than TPUs for machine learning workloads. Workloads on GPUs take longer to complete, which increases the cost of running the workflows. Additionally, depending on the region, only a limited range of GPU models may be available, some of which will result in lower-quality output. We therefore strongly recommend hosting your Glean projects in the preferred regions listed above to achieve the best balance of performance and cost-efficiency.

Should you need more information or assistance in determining the most suitable GCP region for your Glean deployment, please don't hesitate to reach out to your dedicated Glean engineer.

GCP Environment Preparation Process

1. Select a GCP Region

  1. Select a supported GCP region for Glean to build your environment in, then notify Glean of the specific zone selected within that region, e.g. us-central1-a.
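Before notifying Glean, it can help to confirm that the zone you picked falls inside one of the preferred regions listed earlier. The following is a minimal sketch, not a Glean tool: the helper names `region_of` and `is_preferred_zone` are hypothetical, and the check assumes the usual GCP naming where a zone is a region name followed by a hyphen and a single letter.

```python
# Preferred regions as listed in this guide.
PREFERRED_REGIONS = {"us-central1", "asia-east1", "europe-west4"}


def region_of(zone: str) -> str:
    """Return the region part of a zone name, e.g. us-central1-a -> us-central1."""
    region, sep, suffix = zone.rpartition("-")
    # A zone is expected to end in "-<single letter>"; anything else is rejected.
    if not sep or not region or len(suffix) != 1 or not suffix.isalpha():
        raise ValueError(f"not a zone name: {zone!r}")
    return region


def is_preferred_zone(zone: str) -> bool:
    """True when the zone sits in one of the preferred regions."""
    return region_of(zone) in PREFERRED_REGIONS


print(is_preferred_zone("us-central1-a"))  # True
print(is_preferred_zone("us-east1-b"))     # False
```

A zone outside the preferred regions is not necessarily unsupported. It only means the GPU-related trade-offs described above may apply.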

2. Create a new GCP Project

  1. Go to the Manage Resources page in the GCP console and click Create Project.

  2. In the New Project window that appears, add a project name, organization, and location. For the project name, the preferred format is glean-{customer name} or glean-{customer name}-{prod/sandbox}.

  3. Make sure your project is created under the same organization as your Google Workspace (formerly G Suite) account, not “No Organization”.

  4. Click Create.

  5. Notify Glean of the following:

    1. Project name, e.g. “glean-company”

    2. Project ID, e.g. “glean-company”

    3. Project number, e.g. “715000000000”

  6. Go to Billing in the GCP console.

  7. Click Link a billing account to set up billing for this project.

    1. Ensure that the billing account has a corporate credit card attached to it. A “free trial billing tier” account will not work.
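For admins who prefer the command line to the console, the project and billing steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not a Glean-provided script: the helper names are hypothetical, the organization ID and billing account ID in the usage lines are placeholders, and the code only prints the `gcloud` commands rather than running them.

```python
import re
import shlex
from typing import List, Optional

# GCP project IDs are 6-30 characters: lowercase letters, digits and hyphens,
# starting with a letter and not ending with a hyphen.
PROJECT_ID_RE = re.compile(r"^[a-z][a-z0-9-]{4,28}[a-z0-9]$")


def glean_project_id(customer: str, env: Optional[str] = None) -> str:
    """Build glean-{customer name} or glean-{customer name}-{prod/sandbox}."""
    if env not in (None, "prod", "sandbox"):
        raise ValueError("env must be 'prod', 'sandbox' or omitted")
    slug = re.sub(r"[^a-z0-9]+", "-", customer.lower()).strip("-")
    if not slug:
        raise ValueError("customer name has no usable characters")
    project_id = "-".join(["glean", slug] + ([env] if env else []))
    if not PROJECT_ID_RE.match(project_id):
        raise ValueError(f"not a valid GCP project ID: {project_id!r}")
    return project_id


def project_setup_commands(project_id: str, organization_id: str,
                           billing_account_id: str) -> List[List[str]]:
    """Return gcloud commands that mirror the console steps above."""
    return [
        # Create the project under your organization (not "No Organization").
        ["gcloud", "projects", "create", project_id,
         f"--name={project_id}", f"--organization={organization_id}"],
        # Link the billing account to the new project.
        ["gcloud", "billing", "projects", "link", project_id,
         f"--billing-account={billing_account_id}"],
        # Look up the project number to pass on to Glean.
        ["gcloud", "projects", "describe", project_id,
         "--format=value(projectNumber)"],
    ]


# Placeholder organization and billing IDs; substitute your own.
project = glean_project_id("Company")
for cmd in project_setup_commands(project, "123456789012", "000000-AAAAAA-BBBBBB"):
    print(shlex.join(cmd))
```

Review the printed commands before running them, since creating a project and linking billing are both changes to your live GCP organization.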

3. Enable GCP APIs for the Project

Enable the following APIs by going to the link and clicking Enable API:

4. Quota Changes Requests

As part of the setup, Glean checks quota requirements and makes quota change requests as needed. Customers will be alerted when quota requests need approval. All of the quota needs are listed in this spreadsheet. Please note that some quota requests will require filing a ticket with GCP support, which usually responds within 2 days.

The quotas you will request will differ depending on the size of the Glean tenant that will be built:

| Use Case                               | Deployment Size   |
|----------------------------------------|-------------------|
| Sandbox / UAT environments / < 1M docs | Small Deployment  |
| <50M documents to be indexed           | Medium Deployment |
| >50M documents to be indexed           | Large Deployment  |

If you are not sure how to proceed, please consult with your Glean engineer.

Note: Quota requests for some resources for a Large Deployment may fail depending on the GCP region you have selected due to the compute types available. If this is the case, please work with your Glean engineer, who can advise you further.
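The sizing guidance above can be expressed as a small lookup. This is a sketch only: `deployment_size` is a hypothetical helper, and treating exactly 50M documents as a Large Deployment is an assumption, since the guide only states "<50M" and ">50M".

```python
def deployment_size(doc_count: int, sandbox: bool = False) -> str:
    """Map an expected document count to the deployment size used for quotas."""
    if doc_count < 0:
        raise ValueError("doc_count must not be negative")
    # Sandbox / UAT environments are always sized as Small.
    if sandbox or doc_count < 1_000_000:
        return "Small Deployment"
    if doc_count < 50_000_000:
        return "Medium Deployment"
    # Assumption: exactly 50M is treated as Large.
    return "Large Deployment"


print(deployment_size(200_000))       # Small Deployment
print(deployment_size(12_000_000))    # Medium Deployment
print(deployment_size(80_000_000))    # Large Deployment
```

If your document count sits near a boundary, confirm the size with your Glean engineer rather than relying on this rule of thumb.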

5. Create a Service Account

Go to the Service Accounts page in the GCP console and click Select a Project.

  1. Select your project and click Open.

  2. Click Create Service Account. Enter the service account name (glean-admin), ID, and description (optional), then click Create.

  3. Click the Select a role dropdown to make your service account an Owner of the project.

  4. Click Continue.

  5. Skip the optional “Grant users access to this service account” section.

  6. Click Create Key. In the panel that appears, select JSON as the key type, then click Create.

  7. A private JSON key will be saved to your computer.

  8. Contact your Glean representative to let them know that the project has been created, and provide them with the project name and project ID.

Note: A service account granted the Owner role on a specific GCP project has full access to the resources and services in that project only. It does not have permission to access or modify resources in any other project, even one within the same GCP tenant.
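The console steps for the service account have a command-line equivalent, sketched below. This is an assumption-laden illustration rather than Glean's own procedure: `service_account_commands` is a hypothetical helper, the key file name is a placeholder, and the code prints the `gcloud` commands instead of executing them.

```python
import shlex
from typing import List


def service_account_commands(project_id: str,
                             account_id: str = "glean-admin",
                             key_path: str = "glean-admin-key.json") -> List[List[str]]:
    """Return gcloud commands that create the account, grant Owner, and create a JSON key."""
    # Service account emails follow this fixed pattern.
    email = f"{account_id}@{project_id}.iam.gserviceaccount.com"
    return [
        ["gcloud", "iam", "service-accounts", "create", account_id,
         f"--project={project_id}", f"--display-name={account_id}"],
        # The Owner role is bound on this project only.
        ["gcloud", "projects", "add-iam-policy-binding", project_id,
         f"--member=serviceAccount:{email}", "--role=roles/owner"],
        # Writes a private JSON key to key_path on the local machine.
        ["gcloud", "iam", "service-accounts", "keys", "create", key_path,
         f"--iam-account={email}"],
    ]


for cmd in service_account_commands("glean-company"):
    print(shlex.join(cmd))
```

The resulting key file grants Owner access to the project, so store it securely and share it only through the Glean portal.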

Glean Self-Service GCP Environment Validation

  1. After Glean confirms that you have been set up in the setup portal, browse to https://app.glean.com/admin and enter your email address to have a magic link sent to your email. The first screen is an admin setup screen. Either add additional admins or skip to the next step.

  2. Follow the on-screen instructions (they are also summarized below as a preview).

  3. Upload your JSON key into the Glean portal. The portal will validate the key and provide corrective instructions where needed, which may cover quota updates (step 9) and organization policy constraints inherited from the parent organization, if any apply.

If the validation passes, then Glean is ready to deploy to your GCP environment! 🎉
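Before uploading, you may want to run a quick local sanity check on the key file. The sketch below is not the portal's validation and does not replace it: the helper names are hypothetical, and it only confirms that the file looks like a service account key for the expected project.

```python
import json
from typing import List

# Fields that a downloaded service account JSON key is expected to contain.
REQUIRED_FIELDS = ("type", "project_id", "private_key_id", "private_key", "client_email")


def check_key(key: dict, expected_project_id: str) -> List[str]:
    """Return a list of problems found in a parsed key; empty means it looks plausible."""
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not key.get(name)]
    if key.get("type") and key["type"] != "service_account":
        problems.append(f"unexpected key type: {key['type']}")
    if key.get("project_id") and key["project_id"] != expected_project_id:
        problems.append(f"key belongs to project {key['project_id']}, "
                        f"expected {expected_project_id}")
    return problems


def check_key_file(path: str, expected_project_id: str) -> List[str]:
    """Load a JSON key file from disk and run the same checks."""
    with open(path, encoding="utf-8") as fh:
        return check_key(json.load(fh), expected_project_id)
```

For example, `check_key_file("glean-admin-key.json", "glean-company")` returns an empty list when the file has the expected shape. The file name here is a placeholder.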

Troubleshooting

If the validation fails, the error message will indicate the issue.

Typically this is due to:

  • Organization policy constraints that interfere with the Glean build.

  • Missing or insufficient quotas.

  • Incorrect permissions or roles assigned to the Service Account.

  • GCP APIs that have not been enabled.

Please correct the issues indicated before attempting validation again. If you are unsure of anything, please contact your Glean engineer, who will assist you.
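Each of the typical causes above can be inspected from the command line. The mapping below is a rough sketch, not an official diagnostic: `diagnostic_commands` is a hypothetical helper, the quota command covers only Compute Engine regional quotas (and needs the Compute Engine API to be enabled), and the commands are printed rather than run.

```python
import shlex
from typing import Dict, List


def diagnostic_commands(project_id: str, region: str) -> Dict[str, List[str]]:
    """Map each common validation failure to a gcloud command that helps inspect it."""
    return {
        "org constraints": ["gcloud", "resource-manager", "org-policies", "list",
                            f"--project={project_id}"],
        # Shows Compute Engine quotas and current usage for one region.
        "quotas": ["gcloud", "compute", "regions", "describe", region,
                   f"--project={project_id}"],
        # Lists role bindings, including the service account's roles.
        "service account roles": ["gcloud", "projects", "get-iam-policy", project_id],
        "enabled APIs": ["gcloud", "services", "list", "--enabled",
                         f"--project={project_id}"],
    }


for cause, cmd in diagnostic_commands("glean-company", "us-central1").items():
    print(f"{cause}: {shlex.join(cmd)}")
```

Compare the output against the error message shown in the portal to narrow down which item needs correcting.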
