For some apps, you can specify exclusion and inclusion rules to manage what data is crawled by Glean. This can be found on the "Manage data" tab within a data source in the admin console.
Learn more about how to specify these rules below.
Google Drive
Overview
Exclusion rules allow you to exclude certain folders, shared drives, and groups from being crawled by Glean.
Inclusion rules allow Glean to only crawl certain shared drives. Any folders within a shared drive would also be crawled. If no shared drives are specified, Glean will crawl all of Google Drive except for content in the Exclusion rules above.
How to find folder IDs
To specify certain folders to exclude in Google Drive, you'll need to get the folder ID.
Go to the folder in Google Drive. In the URL bar, select everything after drive.google.com/drive/folders/
. This is the folder ID.
You can copy and paste this folder ID into Glean.
How to find shared drive IDs
Shared drives are special folders in Google Drive. Because they are folders, the process for finding the shared drive ID is the same as it is for a folder.
Go to the shared drive in Google Drive. In the URL bar, select everything after drive.google.com/drive/folders/
. This is the shared drive ID.
You can copy and paste this shared drive ID into Glean.
Jira
Overview
Exclusion rules allow you to exclude certain Jira projects from being crawled by Glean.
Inclusion rules allow Glean to only crawl certain Jira projects. No other projects will be crawled. If no projects are specified, Glean will crawl all projects except for those in the Exclusion rules.
How to find project IDs
In Jira, click on Projects. In the dropdown that appears, the project IDs are shown in capitalized letters after the name of the project. They are typically short abbreviations.
Box
Overview
Exclusion rules allow you to exclude content belonging to specific users from being crawled by Glean.
How to find user IDs
If you are a Box admin, you should have access to the Content Manager. Click on a user from the user list, and the URL will reveal their user ID. For example, in https://app.box.com/master/content/2267862105/0/0
, the user ID is 2267862105
.
Slack Enterprise Grid
Overview
Inclusion rules allow Glean to only crawl certain Slack workspaces. No other workspaces will be crawled. If no workspaces are specified, Glean will crawl all workspaces.
How to find workspace org IDs
From your desktop, open Slack in a web browser using your org URL (ex. acmeinc.enterprise.slack.com).
Once the page loads, the URL will be in the following format: https://app.slack.com/client/EXXXXXXX/CXXXXXXX.
Your org ID is the string beginning with E.