User Docs
PlatformProduct updates
  • Getting started
    • What is DSPM?
    • Use DSPM in your company
    • Choose how to run DSPM
  • Quick start
  • Deployment guide
    • Sizing
    • Installation
      • Understand installation requirements
        • K3S installation
        • Configuring a HA K3s cluster
        • Configuring Rancher and Fleet agent to run behind an HTTP proxy
        • Install Synergy/Focus/Enterprise using Helm without Rancher
        • Install Synergy/Focus/Enterprise using Rancher
        • Air Gap Installation
        • Uploads to Rancher
      • Upgrade K3s
        • K3s - Upgrade
      • Troubleshooting
        • K3s on RHEL/CentOS/Oracle Linux
        • Networking
        • Configuring Rancher and Fleet agent to run behind a HTTP proxy if cluster was previously registered
    • Estimate hardware capacity needs
  • Administration guide
    • Customer Support Portal
    • Pattern matching
    • Data Controls
    • Analytics
    • Detectors
    • Import custom TLS certificate
    • GQL Quick Guide
    • Critical & Sensitive Classification Attribute Modification
    • How to Check AI Mesh Version
    • Webhooks
    • AI Mesh Overview
    • Is Customer Data Saved by Getvisibility?
  • Enterprise setup
    • Authentication
      • Keycloak configuration
      • Single Sign-on (SSO)
        • Using Azure AD as Keycloak Identity Provider
      • Keycloak User Federation Configuration (LDAP/AD)
      • Enable 2FA
      • Role-Based Access Control (RBAC)
      • Keycloak User Federation using LDAP over SSL
  • Implementation
    • Configuring Taxonomies & Labels
  • Integrations
    • GQL
    • Template Language
    • Multi-Language Machine Learning
    • SIEM Integration
    • Google Drive Auto-labelling
  • Scan with Getvisibility
    • Configure detectors
    • Configure data sources
      • Scan Configuration Fields
      • AWS IAM
      • AWS S3
      • Azure AD
      • Azure Blob
      • Azure Files
      • OneDrive
      • SharePoint Online
      • SharePoint on-premise
      • Box
      • Confluence Cloud
      • LDAP
      • SMB
      • Google IAM
      • Google Drive
      • ChatGPT
      • iManage
      • Dropbox
    • Scanning
      • Data Source Permissions
      • Scan Scheduler
      • Types of Scan
      • Scan History
      • Scan Analytics
      • Supported Languages for ML Classifiers
      • Rescan Files
    • Streaming
      • What is DDR?
      • How to Configure DDR Rules
      • Import Data Controls
      • Monitoring New Files via DDR Streaming
      • DDR Supported Events
      • Lineage
      • Supported Data Sources
      • Azure Blob Streaming Configuration
      • Azure Files Streaming Configuration
      • Confluence Cloud Streaming Configuration
      • Sharepoint Online Streaming Configuration
      • SMB Streaming Configuration
      • OneDrive Streaming Configuration
      • Azure AD Streaming Configuration
      • AWS S3 Streaming Configuration
      • Google Drive Streaming Configuration
      • Google IAM Streaming Configuration
      • AWS IAM Streaming Configuration
      • Box Streaming Configuration
      • Dropbox Streaming Configuration
    • Enterprise Search columns meaning
    • Supported File Types
  • Glossary
  • FAQ
  • EDC - All Documents
    • Deployment - Onboarding
      • EDC-Server Installation Guide
      • EDC-Deployment Flow Guide
        • EDC-installerConfig.json and CLI config Details
      • Deploying the agent using ManageEngine
      • EDC-Mac Agent - Installation Guide
      • Windows Agent Precheck Script
    • Functionality - Guides
      • EDC - Admin Guide - v4
      • EDC -Guide for writing Visual Labels
      • EDC- Guide for Header Footer Options
      • EDC-Metadata Details
      • EDC Supported File Types
      • Agent V4 - Configuration Options for Expert Mode
      • File Lineage - Agent Activities
      • Endpoint Data Discovery
    • Troubleshooting Documents
      • Preventing Users From Disabling Agent
      • Generate Installation Logs
      • Troubleshooting Agent for Windows
      • Guide for missing suggestions
      • Reseller Keycloak Quick Installation Guide
      • Alternative authentication methods for agent
  • EDC - All Documents
Powered by GitBook
On this page
  • 1) Discovery
  • 2) Metadata classification
  • 3) Content Classification
  • Trustee Scan

Was this helpful?

Export as PDF
  1. Scan with Getvisibility

Scanning

Scanning process and statuses

PreviousDropboxNextData Source Permissions

Last updated 3 months ago

Was this helpful?

To review Scans and their status go to Data Sources in the Administration drop-down.

The scanning process discovers and analyses files across all configured data sources. It operates in three steps:

1) Discovery

  • The system searches through all files and folders.

    • If a specific path has not been set, the entire Data Source will be scanned.

    • Metadata (path, size, format, etc.) and permissions are extracted and recorded for each file.

  • This step ensures that every every file and folder is identified and that access permissions are understood.

The scan discovery process can have the following statuses, reflecting its progress:

These statuses can be seen in the Last Scan Status column.

2) Metadata classification

This is the continuation of the Discovery process where:

  • Metadata information is processed for each file that has been collected as part of the Discovery step.

  • A detailed analysis of each file's metadata is performed .

3) Content Classification

  • Permissions are analysed and the shared level is identified.

  • A detailed analysis of each file's content is performed.

  • Content is extracted and the sensitivity level and risk of each file is determined for classification.

    • This is determined by the Patterns/Detector setting and the AI Mesh

  • This ensures that sensitive information is properly identified and protected.

Trustee Scan

This is a scan to determine the Users and Groups present in a Data Source.

  • Metadata is extracted for each user, with specific fields depending on the data source. Some of the fields that will be picked up by the scan include Enabled, Last Login, Last Modified, etc.

The statuses for these scans are the same as for files but there are two additional ones.

To see additional information on a running or completed scan click on the Scan Analytics Icon.

This will pop out the Analytics sidebar where there is information such as scan duration, how many files have been scanned, classification insights, etc.

Not Started: Data Source has been added but the scan has not started.

Queued: Scan has been put into the queue for the execution.

Failed To Start: Scan unable to start, usually due to issues with permissions or network.

In Progress: Scan is actively running and processing data discovery.

Cancelled: Scan was manually stopped or automatically aborted.

Incomplete: Scan is partially completed but permissions to files were changed during scan.

Completed: Scan has successfully finished Discovery phase.

Completed Only Users: The scan has been completed only for user-specific policies.

Completed Only Groups: The scan has been completed only for group-specific policies.