Data Loss Prevention

Overview

Harmony Email & Collaboration's Data Loss Prevention (DLP) engine safeguards the organization's data from breaches or unauthorized sharing. It scans emails, attachments, shared files, and text messages, even extracting text from images using OCR. The DLP engine identifies patterns that should not be shared with unauthorized people or destinations.

The DLP engine enables you to create universal policies across multiple cloud applications to control how files are shared amongst internal and external users. DLP identifies and marks files containing confidential, financial, and personally identifiable information, including credit card numbers, social security numbers, bank routing numbers, or data protected under HIPAA, etc.

Note - DLP is not available for Infinity Portal accounts residing in the United Arab Emirates (UAE) region. If required, you can request to enable DLP. However, sensitive data analysis will be performed in the United Kingdom (UK) and not within the borders of the UAE. If you wish to enable DLP, contact Check Point Support.

DLP Policies

This chapter discusses defining the DLP categories, Data Types and other DLP security engine settings.

To enforce your organization’s DLP standards, you need to define DLP policies for different protected SaaS applications.

To configure DLP policy, see the relevant SaaS application:

DLP Categories

DLP categories are containers of multiple data types used in different DLP policies to describe data sharing that can be considered as a DLP violation and should trigger a DLP workflow.

For example, the PII DLP category includes the Passport Number DLP Data Type.

Managing DLP Categories

You can configure all the available DLP categories and manage them under Security Settings > Security Engines > DLP.

Editing DLP Categories

To edit the list of DLP Data Types each category contains:

  1. Access the Harmony Email & Collaboration Administrator Portal.

  2. Click Security Settings > Security Engines.

  3. Click Configure for DLP.

  4. Scroll down to Detection Types and find the relevant DLP category.

  5. Add or remove data types from the category.

    Note - To exclude Universal Air Travel Plan (UATP) card numbers from detecting as credit card numbers, under PCI detection type, enable the Exclude UATP cards from the Credit Card data types checkbox.

  6. Click Save.

For more information about the default DLP Data Types and their DLP categories, see Appendix C: DLP Built-in Data Types and Categories.

DLP Data Types

DLP Data Types describe the content the DLP engine tries to detect. Every time the engine detects a data type, it adds 1 to the hit count of every DLP category containing this data type.

Managing DLP Data Types

To view and manage the available Data Types, go to Security Settings > DLP Data Types.

Custom DLP Data Types

Harmony Email & Collaboration allows you to create custom DLP Data Types. These Data Types provide organizations the flexibility to add any DLP data type to each of the DLP categories.

Note - You must add the custom DLP Data Type to a DLP category before it is enforced. To add the custom DLP Data Type to a DLP category, see DLP Categories.

Creating Custom DLP Data Types

Regular Expression DLP Data Types

Data Types based on regular expressions are data types that will add a hit count to their parent category every time a string in the inspected email/file/message is matched against the defined Regular Expression.

To create a regular expression Data Type:

  1. Click Security Settings > DLP Data Types.

  2. Click Create Data Type.

    Create Custom DLP Data Type section appears.

  3. Enter the required Name and Description for the Data Type.

  4. Under Match type, select Regular Expression and enter the required regular expressions.

    Note - Harmony Email & Collaboration supports Regular Expression 2 syntax. For more information about the syntax, see this article.

  5. Click Save.

Dictionary DLP Data Types

A dictionary is a list of custom strings. These Data Types add a hit count to their parent category every time a string in the inspected email/file/message matches one of the strings in the dictionary.

To create a Dictionary DLP Data Type:

  1. Click Security Settings > DLP Data Types.

  2. Click Create Data Type.

    Create Custom DLP Data Type section appears.

  3. Enter the required Name and Description for the Data Type.

  4. Under Match type, select Dictionary and add the required keywords:

    • To add a keyword to the dictionary, enter the required keyword and click Add Keyword.

    • To import keywords to the dictionary from a CSV file:

      1. Click Import dictionary.

      2. Under Upload Dictionary File, select the required CSV file.

      3. To override the existing keywords, enable the Override all existing words checkbox.

      Note - To export the keywords in the dictionary to a CSV file, click Export dictionary.

  5. Click Save.

Compound DLP Data Types

Compound DLP Data Types are parent DLP Data Types that contain other child DLP Data Types, divided into two groups:

  • TriggersDLP Data Types that must match otherwise, the parent DLP Data Type will not match

  • ChildrenDLP Data Types that could match and add to the parent DLP Data Type hit count.

In addition, each Compound DLP Data Type has a Minimum Match Type Count of its own so that the number of matches across all contained data types must be above it for the parent DLP Data Type to match.

For example, you can create a compound DLP Data Type named MyCompany Internal Documents the following way:

  1. Triggers

    1. A string “MyCompany”

    2. A string “Confidential”

  2. Children

    1. Source Code

    2. Bank Swift routing numbers

  3. Minimum Match Type Count = 4

Example scenarios:

Scenario

Findings

Match?

Reason

“My Company”

"Confidential"

Source Code

Bank SWIFT Routing Numbers

Only Triggers

2

3

0

0

Yes

All triggers plus match count above the threshold

Some Triggers

3

0

2

2

No

One of the triggers not matched

Not enough matches

1

1

1

0

No

Match count below the threshold

Triggers and Children

1

1

2

2

Yes

All triggers plus match count above the threshold

Creating a Compound DLP Data Type

Harmony Email & Collaboration allows you to define a custom Compound DLP Data Type.

To create a compound DLP Data Type:

  1. Click Security Settings > Security Engines.

  2. Click Configure for DLP.

  3. Scroll down and find Patient Information below Compound Info Types.

  4. Edit the Triggers, Children, and Minimum Match Type Count.

  5. Add Patient Information to one of the DLP Categories so that it can be used in the DLP policy rules. For more details, see DLP Categories.

  6. Click Save.

Other Custom Data Types

If you need a different custom data type, open a support ticket or contact Check Point Support.

Edit, Clone, or Delete Custom DLP Data Types

To edit a custom DLP Data Type:

  1. Click Security Settings > DLP Data Types.

  2. Select a custom DLP Data Type.

  3. Click on the vertical ellipses icon (in the top right corner of the selected custom DLP Data Type), and then select Edit.

  4. Make the required changes to the DLP Data Type and click Save.

To clone a custom DLP Data Type:

  1. Click Security Settings > DLP Data Types.

  2. Select a custom DLP Data Type.

  3. Click on the vertical ellipses icon (in the top right corner of the selected custom DLP Data Type), and then select Clone.

  4. Make the required changes to the DLP Data Type and click Save.

To delete a custom DLP Data Type:

  1. Click Security Settings > DLP Data Types.

  2. Select a custom DLP Data Type.

  3. Click on the vertical ellipses icon (in the top right corner of the selected custom DLP Data Type), and then select Delete.

  4. Click OK.

Configuring Advanced Data Type Parameters

To refine the definitions of a DLP category or to handle cases of false-positive detections, you can control how to match a DLP Data Type in an email/file/message.

Match Hit Count Settings

By default, a DLP Data Type's hit count increases every time a string in the email/file/message matches with the DLP Data Type’s definitions. If the same matched string appears multiple times in the email/file/message, the hit count increases accordingly.

To configure Harmony Email & Collaboration to ignore duplications of the same string when calculating the hit count, enable the Unique detections only box in the Configure DLP window.

Occurrence Threshold

By default, if a DLP Data Type is matched X times, the hit count of the DLP Category containing this DLP Data Type increases by X.

Setting the occurrence threshold for the DLP Data Type to Y means that:

  • If the DLP Data Type matches < Y times, the hit count of the containing DLP Category will not be increased at all.

  • If the DLP Data Type matches >= Y times, the hit count of the containing DLP Category will be increased by the total number of matches.

To configure Occurrence Threshold, open a support ticket or contact Check Point Support.

Likelihood Adjustment

By default, the DLP engine returns a specific likelihood level (Minimal Likelihood) to a DLP Category.

If you want to determine if one of the DLP Data Types is matched, the likelihood will automatically increase or decrease. You can configure the Likelihood Adjustment value for every DLP Data Type with positive or negative values accordingly.

To configure Likelihood Adjustment, open a support ticket or contact Check Point Support.

Hot/Cold Words

Every DLP Data Type is searched across the entire email/file/message by default.

You can define the scope of the search so that it happens in the vicinity of certain words and/or not in the vicinity of others.

To configure Hot/Cold Words, open a support ticket or contact Check Point Support.

Configuring DLP Engine Settings

To configure DLP engine settings:

  1. Click Security Settings > Security Engines.

  2. Click Configure for DLP.

  3. Configure the different configuration options and click Save.

Storage of Detected Strings

When the DLP engine matches strings to a DLP Data Type, Harmony Email & Collaboration stores these strings and displays them for administrators with sufficient permissions when they investigate the security events.

Since these strings are considered sensitive and private end-user data, you can select how they are stored and presented in the system called Detected Text Storage Mode.

To update Detected Text Storage Mode:

  1. Click Security Settings > Security Engines.

  2. Click Configure for DLP.

  3. Scroll down to Detected Text Storage Mode and select one of these options.

    • Store detected text strings (default): This is the default option, and the detected data is saved and displayed on the security events for the forensic process.

    • Obfuscate detected text prior to storage: Detected data is saved and displayed on the security events obfuscated. The original data is discarded and cannot be accessed.

    • Do not store detected text: No detected data is stored or displayed on the security events.

  4. Click Save.

Minimal Likelihood

Whenever the DLP engine detects a possible data leak, it assigns the detection a Likelihood level. Likelihood levels are mostly affected by context around the detected strings.

For example, when a Social Security Number (SSN) is discovered, the DLP engine also checks for the presence of relevant strings close to the discovered pattern, i.e., "SSN" or "Social Security."

Likelihood scale:

  • Very Unlikely

  • Unlikely

  • Possible

  • Likely

  • Very Likely

DLP Exceptions

See DLP Exceptions.

DLP - Supported File Types

Harmony Email & Collaboration detects DLP violations in a large list of file types, including EML, HTML, PDF, Microsoft Office files, images, and many more.

For more information, see Appendix G: Supported File Types for DLP.

DLP Inspection - File Size Limit

The DLP security engine inspects the email, its attachments and files that are less than 50 MB only.

Note - At times, the DLP security engine might inspect the archived files larger than 50 MB.

Forensics

DLP detections are recorded as events for forensic and auditing purposes. The events include what type of sensitive information was potentially leaked (PII, HIPAA, etc.).

You can see events from Events.