Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
contentdiscovery [2018_09_21 19:01] – [Activities] stevencontentdiscovery [2020_03_12 11:50] – [Detecting Content] jim
Line 1: Line 1:
-# Content Discovery +# Content Discovery and DLP
  
-The Content Discovery feature allows files to be tagged based on text that they contain. Whether for compliance or competitive reasons, organisations often need to identify documents of special interest.+The File Fabric Content Discovery feature enables files to be discovered based on text that they contain. Actions can be set on what happens to such data once it is discovered and flag it to a nominated person so that an appropriate action or process can be taken. This prevents Data Loss or Compliance breach exposure (Data Loss Prevention). 
 + 
 +Data stored and uploaded is scanned for data stored in files, based on pre-selected templates, in real-time. 
 + 
 +Whether for DLP, compliance or competitive reasons, companies often have a need to identify documents of special interest ie.
  
  * Which documents contain personal data restricted under GDPR?  * Which documents contain personal data restricted under GDPR?
Line 7: Line 11:
  * Which files contain the name "Ernie Madoff"?  * Which files contain the name "Ernie Madoff"?
  
-Content rules can also be used with [[automationrules]] that react to content discovery events by taking actions such as sending an email or moving a file.+Content rules can also be used with [[automationrules]] that react to content discovery events by taking actions such as sending an emailmoving a file etc.
  
 Applies to: Applies to:
  
- * Enterprise File Fabric Appliance [Add-On] (since [[cloudappliance/applupdatev1808|v1808]])+ * Enterprise File Fabric Appliance [add-on] (since [[cloudappliance/applupdatev1808|v1808]])
  
 See also: See also:
Line 24: Line 28:
 ### Detecting Content ### Detecting Content
  
-Content Discovery works by looking for content of interest after files are indexed by the search engine. This happens when files are added or updated, and when storage providers are added or synchronised.  A set of content detectors is used to look for different kinds of information.  (We refer to data identified by a content detector as “matching content”.)+Content Discovery works by looking for content of interest after files are indexed by Content Search. This happens when files are added or updated, and when storage providers are added or synchronised (if Content Search is active for the provider).  A set of content detectors is used to look for different kinds of information.  (We refer to data identified by a content detector as “matching content”.)
  
 Our example company operating within the GDPR might have a detector for UK NHS numbers and a detector for Spanish NIF numbers, among others. Our example sales organization company might have detectors for a specific set of SKUs Our example company operating within the GDPR might have a detector for UK NHS numbers and a detector for Spanish NIF numbers, among others. Our example sales organization company might have detectors for a specific set of SKUs
Line 72: Line 76:
  
 When matching content is detected in a file, a category tag is added to the file metadata indicating the type of content that was detected.  For example, if the File Fabric is configured to scan for US social security numbers as part of the “North America - National Identifiers” Content Detection Category, and one or more matching data values are found by the social security number detector when the file is scanned, then a tag with the value “US Social Security Number” will be added to the file's metadata under the “North America - National Identifiers” classification. When matching content is detected in a file, a category tag is added to the file metadata indicating the type of content that was detected.  For example, if the File Fabric is configured to scan for US social security numbers as part of the “North America - National Identifiers” Content Detection Category, and one or more matching data values are found by the social security number detector when the file is scanned, then a tag with the value “US Social Security Number” will be added to the file's metadata under the “North America - National Identifiers” classification.
 +
 {{ :contentdiscovery:tag.png?direct&400 |}} {{ :contentdiscovery:tag.png?direct&400 |}}
    
Line 80: Line 85:
  
 Content Discovery users, including administrators, receive a notification by email: Content Discovery users, including administrators, receive a notification by email:
 +
 {{ :contentdiscovery:detected_email.png?direct&400 |}} {{ :contentdiscovery:detected_email.png?direct&400 |}}
  
 The file owner (the user who uploaded the file), receives both an email and a message: The file owner (the user who uploaded the file), receives both an email and a message:
 +
 {{ :contentdiscovery:detected_message.png?direct&400 |}} {{ :contentdiscovery:detected_message.png?direct&400 |}}
  
Line 92: Line 99:
  
 The folder icons for folders that contain files with matching content - either directly or in a child folder - are marked with a special decoration in the File Manager: The folder icons for folders that contain files with matching content - either directly or in a child folder - are marked with a special decoration in the File Manager:
 +
 {{ :contentdiscovery:folder_icon_decoration.png?direct&150 |}} {{ :contentdiscovery:folder_icon_decoration.png?direct&150 |}}
    
  
 File icons for files with matching content also have a special decoration in the File Manager:  File icons for files with matching content also have a special decoration in the File Manager: 
 +
 {{ :contentdiscovery:file_icon_decoration.png?direct&150 |}} {{ :contentdiscovery:file_icon_decoration.png?direct&150 |}}
    
  
 When the contents of a folder that contains files with matching content, including within subfolders, a notice about the presence of files with matching content is added to the top of the file listing area in the right-hand panel of the File Manager: When the contents of a folder that contains files with matching content, including within subfolders, a notice about the presence of files with matching content is added to the top of the file listing area in the right-hand panel of the File Manager:
 +
 {{ :contentdiscovery:files_containing_content_detected.png?direct&600 |}} {{ :contentdiscovery:files_containing_content_detected.png?direct&600 |}}
  
Line 109: Line 119:
  
 A confirmation dialog is presented to users who share documents that contain matching content: A confirmation dialog is presented to users who share documents that contain matching content:
 +
 {{ :contentdiscovery:sharing_warning.png?direct&400 |}} {{ :contentdiscovery:sharing_warning.png?direct&400 |}}
    
  
 When the file is shared notifications are sent by email to Content Discovery users, including administrators: When the file is shared notifications are sent by email to Content Discovery users, including administrators:
 +
 {{ :contentdiscovery:sharing_link_email.png?direct&400 |}} {{ :contentdiscovery:sharing_link_email.png?direct&400 |}}
  
Line 122: Line 134:
  
 To search for files with matching content use either or both of the Content Detection Categories control and the Detected Content control on the File Manager’s Search tab:  To search for files with matching content use either or both of the Content Detection Categories control and the Detected Content control on the File Manager’s Search tab: 
 +
 {{ :contentdiscovery:cd_search_screen_controls.png?direct&400 |}} {{ :contentdiscovery:cd_search_screen_controls.png?direct&400 |}}
  
 When searching by Content Detection Categories, check the category or categories for which you want to search: When searching by Content Detection Categories, check the category or categories for which you want to search:
 +
 {{ :contentdiscovery:cd_search_categories_selected.png?direct&400 |}} {{ :contentdiscovery:cd_search_categories_selected.png?direct&400 |}}
    
Line 131: Line 145:
  
 When searching by content detectors, tick the detectors for the kinds of matching content for which you want to search: When searching by content detectors, tick the detectors for the kinds of matching content for which you want to search:
 +
 {{ :contentdiscovery:cd_search_detectors_selected.png?direct&400 |}} {{ :contentdiscovery:cd_search_detectors_selected.png?direct&400 |}}
  
Line 142: Line 157:
  
 Each of the Content Discovery Groups belonging to an organisation is treated as a tag classification and shown in the list of tag classifications on the File Manager’s Tags tab.  As with other classifications, when a Content Detection Category is selected from the classifications list on the Tags tab, the tags belonging to the selected classification will be displayed in a tag cloud: Each of the Content Discovery Groups belonging to an organisation is treated as a tag classification and shown in the list of tag classifications on the File Manager’s Tags tab.  As with other classifications, when a Content Detection Category is selected from the classifications list on the Tags tab, the tags belonging to the selected classification will be displayed in a tag cloud:
 +
 {{ :contentdiscovery:tag_cloud.png?direct&400 |}} {{ :contentdiscovery:tag_cloud.png?direct&400 |}}
    
  
 Also, as with other classifications, a list of the files to which a specific tag has been attached can be displayed by clicking on the tag in the tag cloud: Also, as with other classifications, a list of the files to which a specific tag has been attached can be displayed by clicking on the tag in the tag cloud:
 +
 {{ :contentdiscovery:files_with_tag.png?direct&400 |}} {{ :contentdiscovery:files_with_tag.png?direct&400 |}}
 +
 ### Info Pane ### Info Pane
 +
 When the File Manager’s Info pane is shown for a file that contains matching content, the Classifications (Content Detection Category names) and Tags (content detector name) for the matching content that was found in the file are displayed for administrators and those in the Content Discovery role. When the File Manager’s Info pane is shown for a file that contains matching content, the Classifications (Content Detection Category names) and Tags (content detector name) for the matching content that was found in the file are displayed for administrators and those in the Content Discovery role.
 +
 {{ ::file_info.png?direct&400 |}} {{ ::file_info.png?direct&400 |}}
  
 Clicking on the “Show discovered content” link causes the matching content to be displayed: Clicking on the “Show discovered content” link causes the matching content to be displayed:
 +
 {{ ::filre_info_content.png?direct&400 |}} {{ ::filre_info_content.png?direct&400 |}}