Differences

This shows you the differences between two versions of the page.

contentdiscovery [2018_09_22 00:34] – [Content Discovery] steven | contentdiscovery [2024_03_19 18:11] (current) – [Content Discovery and DLP] steven
Line 1: Line 1:
-# Content Discovery +# Content Discovery and DLP
  
-The Content Discovery feature allows files to be tagged based on text that they contain. Whether for compliance or competitive reasons, organisations often need to identify documents of special interest. +File Content Discovery enables files to be discovered based on text that they contain. Actions can be configured to determine what happens to such data once it is discovered, for example flagging it to a nominated person so that an appropriate action or process can follow. This helps prevent data loss and compliance breaches (Data Loss Prevention, DLP).
 + 
 +Stored and uploaded files are scanned in real time for matching data, based on pre-selected templates.
 + 
 +Whether for DLP, compliance or competitive reasons, companies often need to identify documents of special interest, for example:
  
  * Which documents contain personal data restricted under GDPR?  * Which documents contain personal data restricted under GDPR?
Line 7: Line 11:
  * Which files contain the name "Ernie Madoff"?  * Which files contain the name "Ernie Madoff"?
  
-Content rules can also be used with [[automationrules]] that react to content discovery events by taking actions such as sending an email or moving a file. +Content rules can also be used with [[automationrules]] that react to content discovery events by taking actions such as sending an email, moving a file etc.
- +
-Applies to: +
- +
- * Enterprise File Fabric Appliance [add-on] (since [[cloudappliance/applupdatev1808|v1808]])+
  
 See also: See also:
Line 18: Line 18:
  * [[automationrules]]  * [[automationrules]]
  
-This feature replaces [[piidiscovery]]. 
  
 ## Feature Summary ## Feature Summary
Line 24: Line 23:
 ### Detecting Content ### Detecting Content
  
-Content Discovery works by looking for content of interest after files are indexed by the search engine. This happens when files are added or updated, and when storage providers are added or synchronised.  A set of content detectors is used to look for different kinds of information.  (We refer to data identified by a content detector as “matching content”.)+Content Discovery works by looking for content of interest after files are indexed by Content Search. This happens when files are added or updated, and when storage providers are added or synchronised (if Content Search is active for the provider).  A set of content detectors is used to look for different kinds of information.  (We refer to data identified by a content detector as “matching content”.)
  
 Our example company operating within the GDPR might have a detector for UK NHS numbers and a detector for Spanish NIF numbers, among others. Our example sales organization might have detectors for a specific set of SKUs. Our example company operating within the GDPR might have a detector for UK NHS numbers and a detector for Spanish NIF numbers, among others. Our example sales organization might have detectors for a specific set of SKUs.
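
To make the idea of a content detector concrete, the following is a minimal sketch in Python of how a detector for UK NHS numbers could be written. It is not the product's implementation: the `ContentDetector` class, the `find_matches` method and the `NHS_DETECTOR` name are hypothetical, and the regular expression is an assumption. The only externally defined piece is the standard Modulus 11 check digit used by NHS numbers.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional


def nhs_checksum_ok(candidate: str) -> bool:
    """Validate a 10-digit NHS number using its Modulus 11 check digit."""
    digits = [int(c) for c in candidate if c.isdigit()]
    if len(digits) != 10:
        return False
    # Weights 10 down to 2 are applied to the first nine digits.
    total = sum(d * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    # A computed check digit of 10 means the number is invalid.
    return check != 10 and check == digits[9]


@dataclass
class ContentDetector:
    """Hypothetical detector: a pattern plus an optional validator."""
    name: str
    pattern: re.Pattern
    validator: Optional[Callable[[str], bool]] = None

    def find_matches(self, text: str) -> List[str]:
        """Return the 'matching content' found in the given text."""
        hits = [m.group(0) for m in self.pattern.finditer(text)]
        if self.validator is not None:
            hits = [h for h in hits if self.validator(h)]
        return hits


# Assumed pattern: ten digits, optionally grouped 3-3-4 by spaces or hyphens.
NHS_DETECTOR = ContentDetector(
    name="UK NHS Number",
    pattern=re.compile(r"\b\d{3}[ -]?\d{3}[ -]?\d{4}\b"),
    validator=nhs_checksum_ok,
)
```

Pairing a loose pattern with a checksum validator is one way to reduce false positives: a ten-digit reference number that fails the check digit is not reported as matching content.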
Line 52: Line 51:
 ### Metadata Indexing ### Metadata Indexing
  
-The Enterprise File Fabric updates it's metadata index when files are added, updated or deleted through the fabric, or when storage providers are synchronised. +Nasuni Access Anywhere updates its metadata index when files are added, updated or deleted through the fabric, or when storage providers are synchronised.
  
 The metadata index is a cache of the file name, size, timestamps and other information that provide fast file searches and directory listings. The metadata index is a cache of the file name, size, timestamps and other information that provide fast file searches and directory listings.
Line 71: Line 70:
 ### Classification and Tagging of Files ### Classification and Tagging of Files
  
-When matching content is detected in a file, a category tag is added to the file metadata indicating the type of content that was detected.  For example, if the File Fabric is configured to scan for US social security numbers as part of the “North America - National Identifiers” Content Detection Category, and one or more matching data values are found by the social security number detector when the file is scanned, then a tag with the value “US Social Security Number” will be added to the file's metadata under the “North America - National Identifiers” classification.+When matching content is detected in a file, a category tag is added to the file metadata indicating the type of content that was detected.  For example, if Access Anywhere is configured to scan for US social security numbers as part of the “North America - National Identifiers” Content Detection Category, and one or more matching data values are found by the social security number detector when the file is scanned, then a tag with the value “US Social Security Number” will be added to the file's metadata under the “North America - National Identifiers” classification.
  
 {{ :contentdiscovery:tag.png?direct&400 |}} {{ :contentdiscovery:tag.png?direct&400 |}}
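
The tagging behaviour described above can be sketched as follows. This is an illustration under stated assumptions, not the product's code: the `CATEGORIES` structure, the `classify` function and the social security number pattern are hypothetical. Only the category and tag names are taken from the example in the text.

```python
import re
from typing import Dict, List

# Hypothetical configuration: category name -> {detector name: pattern}.
CATEGORIES: Dict[str, Dict[str, re.Pattern]] = {
    "North America - National Identifiers": {
        # Assumed pattern for illustration only.
        "US Social Security Number": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    },
}


def classify(
    text: str,
    categories: Dict[str, Dict[str, re.Pattern]] = CATEGORIES,
) -> Dict[str, List[str]]:
    """Return the tags to add, grouped by classification (category)."""
    tags: Dict[str, List[str]] = {}
    for category, detectors in categories.items():
        # One tag per detector that found at least one matching value.
        matched = [name for name, pattern in detectors.items()
                   if pattern.search(text)]
        if matched:
            tags[category] = matched
    return tags
```

In this sketch a detector contributes its tag once per file, however many matching values it finds, which mirrors the "one or more matching data values" wording above.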
Line 138: Line 137:
    
  
-In the initial release of v1808 only files containing matching content for at least one of the content detectors in each of the selected categories will be candidates for inclusion in the search results. This behaviour may change in future versions.+ Files containing matching content for at least one of the content detectors in each of the selected categories will be candidates for inclusion in the search results. This behaviour may change in future versions.
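
The category search rule above amounts to requiring at least one tag in every selected category. A minimal sketch, assuming a file's tags are held as a mapping from category to detector names (the `is_candidate` function and that data shape are hypothetical):

```python
from typing import Dict, Iterable, List


def is_candidate(file_tags: Dict[str, List[str]],
                 selected_categories: Iterable[str]) -> bool:
    """True if the file has at least one tag in each selected category."""
    # An empty selection imposes no category constraint in this sketch.
    return all(file_tags.get(category) for category in selected_categories)
```

For example, a file tagged only under "North America - National Identifiers" is a candidate when that category alone is selected, but not when a second category with no tags on the file is selected as well.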
  
 When searching by content detectors, tick the detectors for the kinds of matching content for which you want to search: When searching by content detectors, tick the detectors for the kinds of matching content for which you want to search: