Enabling Deep Content Search and PDF Burn Service

last updated on July 16, 2021

SME can index the content of federated storage endpoints to provide to provide searching of their contents. Apache Solr is used to index the content. Contents of the following file types can be indexed for searching:

7zdocxjarodtpubvsd
afmdotmjpgogaqpwwar
aifdwgjsoggrdfwav
apkearkeyopusrsswb3
aremfkmlp7srtfwebarchive
asfemlkmzpagessdawma
auemlxm4apbmsdcwmf
bmpepubmboxpctsddwmv
boxepubmdbpdfsdwwps
cexemhtmlpgmshwxhtml
chmfb2midpngsvgxlr
classfitsmp3potmsvgzxls
cpioflacmp4ppmtarxlsb
cssflvmppppsmtbz2xlsm
csvgifmsgppsxtgzxlsx
dathdfncpptthmxxml
ditahe5numberspptmtifxmp
ditamaphtmodfpptxttfxps
dochtmlodpprttxtzip
docmibooksodspsdvor

For evaluation the standard appliance is configured for deep content search. The service is disabled out of the box. This guide walks you thorough the steps to enable deep content search

A dedicated File Fabric appliance should be used for Solr in production.

The SME Enterprise Appliance also provides a PDF Annotation feature that allows PDFs to be annotated and burnt. For that service to work it needs to be enabled and this guide will also step through how the burn service can be enabled.

Content indexing and searching is delivered for recent File Fabric versions as a Docker Compose service, “solr”. If high availability is required then a second service, “solr-replicas”, must also be used.
PDF burning is delivered for recent File Fabric versions as a Docker Compose service, “pdfburner.

“Information on starting the File Fabric's Docker Compose services can be found here. Information on high availability for content indexing and searching can be found here.

ssh as root

For these commands you will need to su as root

$ ssh smeconfiguser@appliance IP address

after establishing the ssh session su as root

-bash-3.2$ su - root
Password:

Start the search and Burn PDF Service

Execute the following 2 commands:

service jetty start
chkconfig jetty on

The first command will start the service and second command will automatically start the service after a reboot.

Login as appladmin and enable the PDF Annotations tool in the extra options section for the package and press save.

Configure Search Values

Login as appladmin and from the right hand menu select search integration For the internal service you can use the following default values

Solr URI http://127.0.0.1:7070/sme/
Solr login solr
Solr password drom6etsh9Onk
Max file size to index 10485760

Assign Search to User Package

Login as appladmin and enable Content Search Enabled in the extra options section for the package and press save

After Content Search has enabled in the package, each Storage Provider Settings page will present the option to enable content search for the provider. Use this option to control whether COntent Search will be available for each provider.

If you wish to enable Content Search and you have already started using one or more storage providers with your File Fabric, how you should proceed depends on whether Content Search has been integrated with your appliance and Content Search been enabled in your organization's package for the entire time that your File Fabric has been in use:

  • If Content Search was both integrated and enabled then you need only tick the “Index content for search” box on the Provider Settings page for each provider for which you want Content Search enabled. Don't forget to save the change to the settings.
  • If, however, Content Search was not integrated or was not enabled then please contact SME Support.
This website uses cookies. By using the website, you agree with storing cookies on your computer. Also you acknowledge that you have read and understand our Privacy Policy. If you do not agree leave the website.More information about cookies