Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
cloudfilesearch [2018_07_19 19:48]
steven [3 Deeper Search Options]
cloudfilesearch [2018_08_02 14:27]
steven
Line 32: Line 32:
 ===== 3 Content Search ===== ===== 3 Content Search =====
  
-Content search is supported with our on-premises cloud appliance with the [[http://​lucene.apache.org/​core/​|Apache Lucene search engine]]. ​Please ​[[mailto:​sales@storgemadeeasy.com|contact us]] for more information.+Content search is supported with our on-premises cloud appliance with the [[http://​lucene.apache.org/​core/​|Apache Lucene search engine]]. ​For more information see [[cloudappliance/​solr]].
  
 {{:/​cloudfilesearch:​steps:​sme_3_deeper_search_option.png}} {{:/​cloudfilesearch:​steps:​sme_3_deeper_search_option.png}}
  
-==== Search Options ====+==== 3.1 Content ​Search Options ====
  
-Search input: The input string is first split into search tokens, by its spaces and punctuation characters (which are ignored). Each of these search tokens are queried in Solr using AND operators. How these tokenized search terms are used in the query is dependent on the modifier flags that you have set when running the query. Searches are case insensitive.+Search input: The input string is first split into search tokens, by its spaces and punctuation characters (which are ignored). Each of these search tokens are queried in Solr (using AND or OR operators). How these tokenized search terms are used in the query is dependent on the modifier flags that you have set when running the query. Searches are case insensitive.
  
 Here's a worked example for the modifiers, if we are searching for "a summer holiday"​. ​ Here's a worked example for the modifiers, if we are searching for "a summer holiday"​. ​
  
- * **Match exact word or phrase** - This searches for documents containing the exact phrase. If this string is found in the middle of a document, Solr will expect a proceeding space before the "​a"​ and after the "​holiday"​. If there'​s a string "a summer holidays"​ then that will not be flagged as a match.  + * **Match exact word or phrase** - This searches for documents containing the exact phrase. If this string is found in the middle of a document, Solr will expect a proceeding ​word break (space or symbol) ​before the "​a"​ and after the "​holiday"​. If there'​s a string "a summer holidays"​ then that will not be flagged as a match. ​ 
- * **Whole word** - Given each of the word tokens "​a",​ "​summer"​ and "​holiday"​ it would look for documents containing at least one words "​a",​ "​summer" ​and "​holiday",​ in any part of the document, regardless of order. So it might flag a document that only contains the word "​holiday"​. The pseudo query would like like: "​a"​ OR "​summer"​ OR "​holiday"​ +   * To find documents that include every word, but not necessarily together, use the "​AND"​ operator. For example, "a AND summer AND holidays"​ 
-So, this could match "I would like a little holiday" ​ ("​a"​ and "​holiday"​ match)+ 
 + * **Whole word** - Given each of the word tokens "​a",​ "​summer"​ and "​holiday"​ it would look for documents containing at least one of the words "​a",​ "​summer" ​or "​holiday",​ in any part of the document, regardless of order. So it might flag a document that only contains the word "​holiday"​. The pseudo query would like like: "​a"​ OR "​summer"​ OR "​holiday"​So, this could match "I would like a little holiday" ​ ("​a"​ and "​holiday"​ match) 
 +   * To find documents that must include all whole words, use the "​AND"​ operator. For example, "a AND summer AND holidays"​ 
  * **Begins with word** - Given each of the word tokens, it searches for documents either containing at least one of those words exactly, or where a word starts with either of those terms. The pseudo query looks like: "​a\*"​ OR "​summer\*"​ OR "​holiday"​.  * **Begins with word** - Given each of the word tokens, it searches for documents either containing at least one of those words exactly, or where a word starts with either of those terms. The pseudo query looks like: "​a\*"​ OR "​summer\*"​ OR "​holiday"​.
 So, this could match "Where are you going on your holidays?" ​ ("​are"​ and "​holidays"​ match) So, this could match "Where are you going on your holidays?" ​ ("​are"​ and "​holidays"​ match)
 +
  * **Ends with word** - This is the inverse of Begins with Word. So, it searches for words ending in the tokens. The pseudo query looks like: "​\*a"​ OR "​\*summer"​ OR "​\*holiday"​.  * **Ends with word** - This is the inverse of Begins with Word. So, it searches for words ending in the tokens. The pseudo query looks like: "​\*a"​ OR "​\*summer"​ OR "​\*holiday"​.
-So, this could match "could you find my bra?" ("bra" ​match)+So, this could match "could you find my cake?" ("cake" ​matches) 
  * **Use wildcards** - This is a combination of Ends with and Starts with.  The pseudo query looks like: "​\*a\*"​ OR "​\*summer\*"​ OR "​\*holiday\*"​.  * **Use wildcards** - This is a combination of Ends with and Starts with.  The pseudo query looks like: "​\*a\*"​ OR "​\*summer\*"​ OR "​\*holiday\*"​.
-So, this could match "could you find my suncream for my summer holidays?"​ ("suncream", "​summer"​ and "​holidays"​ match)+So, this could match "could you find my suncream for my summer holidays?"​ ("cake", "​summer"​ and "​holidays"​ match)
  
 If you use multiple modifiers, it becomes an OR between them. Here's some notes on that behaviour: If you use multiple modifiers, it becomes an OR between them. Here's some notes on that behaviour:
  
  * Ticking "​Begins with word" and "Ends with word" is just the same as ticking "use wildcards"​  * Ticking "​Begins with word" and "Ends with word" is just the same as ticking "use wildcards"​
- * The modifiers are OR's between each one. So, "Whole word" and "​Begins with word" are probably quite useful together. ​+ * The modifiers are OR's between each one. So, "Whole word" and "​Begins with word" are probably quite useful together.