-
Notifications
You must be signed in to change notification settings - Fork 1
/
search-syntax.txt
27 lines (23 loc) · 1.26 KB
/
search-syntax.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
CORPUS SEARCH SYNTAX
--------------------
The search query syntax is a work in progress, but mostly follows
established text/web search conventions.
Search is based on matching the terms in the query string with the
contents of text files in the corpus. Relevance is determined by
ranking the number of search term occurrences, or 'hits', in each
file. The file with the most hits will appear at the top of the
results list. When multiple search terms are present in the query
string, the ranking is based on the term with the least number of
hits.
If multiple terms are present in the query string, there is an
implicit logical AND between each term. A file will only match the
string if it has hits for all of the terms.
By default a search term will match any occurrence of a word that
starts with the term. For example, a search for cat will match the
words cat, cats, catlike, cateye, cattle, and so on. If only exact
matches are desired, that can be accomplished by putting quotes around
the term. If quotes are placed around multiple words -- a phrase --
instead of a single word, the algorithm will match the exact
phrase. However, the current implementation is very slow for this type
of search. The gopher client will disconnect before most searches of
this type will complete.