Skip to main content.
home | support | download

Back to List Archive

Question on searching for double quote chars...

From: David Wood <dwood(at)not-real.inter.nl.net>
Date: Wed Oct 15 2003 - 13:48:39 GMT
Using SWISH-E 2.4.0-pr4 on HP-UX 11, with the following config file:

IndexDir /var/opt/web/rrc-web/htdocs/rrc/performance/html_src/cbo/products
IndexFile /var/opt/web/rrc-web/swish-e.v2/cbo_products.index
IndexName "Index file of HPIS-sourced content."
IndexReport 3
FileRules pathname contains /CVS
FollowSymLinks yes
ReplaceRules remove "/var/opt/web/rrc-web/htdocs"
IndexOnly .htm .html
MetaNames keywords
MinWordLimit 2
MaxWordLimit 30
WordCharacters abcdefghijklmnopqrstuvwxyz0123456789_\|/-+?!@$%^'"`~.[]{}()


this search:

swish-e -w 'vf17' -f cbo_products.index

gives the following result:

# SWISH format: 2.4.0-pr4
# Search words: vf17
# Removed stopwords:
# Number of hits: 1
# Search time: 0.000 seconds
# Run time: 0.060 seconds
1000 
/rrc/performance/html_src/cbo/products/7DD2278DE65B38AD85256D8E006EB286/7DD2278DE65B38AD85256D8E006EB286_1.html 
"hp pavilion vf17 17" LCD flat panel display" 16196
.


But if I try to search on:

swish-e -w '17"' -f cbo_products.index

even though the string '17"' is in the title, I get:

# SWISH format: 2.4.0-pr4
# Search words: 17"
# Removed stopwords:
err: Syntax error in query (missing end quote or unbalanced parenthesis?)
.


and if I try to search on:

swish-e -w '17\"' -f cbo_products.index

I get:

# SWISH format: 2.4.0-pr4
# Search words: 17\"
# Removed stopwords:
err: No search words specified
.


Do these results make sense?  The double quote char is listed in 
WordCharacters, so shouldn't one or both of these search strings return a 
result?


Thanks for any assistance,

David Wood













*********************************************************************
Due to deletion of content types excluded from this list by policy,
this multipart message was reduced to a single part, and from there
to a plain text message.
*********************************************************************
Received on Wed Oct 15 13:48:45 2003