Found 2,541 results
in 0.02 seconds.
Displaying page 1 of 255,
sorted by
-
Sent 2005-07-18 by Rajesh Munavalli <rajeshm@...>
At what point do I add n-grams? Does the order in which I add n-grams
affect exact phrase queries later? My questions are
(1) Should I add all the 1-grams followed by 2-grams followed by
3-grams..etc sentence by sentence OR
(2) Add all the 1 grams of entire document first before starting 2-grams
for the entire document?
What is the general accepted notion of adding n-grams of a document?
thanks,
Rajesh
http://find.searchhub.org/document/46047fcdef4c9116
-
Sent 2008-08-13 by Ryan McKinley <ryantxu@...>
I'm looking for a way to get common word groups within documents.
That is, what are the top two, three, ... n word groups within the
index.
I was messing with indexing adjacent words together (sorry about the
earlier commit)... is this a reasonable approach? Any other ideas for
pulling…
http://find.searchhub.org/document/4ef435c23afc6bb5
-
Sent 2012-05-23 by parkhekishor <kishor.parkhe@...>
Hi,
I made n-gram analyzer, but I am not able to set threshold during searching
corresponding to index.please help me.
-----
REACH YOUR GOAL BEFORE GOAL KICKS YOU.
Thanks.
--
View this message in context: http://lucene.472066.n3.nabble.com/N-Gram-Threshould-tp3985614.html
Sent from the Lucene - Java Developer mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org
http://find.searchhub.org/document/ccbe20675a0448e0
-
Sent 2011-05-30 by Denis Kuzmenok <forward_ua@...>
I have a database with n-gram field, about 5 millions documents. QTime
is about 200-1000 ms, database is not optimized because it must reply
to queries everytime and data are updated often. Is it normal?
Solr: 3.1, java -Xms2048M -Xmx4096M
Server: i7, 12Gb
http://find.searchhub.org/document/1bd6068ae767b81f
-
Sent 2011-05-30 by Tor Henning Ueland <tor.henning@...>
2011/5/30 Denis Kuzmenok :
Start by optimizing it, it wont "stop working" due to a optimize. Some
other vital info is the size of the index, disk type used etc (SSD,
SATA, IDE..)
--
Mvh
Tor Henning Ueland
http://find.searchhub.org/document/31e46c3a6882ea2d
-
Sent 2011-05-30 by Otis Gospodnetic <otis_gospodnetic@...>
Denis,
Also, what are your documents and queries like? Maybe give a few examples so we
can help.
Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
----- Original Message ----
http://find.searchhub.org/document/8c1fd15aae3f8d8c
-
Sent 2011-12-05 by Herman Kiefus <hermank@...>
Has anyone ever considered using a multi-valued, n-grammed, text field containing geohashes to support a 'near-some-coordinate' type of search?
I've read that geohashes will mostly have same/similar prefixes when several of them are 'close' together, but have no idea what 'mostly' comes out to. …
http://find.searchhub.org/document/6375b0e47aab5535
-
Sent 2005-07-18 by Rajesh Munavalli <rajeshm@...>
At what point do I add n-grams? Does the order in which I add n-grams
affect exact phrase queries later? My questions are
(1) Should I add all the 1-grams followed by 2-grams followed by
3-grams..etc sentence by sentence OR
(2) Add all the 1 grams of entire document first before starting 2-grams
for the entire document?
What is the general accepted notion of adding n-grams of a document?
thanks,
Rajesh
http://find.searchhub.org/document/f857d2fbe391bb
-
Sent 2005-07-18 by Andy Roberts <mail@...>
On Monday 18 Jul 2005 21:27, Rajesh Munavalli wrote:
I can't see any real advantage of storing n-grams explicitly. Just index the
document and use phrase queries. Order is significant with phrase queries if
I recall correctly, although you can use SpanNearQueries to look for
unordered ngrams, although I don't know why you would want to!
Perhaps if you explain a little more about what you are trying to achieve more
generally, we can confirm that you don't need to mess with explicit indexing
of indexing.
Andy
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
http://find.searchhub.org/document/3fb0e7426995446b
-
Sent 2005-07-18 by Rajesh Munavalli <rajeshm@...>
Intution behind adding n-grams is to boost naturally occurring larger
phrases versus using phrase queries. For example, if I am searching for
"united states of america", I want the search results to return the
documents ordered as follows
Rank 1 - Documents containing all the words occurring together
Rank 2 - Documents containing maximum number of words in the same
sentence
Rank 3 - Documents containing all the words but some might appear in the
same sentence some may not
Rank 4 - Documents containig atleast one or two words
If we have a n-gram index, most probably document talking about
http://find.searchhub.org/document/f31d7c9cefc4ea55