Mailing List
Home
Forum Home
Maven - Project building tool
Axis - Java SOAP implementation
Lucene - Full-featured text search engine APIs
Cocoon - MVC web framework based on XML/XSL
Fop - Create PDF, PCL, PS, SVG, XML driven by XSL formatting objects.
Log4J - A log library
POI - Java Excel, Word and other Microsoft Office files manipulating library
Oracle database error code ...
Subjects
log4j warning: No appenders could be found
java security AccessControlException: access denied (java io FilePermission clie
java lang InstantiationException: org apache tools ant Main
Apache Axis Tutorial
Subject: Struts <logic iterate >
log4j properties How to parse outpu to multiple files
configuring log4j with BEA Weblogic 8 1
How to use XSL FOP Java together
JSP precompile
Proposal: Adding jar manifest classpath in jar and war plugins
Servlet File Download dialog problem (IE6,Adobe 6 0)
java security AccessControlException: access denied (java io FilePermission
Unsupported major minor version 48 0 problem while running the an
   telope task
Subject: axis wsdl2java Ant Task usage
net sf hibernate MappingException: Error reading resource: test/User hbm xml
Building EAR ANT Script for websphere 5 0
CREATING WAR Files
Classpath problem
jsp data into Excel
Jboss 3 2 3+ vs Tomcat Axis Question
RE: How to include jars and add them into the MANIFEST MF/Class Path
attribute
Printing problem
Subject: InstantiationException
Couldn 't find trusted certificate
Please : How can one install ant 1 6 0 under Eclipse 2 1 ?
Excel: Too many different cell formats
Subject: AXIS: tomcat timeout ?
1 3 final: now giving me java io FileNotFoundException (Too many
open files)
XDoclet, Struts and Maven: Where to start? SOLUTION
Subject: Running junit tests fails
 
Lucene
Page 3 of 247 1   2   3   4   5   6   7   8   9   10   Next 10   Next 100  

Subject: blank space before special characters

Hello I have the following problem with my lucene index. When indexing fields containing special characters (like &) a blank space is inserted before the special character. For example the

Group by in Lucene ?

Solr has an issue outstanding right now that implements something that may be close to what you want. They are calling it Field Collapsing. See https //issues.apache.org/jira/browse/SOLR-236 -G

Subject: Re: Does someone know how to sort the hits list by a specified document fiel

Hi. Just add a Sort object to the search. Sort sort new Sort(sortField !ascending) Hits[] hits searcher.search(query sort) Kindly //Marcus On 11/5/07 jackxin <jackxin2100@(protected) > wrote

Subject: Pointers on Messaging Server and Lucene.

Hi Folks we have started on a project ( evaluating ) using Lucene with our Groupware services ( Messaging / Calendar and IM for now ). Are there any good pointers or links we can reference

Subject: Does someone know how to sort the hits list by a specified document field?

Does someone know how to sort the hits list by a specified document field? Even if the field is numeric or datetime etc. -- View this message in context http //www.nabble.com/Does-someone-know-how

Group by in Lucene ?

Hi. I have a situation where I 'm searching amongst some 100K feeds and only want one result per site in return. I have developed a really simple method of grouping which just scrolls through the resu

Subject: How do we limit the growth of a Lucene Index?

Hi We have been developing an enterprise logging service at the Wachovia bank. The logs (Busines application error) for all the bank related applications are consolidated at one single locatio

EdgeNGramTokenizer

http //www.shifttab.cn 8001/wiki 2007/10/31 Marco <spinmar@(protected) > > > It seems that the problem is when I add the token created by > EdgeNGramTokenizer in in the index. > If the token contains

EdgeNGramTokenizer

http //www.shifttab.cn 8001/wiki 2007/10/31 Marco <spinmar@(protected) > > > It seems that the problem is when I add the token created by > EdgeNGramTokenizer in in the index. > If the token contains

Subject: Re: RE : Re: problem undestanding the hits.score

I strongly recommend against this. Simple word counts are a poor measure of relevance. Which is why Lucene doesn 't score that way. Do you have an example showing why the default scoring is inadequate

Subject: Re: Best way to count tokens

This works and I can reuse token streams. But why TokenStream.reset() does not work which was in my earlier case. Is this a marker method in TokenStream without implementation and CachingTokenFilter i

Subject: Re: how to use Field.TermVector

On Nov 2 2007 at 4 37 AM Jamal H Tandina wrote > Hi > > Iam having problem using Field.TermVector i dont know how to use > it. does some one have an exemple or an address how to use the >

Subject: Re: problem understanding the hits.score

I found this page extremely helpful in finding out EXACTLY what Lucene is doing (and how if I wanted to to change it). Like Erik said it does pretty darn well just as it is. I 'm not sure if anyon

Subject: Re: RE : Re: problem undestanding the hits.score

That is already in the similarity formula in tf term documents that have more occurrences of a given term receive a higher score. Jamal H Tandina wrote > < < < < > > If you want to give priority to

Subject: RE : Re: problem undestanding the hits.score

< < < < If you want to give priority to documents that are larger like z1 you should change the DefaultSimilarity (at index time) more exactly the method public float lengthNorm(String fieldNa

Subject: Re: RE : Re: problem undestanding the hits.score

For your specific problem you need to change the DefaultSimilarity only at index time because the lengthNorm is written to the index when is created. So... first you 'll need to extend the DefaultSi

Subject: RE : Re: problem undestanding the hits.score

Thank you for your reply How can i change the defaultSimilarity in the indexing and the searching do you have an example or an url how to set the Similarity ? http //lucene.zones.apache.org 8080/

Subject: Re: problem undestanding the hits.score

Try too look at Similarity there you will find thinks about the scoring. Your query is more "similar " with the shorter document. If you have 2 documents with a field body first with words "red flow

Subject: how to use Field.TermVector

Hi Iam having problem using Field.TermVector i dont know how to use it. does some one have an exemple or an address how to use the termVector in indexing an searching? doc.add(new Field( "title " h

Subject: Parsing text containing forward slash and wildcard

Hi Using StandardAnalyzer when we indexed the text "/123xcv " QueryParser.parse() produced "123xcv ". During searching using the same Analyzer parsing a search text of "/123 " produced "123 " but p

Subject: Re: problem undestanding the hits.score

There are many factors that go into scoring. Erick gave a nice link that will help you out. Also check out Query.explain(). That will tell you how your score was resolved. To give you a start no

Hits.score mystery

One of many options is to copy the StandardAnalyzer but change it so that + and # are considered letters. Just add + and # to the LETTER definition in the JavaCC file if you are using a release or

Subject: Re: Best way to count tokens

reset is optional. StandardAnalyzer does not implement it. Check out CachingTokenFilter and wrap StandardAnalzyer in it. Cool Coder wrote > Currently I have extended StandardAnalyzer and counting t

Hits.score mystery

Well you might have to pre-process your strings before you give them to an analyzer. Or roll your own analyzer. What you 're asking for in effect is an analyzer "that does exactly what I want it to

Subject: Re: problem undestanding the hits.score

What leads you to expect that ordering? Scoring in Lucene is NOT simply counting the number of times a word appears. That said I really have no clue how the scoring algorithm works since it 's always

Subject: Re: Best way to count tokens

Currently I have extended StandardAnalyzer and counting tokens in the following way. But the index is not getting created though I call tokenStream.reset(). I am not sure whether reset() on token st

Subject: Re: Best way to count tokens

1 nov 2007 kl. 18.09 skrev Cool Coder > prior to adding into index Easiest way out would be to add the document to a temporary index and extract the term frequency vector. I would recommend usin

Subject: Re: Best way to count tokens

This is what I am looking for prior to adding into index. SO that it can help me to display in my site first 10 tokens that has got maximum occurences in my index. In otherword user can add weightag

Hits.score mystery

The reason seems to be that I found I needed to implement an analyser that lowercases terms as well as *not* ignoring trailing characters such as # +. (i.e. I needed to match C# and C++) public fina

Subject: Re: Question regarding proximity search

On Thursday 01 November 2007 10 45 Sonu SR wrote > I got confused of proximity search. I am getting different results for > the queries TTL "test device "~2 and TTL "device test "~2 Order is signifi
Page 3 of 247 1   2   3   4   5   6   7   8   9   10   Next 10   Next 100