Java Mailing List Archive: nutch-user.lucene

Is there some arbitrary limit on content stored for use by summaries?

Tim Redding



We have a long page that appears in the search results but the summary
never contains the search terms. Why is this?

If we move the text containing the search terms further up the page, they
are displayed in the summary, so it's obviously related to some limit
imposed somewhere. I've looked through all the configuration options, and
none appear to change anything that sounds related to this.

We use Nutch 1.0, and the page in question is 8.7 KB in size.

Any help please?


Tim Redding
Senior Java Developer
Tribal DDB
12 Bishop's Bridge Road
London W2 6AA
T: +44 (0)20 7258 4517 | F: +44 (0)20 7258 4253
