Book text with chapter line number

2013-04-23 Thread Jason Funk
Hello.

I'm trying to figure out whether Solr will work for a new project that I want 
to build. At its heart it's a book text searching application. Each book is 
broken into chapters and each chapter is broken into lines. I want to be able 
to search these books, return the relevant sections, and display the results 
with chapter and line number. I'm not sure how to structure my data so that 
it's both efficient and functional. I could simply treat each line of text as 
a document, which would provide some of the functionality, but what if the 
search query spanned two lines? Then it seems the passage the user was 
searching for wouldn't be returned. I could treat each book as a document and 
use highlighting to find the context, but that seems to limit 
weighting/results for best matches and makes it difficult to find 
chapter/line numbers. What is the best way to do this with Solr?
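
To make the line-per-document idea and its two-line problem concrete, here is a
minimal sketch in plain Python. It does not call Solr at all: the field names
(id, book, chapter, line, text), the helper names, and the sample sentence are
assumptions chosen for illustration, and the substring test is only a rough
stand-in for a phrase query.

```python
def index_lines(book, chapters):
    """Turn {chapter_no: [line, ...]} into one record per line of text."""
    docs = []
    for chapter_no, lines in chapters.items():
        for line_no, text in enumerate(lines, start=1):
            docs.append({
                "id": f"{book}-{chapter_no}-{line_no}",  # hypothetical id scheme
                "book": book,
                "chapter": chapter_no,
                "line": line_no,
                "text": text,
            })
    return docs


def phrase_hits(docs, phrase):
    """Naive stand-in for a phrase query: substring match within one record."""
    needle = phrase.lower()
    return [(d["chapter"], d["line"]) for d in docs if needle in d["text"].lower()]


# Hypothetical sample: one chapter whose sentence wraps across two lines.
chapters = {
    1: [
        "It was the best of times, it was the",
        "worst of times, it was the age of wisdom",
    ],
}
docs = index_lines("tale", chapters)

# A phrase that sits inside a single line is found, with chapter and line.
within_line = phrase_hits(docs, "best of times")

# A phrase that straddles the line break matches no single record.
across_lines = phrase_hits(docs, "the worst of times")
```

Each record carries its own chapter and line number, so a hit can be displayed
directly, but the second lookup comes back empty because no single record
contains the whole phrase.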

Is there a better tool to use to solve my problem?

Re: Book text with chapter line number

2013-04-23 Thread Jason Funk
Is there a better tool than Solr to use for my situation?


On Apr 23, 2013, at 5:04 PM, Jack Krupansky wrote:

> There is no simple, obvious, and direct approach, right out of the box. Sure, 
> you can highlight passages of raw text, right out of the box, but that won't 
> give you chapters, pages, and line numbers. To do all of that, you would have 
> to either:
> 
> 1. Add chapter, page, and line number as part of the payload for each word, 
> and add some custom document transformers to access that information.
> or
> 2. Index each line as a separate Solr document, with fields for book, 
> chapter, page, and line number.
> 
> -- Jack Krupansky
> 
> -Original Message- From: Jason Funk
> Sent: Tuesday, April 23, 2013 5:02 PM
> To: solr-user@lucene.apache.org
> Subject: Book text with chapter line number
> 
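
For the whole-book approach discussed in the thread, the missing piece is
getting from a match position back to a chapter and line number. The sketch
below shows one way that lookup could be done outside Solr, assuming the
character offset of a match is available from somewhere. Here it comes from
Python's str.find purely as a stand-in; how such an offset would be obtained
from Solr is not covered. The function names and sample text are hypothetical.

```python
from bisect import bisect_right


def build_offset_table(chapters):
    """Join all lines into one text and record where each line starts."""
    starts, locations, parts = [], [], []
    offset = 0
    for chapter_no, lines in chapters.items():
        for line_no, text in enumerate(lines, start=1):
            starts.append(offset)
            locations.append((chapter_no, line_no))
            parts.append(text)
            offset += len(text) + 1  # +1 for the space that joins lines
    return " ".join(parts), starts, locations


def locate(offset, starts, locations):
    """Map a character offset in the joined text to (chapter, line)."""
    return locations[bisect_right(starts, offset) - 1]


# Hypothetical sample text; the first sentence wraps across two lines.
chapters = {
    1: [
        "It was the best of times, it was the",
        "worst of times, it was the age of wisdom",
    ],
    2: ["There were a king with a large jaw"],
}
full_text, starts, locations = build_offset_table(chapters)

phrase = "the worst of times"
match_at = full_text.find(phrase)  # stand-in for an offset reported by a search
first = locate(match_at, starts, locations)
last = locate(match_at + len(phrase) - 1, starts, locations)
```

Because the text is searched as one unit, the phrase that straddles the line
break is found, and the offset table reports that it starts on one line and
ends on the next. The cost is the extra bookkeeping kept alongside the index.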