get the position of matched word in the response

2019-08-04 Thread eli chen
Hi, I'm new to Solr, so please be patient.
How can I get the position of a matched word in the results?

And no, I'm not talking about highlighting the words. I'm talking about getting
the position of the word in the content.

I have a field, content, which I query with q=content:"some_word"

The content field is not stored, but it is
Indexed + Tokenized + Multivalued + TermVector Stored + Store Offset With
TermVector + Store Position With TermVector

Thanks for the help


Re: get the position of matched word in the response

2019-08-04 Thread eli chen
Every content field is actually the content of a book.
So let's say someone searches for the word "hello" and I find this word in the
book "the story jungle" at position 199 (counted by word, not by character).

Now I can look in my database and check the OCR data for this word in this book
(and show a highlight on the page image, etc.).

My DB is kind of like this (simplified):

book  word  ocr
----  ----  -------
th    199   1,1,1,1

That's the reason I need the offset of the word.

And by the way, the content field is just a big text_general field.

Thanks again
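
To illustrate the lookup described above, here is a minimal Python sketch. It assumes the database can be modelled as a mapping from (book, word position) to an OCR box; the names `ocr_by_position` and `ocr_box_for` are hypothetical, reading "1,1,1,1" as a four-number box is an assumption, and the sample row mirrors the simplified table rather than a real schema.

```python
# Hypothetical stand-in for the DB table: (book, word position) -> OCR box.
# Treating the OCR value as a 4-tuple is an assumption made for illustration.
ocr_by_position = {
    ("the story jungle", 199): (1, 1, 1, 1),
}


def ocr_box_for(book, position):
    """Return the OCR box for the word at `position` in `book`, or None."""
    return ocr_by_position.get((book, position))


# The search side only has to supply the book and the word position.
print(ocr_box_for("the story jungle", 199))
```

This is why the word position, rather than the highlighted text, is the piece of information the search needs to return.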

On Sun, 4 Aug 2019 at 14:30, Erick Erickson <erickerick...@gmail.com> wrote:

> Eli:
>
> What problem are you trying to solve? There’s no really convenient way to
> do this that I know of, although it could be done, probably with some
> Lucene-level code.
>
> This may be an XY problem, where you're asking how to do X (find the
> position of the matched word) because you think it’ll help solve some
> problem Y. What’s “Y”? Perhaps there’s an easier way to solve that problem
> if we knew what it was….
>
> Best,
> Erick


Re: get the position of matched word in the response

2019-08-04 Thread eli chen
Thanks.
Of course they search for phrases.
And if they searched for "hello monkey" and Solr found "hello my monkey", I
want to get the positions of "hello" and "monkey" (the words the user actually
typed in the search).

And by the way, thank you all, but I found
https://github.com/dbmdz/solr-ocrhighlighting which I think can help me a
lot. I'll also look into the payload approach (I'm new to Solr).



On Sun, 4 Aug 2019 at 15:40, Alexandre Rafalovitch <arafa...@gmail.com> wrote:

> What happens if they search for "hello monkey" and match against
> "hello my monkeys"? What should it return? Why does your database not
> contain "hello" instead of 199?
>
> I am saying this because if your clients are truly searching for just one
> word, then Solr may be overkill for you. Perhaps you are looking
> for just "indexOf" within a string with a parallel offset->OCR data
> structure. So, there is a hidden question in there of "why did you
> choose Solr".
>
> Then, there is the point that Solr searches words/numbers/geospatial data
> but returns documents. So, sometimes, you need to understand what a
> "document" is for your business case, and transform your content for
> that. E.g. if you are really just searching for one word, then maybe
> you index your whole book as a bunch of documents, each containing a
> word, its OCR offset information, and its book id. And if it is a couple
> of words, maybe you have a secondary field with the context of that
> sentence (in index-only form).
>
> Don't be afraid to abandon your first schema. Your business
> requirement is different enough.
>
> Regards,
>    Alex.
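
The "one document per word" idea above could be sketched roughly as follows in Python. This is only an illustration under stated assumptions: the field names (`id`, `book_id`, `word`, `position`, `ocr`) and the function `book_to_word_docs` are hypothetical, and the input is assumed to be an ordered list of (word, OCR box) pairs per book.

```python
def book_to_word_docs(book_id, words_with_ocr):
    """Turn one book into a list of per-word documents.

    `words_with_ocr` is assumed to be an ordered list of (word, ocr_box)
    pairs. All field names below are hypothetical, not a real schema.
    """
    docs = []
    for position, (word, ocr_box) in enumerate(words_with_ocr):
        docs.append({
            "id": f"{book_id}_{position}",  # unique per word occurrence
            "book_id": book_id,
            "word": word,
            "position": position,           # 0-based word position
            "ocr": ocr_box,
        })
    return docs


# Made-up sample data, for illustration only.
sample = [("hello", (1, 1, 1, 1)), ("my", (2, 1, 1, 1)), ("monkey", (3, 1, 1, 1))]
for doc in book_to_word_docs("book-1", sample):
    print(doc)
```

With documents shaped like this, each search hit already carries its own position and OCR data, at the cost of a much larger number of documents in the index.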


hi question about solr

2019-12-02 Thread eli chen
Hi, I'm kind of new to Solr, so please be patient.

I'll try to explain what I need and what I'm trying to do.

We have the content of a lot of books, and we want to index it and allow
searching within the books.
When someone searches for a term,
I need to get back the position of the matched word in the book.
For example,
if the book content is "hello my name is jeff" and someone searches for "my",
I want to get back the position of "my" in the content field (which is 1 in
this case).
I tried to do that with payloads, but without success. Another problem I
encountered:
let's say the content field is "hello my name is jeff what is your name".
Now if someone searches for "name", I want to get back the positions of all
occurrences, not just the first one.

Is there any way to do that with Solr without developing new plugins?

Thanks
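
To pin down the expected result, here is a small Python sketch of the behaviour being asked for. It is not how Solr tokenizes or searches; it assumes plain whitespace splitting and case-insensitive matching, and the function name `word_positions` is made up for this example.

```python
def word_positions(content, word):
    """Return the 0-based word positions of every occurrence of `word`.

    Assumes simple whitespace tokenization, which is only an approximation
    of what an analyzer such as the one behind text_general would do.
    """
    tokens = content.lower().split()
    target = word.lower()
    return [i for i, token in enumerate(tokens) if token == target]


print(word_positions("hello my name is jeff", "my"))
print(word_positions("hello my name is jeff what is your name", "name"))
```

The first call gives position 1 for "my", matching the example above; the second returns both occurrences of "name" rather than only the first.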


Re: hi question about solr

2019-12-02 Thread eli chen
yes

On Mon, 2 Dec 2019 at 13:29, Bernd Fehling wrote:

> In short,
>
> you are trying to use an indexer as a full-text search engine, right?
>
> Regards
> Bernd


Re: hi question about solr

2019-12-02 Thread eli chen
First of all, thank you very much. I was looking for a good resource to read
on Solr.

I actually already tried the term vector. But for it to work I had to set
fl=content, which responds with the value of the content field (which is really,
really big)
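
One thing that might be worth trying here, sketched in Python: the Term Vector Component has its own `tv.fl` parameter for choosing which fields to return term vectors for, so `fl` could perhaps be limited to the id field. The snippet only builds the request URL. The core name `books`, the `/tvrh` handler path, and the `id` field are assumptions about the setup, and whether this actually avoids returning the large field would need to be checked against a real index.

```python
from urllib.parse import urlencode


def term_vector_url(base_url, term, field="content", id_field="id"):
    """Build a request URL for Solr's Term Vector Component.

    Assumes a request handler at /tvrh that includes the component;
    the handler path and field names are assumptions, not a known setup.
    """
    params = {
        "q": f'{field}:"{term}"',
        "fl": id_field,          # keep the normal field list small
        "tv": "true",            # enable the Term Vector Component
        "tv.fl": field,          # fields to return term vectors for
        "tv.positions": "true",  # include word positions
        "tv.offsets": "true",    # include character offsets
        "wt": "json",
    }
    return f"{base_url}/tvrh?{urlencode(params)}"


# Hypothetical core name and host.
url = term_vector_url("http://localhost:8983/solr/books", "hello")
print(url)
```

Parsing the response is left out on purpose, since its exact shape depends on the Solr version and response writer settings.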