Indexing HTML table into SOLR
Hi, Does one of you have some pointers (articles, papers, etc...) or experience to share about the right way for indexing the html tables content into Solr Documents? Thanks! Benjamin
"Avoiding" a schema.xml
Hi, I am interested to index some documents in Solr, as I did in Lucene. I mean: giving via solrJ all the information about the field I am adding (Tokenize, store, facet etc...) can we do that? Or is it mandatory to define a schema on the collection? Thanks a lot! Benjamin
Re: "Avoiding" a schema.xml
Thanks! Indeed, one of my issues is that I can not know about the fields to be indexed before seeing (and making some entity extraction) on the browsed documents. It is the reason I thought to avoid the schema definition ... The schema API sounds interesting! Does it exist via SolrJ? Many thanks! Benjamin On Thu, Apr 30, 2015 at 6:27 PM, Erick Erickson wrote: > Could you explain a bit more _why_ you want to do this? As you're > probably well aware, there > are multiple ways to shoot yourself in the foot in lower-level Lucene. > > If you have some situation where you're creating indexes on the fly > that may vary then > you could consider the "managed schema" that lets you create a schema > via API calls, > then you wouldn't need to mess with editing the schema.xml file for > instance. > > Best, > Erick > > On Thu, Apr 30, 2015 at 8:12 AM, Shawn Heisey wrote: > > On 4/30/2015 8:43 AM, Sznajder ForMailingList wrote: > >> I am interested to index some documents in Solr, as I did in Lucene. > >> > >> I mean: giving via solrJ all the information about the field I am adding > >> (Tokenize, store, facet etc...) > >> > >> can we do that? Or is it mandatory to define a schema on the collection? > > > > All that information is defined on the server. You do not have direct > > access to the Lucene index - Solr is intended as an abstraction, so the > > admin and the users/applications that use Solr do not need to understand > > all the low-level details that go into a Lucene application. The admin > > just has to deal with configuration files like schema.xml, and the users > > just need to know what fields are in each document and how the query > > syntax works. Deeper Lucene knowledge is helpful, but not strictly > > necessary. > > > > If you want Lucene-level control, you'll need to write the search server > > yourself using Lucene. If you have very specific needs that Solr's > > approach can't satisfy, you always have this option. > > > > The newest Solr versions do have an example of what's known as a > > "data-driven" schema, or schemaless mode. In this mode, Solr builds up > > the schema automatically, guessing the field type based on what kind of > > data is the first to arrive for each field. This is good for > > prototyping, but for production use, I would want to be in full manual > > control of the schema. > > > > Thanks, > > Shawn > > >
Creating a new collection via solrj
Hi, I would like to create programmatically a new collection with a given Schema (the schema.xml file is in my java project under a folder configuration/, for example) However, I did not find a solrj example describing these steps. If one of you could help.. thanks! Benjamin
HW requirements
Hi , Could you give me some hints wrt HW requirements for Solr if I need to index about 400 Gigas of text? Thanks Benjamin
UI Velocity
Hi I tried to use the UI Velocity from Solr. Could you please help in the following: - how do I define the fields from my schema that I would like to be displayed as facet in the UI? Thanks! Benjamin
UI Admin - and "stored=false" fields
Hi I am indexing some content under "text" field. In the schema.xml "text" field is defined as : However, when I am looking to the documents via the UI http://localhost:8983/solr/#/sec_600b/query I see the text field content in the returned documents. Do I make a mistake? Or this behavior (i.e. no-stored fields are displayed in admin ui) is expected? thanks! Benjamin.
Velocity UI and hyperlink
Hi I would like one of the fields, I display in the results of Velocity UI, to be a hyperlink. In my example, I am storing a field "url" containing the link to the online page of the indexed document and I would like to have this displayed field a hyperlink to this page. Could you please indicate me waht should I change to get that? thanks! Benjamin
Re: Velocity UI and hyperlink
Thanks!! However, each time I change a *.vm file, I do not succeed to see the change on my browser until, I delete + recreate the collectoin and re-index. Isn't there a way to immediately see the display change? Best regards On Mon, Jun 8, 2015 at 11:46 PM, Erik Hatcher wrote: > Benjamin - > > The templates for VelocityResponseWriter (/browse, etc) are under > conf/velocity. Find the template that generates the piece you want to > affect (which may be hit.vm or hit_.vm? - depends on which > version of Solr you’re using and which configuration you’ve started with to > be more precise) and modify it to render a hyperlink around > $doc.getFirstValue(“url”), maybe something like: > > http://www.lucidworks.com > > > > > > On Jun 8, 2015, at 4:29 PM, Sznajder ForMailingList < > bs4mailingl...@gmail.com> wrote: > > > > Hi > > > > I would like one of the fields, I display in the results of Velocity UI, > to > > be a hyperlink. > > > > In my example, I am storing a field "url" containing the link to the > online > > page of the indexed document and I would like to have this displayed > field > > a hyperlink to this page. > > > > Could you please indicate me waht should I change to get that? > > > > thanks! > > > > Benjamin > >
Re: Velocity UI and hyperlink
Hi I am using 5.1 Currently, I defined a directory solr-conf/ . Under this directory, I have a velocity directory containing my different *.vm files. When I create a collection, I am creating via bin\solr create -c COLL_NAME -d PATH_TO_SOLR_CONF Your indication was helpful : changing the file copied under solr\server was the right way! thanks again! Ben On Tue, Jun 9, 2015 at 12:25 AM, Erik Hatcher wrote: > What version of Solr? And where is the file you’re changing? > > With Solr 5.2, one example of what you’re trying to do is under > example/files. In the README we have this: > > bin/solr start > -Dvelocity.template.base.dir=/example/files/conf/velocity/ > > When you create a collection it clones the configuration (in 5x; under > server/solr/…) so if you wanted to in-place edit you’d edit those files > rather than the original configuration which would require a collection > re-create. > > With the above command-line, you can have templates anywhere you like and > edit them in place, and they override any in the configuration of the Solr > collection. > > See > https://cwiki.apache.org/confluence/display/solr/Response+Writers#ResponseWriters-VelocityResponseWriter > for perhaps some more details. If there’s any way I can make this easier, > let me know. > > If the above info doesn’t work or apply because you’re on a different > version of Solr, provide more details and I’ll help from there. > > — > Erik Hatcher, Senior Solutions Architect > http://www.lucidworks.com > > > > > > On Jun 8, 2015, at 5:07 PM, Sznajder ForMailingList < > bs4mailingl...@gmail.com> wrote: > > > > Thanks!! > > > > However, each time I change a *.vm file, I do not succeed to see the > change > > on my browser until, I delete + recreate the collectoin and re-index. > > > > Isn't there a way to immediately see the display change? > > > > Best regards > > > > On Mon, Jun 8, 2015 at 11:46 PM, Erik Hatcher > > wrote: > > > >> Benjamin - > >> > >> The templates for VelocityResponseWriter (/browse, etc) are under > >> conf/velocity. Find the template that generates the piece you want to > >> affect (which may be hit.vm or hit_.vm? - depends on which > >> version of Solr you’re using and which configuration you’ve started > with to > >> be more precise) and modify it to render a hyperlink around > >> $doc.getFirstValue(“url”), maybe something like: > >> > >> http://www.lucidworks.com > >> > >> > >> > >> > >>> On Jun 8, 2015, at 4:29 PM, Sznajder ForMailingList < > >> bs4mailingl...@gmail.com> wrote: > >>> > >>> Hi > >>> > >>> I would like one of the fields, I display in the results of Velocity > UI, > >> to > >>> be a hyperlink. > >>> > >>> In my example, I am storing a field "url" containing the link to the > >> online > >>> page of the indexed document and I would like to have this displayed > >> field > >>> a hyperlink to this page. > >>> > >>> Could you please indicate me waht should I change to get that? > >>> > >>> thanks! > >>> > >>> Benjamin > >> > >> > >
Re: Velocity UI and hyperlink
Hi Erik When running solr in simple mode on my laptop, I found the *vm files under under server/solr/COLLECTION_NAME/conf however, when running on my server in cloud mode (with only one node), I do not find these conf/ directory under server. Does it sit on another place? thanks! On Tue, Jun 9, 2015 at 3:34 AM, Erik Hatcher wrote: > Do note that changing the file copied under solr/server is risky, as you > may delete and recreate the collection and lose your changes. If you use > the system property trick mentioned below, you can develop without having > to recreate the collection but once you do it’ll incorporate the changes. > > — > Erik Hatcher, Senior Solutions Architect > http://www.lucidworks.com <http://www.lucidworks.com/> > > > > > > On Jun 8, 2015, at 5:37 PM, Sznajder ForMailingList < > bs4mailingl...@gmail.com> wrote: > > > > Hi > > > > I am using 5.1 > > > > Currently, I defined a directory solr-conf/ . > > Under this directory, I have a velocity directory containing my different > > *.vm files. > > > > When I create a collection, I am creating via > > bin\solr create -c COLL_NAME -d PATH_TO_SOLR_CONF > > > > Your indication was helpful : changing the file copied under solr\server > > was the right way! > > > > thanks again! > > > > Ben > > > > On Tue, Jun 9, 2015 at 12:25 AM, Erik Hatcher > > wrote: > > > >> What version of Solr? And where is the file you’re changing? > >> > >> With Solr 5.2, one example of what you’re trying to do is under > >> example/files. In the README we have this: > >> > >>bin/solr start > >> > -Dvelocity.template.base.dir=/example/files/conf/velocity/ > >> > >> When you create a collection it clones the configuration (in 5x; under > >> server/solr/…) so if you wanted to in-place edit you’d edit those files > >> rather than the original configuration which would require a collection > >> re-create. > >> > >> With the above command-line, you can have templates anywhere you like > and > >> edit them in place, and they override any in the configuration of the > Solr > >> collection. > >> > >> See > >> > https://cwiki.apache.org/confluence/display/solr/Response+Writers#ResponseWriters-VelocityResponseWriter > >> for perhaps some more details. If there’s any way I can make this > easier, > >> let me know. > >> > >> If the above info doesn’t work or apply because you’re on a different > >> version of Solr, provide more details and I’ll help from there. > >> > >> — > >> Erik Hatcher, Senior Solutions Architect > >> http://www.lucidworks.com > >> > >> > >> > >> > >>> On Jun 8, 2015, at 5:07 PM, Sznajder ForMailingList < > >> bs4mailingl...@gmail.com> wrote: > >>> > >>> Thanks!! > >>> > >>> However, each time I change a *.vm file, I do not succeed to see the > >> change > >>> on my browser until, I delete + recreate the collectoin and re-index. > >>> > >>> Isn't there a way to immediately see the display change? > >>> > >>> Best regards > >>> > >>> On Mon, Jun 8, 2015 at 11:46 PM, Erik Hatcher > >>> wrote: > >>> > >>>> Benjamin - > >>>> > >>>> The templates for VelocityResponseWriter (/browse, etc) are under > >>>> conf/velocity. Find the template that generates the piece you want to > >>>> affect (which may be hit.vm or hit_.vm? - depends on which > >>>> version of Solr you’re using and which configuration you’ve started > >> with to > >>>> be more precise) and modify it to render a hyperlink around > >>>> $doc.getFirstValue(“url”), maybe something like: > >>>> > >>>> http://www.lucidworks.com > >>>> > >>>> > >>>> > >>>> > >>>>> On Jun 8, 2015, at 4:29 PM, Sznajder ForMailingList < > >>>> bs4mailingl...@gmail.com> wrote: > >>>>> > >>>>> Hi > >>>>> > >>>>> I would like one of the fields, I display in the results of Velocity > >> UI, > >>>> to > >>>>> be a hyperlink. > >>>>> > >>>>> In my example, I am storing a field "url" containing the link to the > >>>> online > >>>>> page of the indexed document and I would like to have this displayed > >>>> field > >>>>> a hyperlink to this page. > >>>>> > >>>>> Could you please indicate me waht should I change to get that? > >>>>> > >>>>> thanks! > >>>>> > >>>>> Benjamin > >>>> > >>>> > >> > >> > >
Highlight in Velocity UI on Google Chrome
Hi, I was testing the highlight feature and played with the techproducts example. It appears that the highlighting works on Mozilla Firefox, but not on Google Chrome. For your information Benjamin
"More" facets or "Less" facets in the Velocity UI
Hi Is there a way to add a link "More" (or "Less") under the list of facet's values, in aim to display more facets values? Thanks! Benjamin