Hello
I'm indexing xml files with xpathEntityProcessor, and for some hundreads
documents on 12 millions are not processed.
When I tried to index only one of the KO documents it doesn't either index.
So it's not a matter of big number of documents.
We tried to do the xslt transformation external
Hello,
I have a solr index (12 M docs, 45Go) with facets, and I'm trying to
improve facet queries performances.
1/ I tried to use docvalue on facet fields, it didn't work well
2/ I tried facet.threads=-1 in my querie, and worked perfectely (from more
15s to 2s for longest queries)
3/ I'm tryi
tp://tokee.github.io/lucene-solr/
Right now we use solr 4.6, and we soon deliver our relsease, and I'm
afraid I won't have time to try this time, but I can try for next release
(next month I think).
Thanks very much again
Jerome
Dupont
jerome.dupont_at#bnf.fr
Participez à l'acquisition d'un Trésor national - Le manuscrit royal de
François I er Avant d'imprimer, pensez à l'environnement.
Hello,
I'm trying to index documents with Data import handler and solrcloud at the
same time. (huge collection, need to make parallel indexing)
First I had a dih configuration whichs works with solr standalone.
(Indexing for two month every week)
I've transformed my configuration to "cloudify"
Hello again
Finally, I found the problem.
It seems that
_ The indexation request was done with an http GET and not with POST,
because I was lauching it from a favorite in my navigator.
Launching indexation on my documents by the admin interface made indexation
work.
_ Antoher problem was that som
Hello again,
I still trying to index a with solr cloud and dih. I can index but it seems
that indexation is done on only 1 shard. (my goal was to parallelze that to
go fast)
This my conf:
I have 2 tomcat instances,
One with zookeeper embedded in solr 4.4.0 started and 1 shard (port 8080)
The othe
It works
I've done what you said:
_ In my request to get list of documents, I add a where clause filtering on
the select getting the documents to index:
where noticebib.numnoticebib LIKE '%${dataimporter.request.suffixeNotice}'"
_ And I called my dih on each shard with the parameter suffixeNotice
Hello,
I have a question, I index documents and a small part them are skipped, (I
am in onError="skip" mode)
I'm trying to get a list of them, in order to analyse what's worng with
these documents
Is there a mean to get the list of skipped documents, and some more
information (my onError="skip" i
Hello,
We are trying to use data import handler and particularly on a collection
which contains many file (one xml per document)
Our configuration works for a small amount of files, but dataimport fails
with OutofMemory Error when running it on 10M files (in several
directories...)
This is it
The configuraiton works with LineEntityProcessor, with few documents (havn
(t test with many documents yet.
For information this the config
... fields de
Hello,
I want to use a index a huge list of xml file.
_ Using FileListEntityProcessor causes an OutOfMemoryException (too many
files...)
_ I can do it using a LineEntityProcessor reading a list of files,
generated externally, but I would prefer to generate the list in SOLR
_ So to avoid to mantai
Hi,
Thanks for your anwser, it made me go ahead.
The name of the entity was not good, not consistent with schema
Now the first entity works fine: the query is done to the database and
returns the good result.
The problem is that the second entity, which is a XPathEntityProcessor
entity, doesn't r
Thanks very much, it works, with dataSource (capital S) !!!
Finally, I didn't have to define a "CHEMINRELATIF" field in the
configuration, it's working without it.
This is the definive working configuration:
Hello,
I use solr and dataimport to index xml files with a dtd.
The dtd is referenced like this
Previously we were using solr4 in a tomcat container.
During the import process, solr tries to validate the xml file with the
dtd.
To find it we were defining -Duser.dir=pathToDtD and solr could find
14 matches
Mail list logo