Re: 'missing content stream' issuing expungeDeletes=true

2015-09-02 Thread Erick Erickson
bq: When we found out the document has a docfreq of 2, we did a query on the document's product id and indeed 2 documents were returned. We suspect 1 of them is deleted but not remove from the index. This is totally inconsistent with how Solr works _if_ these documents had the same value for whate

Re: 'missing content stream' issuing expungeDeletes=true

2015-09-02 Thread Derek Poh
There are around 6+ millions documents in the collection. Each document (or product record) is unqiue in the collection. When we found out the document has a docfreq of 2, we did a query on the document's product id and indeed 2 documents were returned. We suspect 1 of them is deleted but not re

Re: 'missing content stream' issuing expungeDeletes=true

2015-09-01 Thread Erick Erickson
How many document total in your corpus? And how many do you intend to have? My point is that if you are testing this with a small corpus, the results are very likely different than when you test on a reasonable corpus. So if you expect your "real" index will contain many more docs than what you're

Re: 'missing content stream' issuing expungeDeletes=true

2015-09-01 Thread Derek Poh
Erick Yes, we see documents changing their position in the list due to having deleted docs. In our searchresult,weapply higher boost (bq) to a group of matched documents to have them display at the top tier of the result. At times 1 or 2 of these documentsare not return in the top tier, they a

Re: 'missing content stream' issuing expungeDeletes=true

2015-09-01 Thread Erick Erickson
Derek: Why do you care? What evidence do you have that this matters _practically_? If you've look at scoring with a small number of documents, you'll see significant differences due to deleted documents. In most cases, as you get a larger number of documents the ranking of documents in an index w

Re: 'missing content stream' issuing expungeDeletes=true

2015-09-01 Thread Upayavira
I wonder if this resolves it [1]. It has been applied to trunk, but not to the 5.x release branch. If you needed it in 5.x, I wonder if there's a way that particular choice could be made configurable. Upayavira [1] https://issues.apache.org/jira/browse/LUCENE-6711 On Tue, Sep 1, 2015, at 02:43 A

Re: 'missing content stream' issuing expungeDeletes=true

2015-08-31 Thread Derek Poh
Hi Upayavira In fact we are using optimize currently but was advised to use expunge deletes as it is less resource intensive. So expunge deletes will only remove deleted documents, it will not merge all index segments into one? If we don't use optimize, the deleted documents in the index will

Re: 'missing content stream' issuing expungeDeletes=true

2015-08-31 Thread Upayavira
If you really must expunge deletes, use optimize. That will merge all index segments into one, and in the process will remove any deleted documents. Why do you need to expunge deleted documents anyway? It is generally done in the background for you, so you shouldn't need to worry about it. Upayav

Re: Missing content stream

2010-10-08 Thread Erick Erickson
Thanks for letting us know. Erick On Fri, Oct 8, 2010 at 2:21 PM, Dennis Brundage wrote: > > Thanks to anyone who spent time looking at this. In the future, should > anyone else run across this, I got it working by adding the "name" > attribute > to the tag. Final script looks like: > > > >

Re: Missing content stream

2010-10-08 Thread Dennis Brundage
Thanks to anyone who spent time looking at this. In the future, should anyone else run across this, I got it working by adding the "name" attribute to the tag. Final script looks like: http://localhost.:4747/solr/update"; enctype="multipart/form-data" accept-charset="UTF-8" method="POST">

Re: Missing content stream

2010-10-08 Thread Dennis Brundage
Markus Jelsma-2 wrote: > > I don't know if sending the wrong content type results in this error > message > but you must send the proper content type header, text/xml. > Thanks Markus. I thought uploading a file from an HTML form required "multipart/form-data". Anyway, I did try changing

Re: Missing content stream

2010-10-08 Thread Markus Jelsma
I don't know if sending the wrong content type results in this error message but you must send the proper content type header, text/xml. On Friday, October 08, 2010 03:57:33 pm Dennis Brundage wrote: > I am just getting started with Solr and have successfully run through the > tutorial. I now h

Re: Missing content Stream

2008-05-17 Thread Chris Hostetter
: *Here is what i get on the console,* : : May 12, 2008 9:57:20 AM org.apache.solr.core.SolrException log : : SEVERE: org.apache.solr.core.SolrException: missing content stream ...ah, ok -- when you say "on the console" you mean the console where you are running jetty and getting the log outpu

Re: Missing content Stream

2008-05-12 Thread Ricky
Hi Hoss, * 1) Posting the exact same question twice because you didn't get a reply in the first 8 hours isn't going to encourage people to reply faster. best case scenerio: you waste people's time they could be spending reading another email; worst case scnerio: you irk people and put them in a ba

Re: Missing content Stream

2008-05-10 Thread Chris Hostetter
1) Posting the exact same question twice because you didn't get a reply in the first 8 hours isn't going to encourage people to reply faster. best case scenerio: you waste people's time they could be spending reading another email; worst case scnerio: you irk people and put them in a bad mood

Re: Missing content Stream

2008-05-09 Thread Ricky
Yes, i have followed the directions on http://wiki.apache.org/solr/UpdateCSV. i am learning Solr from the mentioned webpage. Can it be a problem with CURL? /Rickey On Fri, May 9, 2008 at 10:15 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote: > make sure you a

Re: Missing content Stream

2008-05-09 Thread Ryan McKinley
make sure you are following all the directions on: http://wiki.apache.org/solr/UpdateCSV in particular check "Methods of uploading CSV records" On May 9, 2008, at 9:58 AM, Ricky wrote: Hello, Am a newbie to SOLR. I am trying to learn it now. i have downloaded apache-solr 1.2.0.zip file. I hav

Re: missing content stream - simple tab file

2008-03-24 Thread tim robertson
Thanks, You are correct... ' ' around the URL solved it - schoolboy error thanks Tim On Mon, Mar 24, 2008 at 9:48 PM, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > Tim: double check that solr-user mail isn't showing up in your spam > folder, you may need to whitelist it since it identifies i

Re: missing content stream - simple tab file

2008-03-24 Thread Chris Hostetter
Tim: double check that solr-user mail isn't showing up in your spam folder, you may need to whitelist it since it identifies itself as bulk mail. : And I am uploading using this command: : curl : http://localhost:8983/solr/update/csv?fieldnames=name,count&separator=%09&escape=\&header=false--d

Re: Missing Content Stream

2008-01-21 Thread Ismail Siddiqui
I am trying solrj to index.. using follwing code String url = "http://localhost:8080/solr";; SolrServer server = new CommonsHttpSolrServer( url ); its giving error that undifined symbol for constructor(string). can somoen tell me why this constructor thrwoing error while in source file i can c

Re: Missing Content Stream

2008-01-15 Thread Ismail Siddiqui
thanks brian and otis, i will definitely try solrj.. but actaually now the problem is resolved by setting content length in header i was missing it c.setRequestProperty("Content-Length", xmlText.length()+""); but now its not throwing any error but not indexing the document either.. do I have to set

Re: Missing Content Stream

2008-01-15 Thread Brian Whitman
On Jan 15, 2008, at 1:50 PM, Ismail Siddiqui wrote: Hi Everyone, I am new to solr. I am trying to index xml using http post as follows Ismail, you seem to have a few spelling mistakes in your xml string. "fiehld, nadme" etc. (a) try fixing them, (b) try solrj instead, I agree w/ otis.

Re: Missing Content Stream

2008-01-15 Thread Otis Gospodnetic
Ismail, use Solrj instead, you'll be much happier. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ismail Siddiqui <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Tuesday, January 15, 2008 1:50:25 PM Subject: Missing Content Stream Hi