Problem - Help me with DataImport

2011-03-02 Thread Matias Alonso
Good Morning,


First, sorry for my poor english.


I trying to index “blogs” (rss) to my solr, so I´m using a dataImportHandler
for this.

I can´t index the date and I don´t no how to index static values (constant)
in a Field.

When I make a “Full Import” it doesn´t index the docs; if I delete the line
of date, it´s work.

When I debug with verbose it shows me the right information.


Below you can see my dataImportHandler:







   

   http://locademiaz.wordpress.com/feed/";


processor="XPathEntityProcessor"


transformer="DateFormatTransformer"


forEach="/rss/channel/item">


















   

   







I appreciate your help.

Thank you very much.



Matias.


Re: Problem - Help me with DataImport

2011-03-02 Thread Matias Alonso
Stefan,

Thank you very much! It works perfect...
Any idea for the other question? Someone?


Matias.



2011/3/2 Stefan Matheis 

> Matias,
>
> for indexing constant/static values .. try
> http://wiki.apache.org/solr/DataImportHandler#TemplateTransformer
>
> Regards
> Stefan
>
> On Wed, Mar 2, 2011 at 2:46 PM, Matias Alonso 
> wrote:
> > Good Morning,
> >
> >
> > First, sorry for my poor english.
> >
> >
> > I trying to index “blogs” (rss) to my solr, so I´m using a
> dataImportHandler
> > for this.
> >
> > I can´t index the date and I don´t no how to index static values
> (constant)
> > in a Field.
> >
> > When I make a “Full Import” it doesn´t index the docs; if I delete the
> line
> > of date, it´s work.
> >
> > When I debug with verbose it shows me the right information.
> >
> >
> > Below you can see my dataImportHandler:
> >
> >
> >
> > 
> >
> >
> >
> >   
> >
> >>
> >   pk="link"
> >
> >   url="
> > http://locademiaz.wordpress.com/feed/";
> >
> >
> > processor="XPathEntityProcessor"
> >
> >
> > transformer="DateFormatTransformer"
> >
> >
> > forEach="/rss/channel/item">
> >
> >
> >  dateTimeFormat="EEE,
> > d MMM  HH:mm:ss Z" />
> >
> >
> >
> >
> > 
> >
> >
> > 
> >
> >
> > 
> >
> >
> > 
> >
> >   
> >
> >   
> >
> > 
> >
> >
> >
> >
> >
> > I appreciate your help.
> >
> > Thank you very much.
> >
> >
> >
> > Matias.
> >
>


Help -DIH (mail)

2011-03-04 Thread Matias Alonso
Hi everyone!


 I’m trying to index mails into solr through DHI (based on the
“example-DIH”). For this I´m using my personal email from gmail, but I can´t
index.


Configuration in Data-config .xml:



  



  




When I debug and verbose I get the following messange (only first lines):


 

org.apache.solr.handler.dataimport.DataImportHandlerException: Connection
failed Processing Document # 1 at




I appreciate your help!

Regards,


 Matias.


Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso
Thks Gora,

I forgot "s". Now, doesn´t have error, but not index.
This´s the answer in my command line (windows  :( ).

..
..
..
protocol : imaps
host : imap.gmail.com
folders :
Recibidos,recibidos,RECIBIDOS,inbox.InBox,INBOX,Mail,MAIL,mail,CORREO,
correo,Correo
recurse : false
exclude : []
include : []
batchSize : 100
fetchSize : 32768
read timeout : 6
conection timeout : 3
custom filter :
fetch mail since : Sat Jan 01 00:00:00 GFT 2011

04/03/2011 14:05:40 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/select
params={command=status&qt=/dataimport} st
atus=0 QTime=0
04/03/2011 14:05:40 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/select
params={command=show-config&qt=/dataimpor
t} status=0 QTime=0
04/03/2011 14:05:40 org.apache.solr.handler.dataimport.MailEntityProcessor
conne
ctToMailBox
INFO: Connected to mailbox
04/03/2011 14:05:40 org.apache.solr.update.processor.LogUpdateProcessor
finish
INFO: {} 0 0
04/03/2011 14:05:40 org.apache.solr.handler.dataimport.DocBuilder execute
INFO: Time taken = 0:0:3.31
04/03/2011 14:05:48 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/select
params={command=status&qt=/dataimport} st
atus=0 QTime=0
04/03/2011 14:05:48 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/select
params={command=show-config&qt=/dataimpor
t} status=0 QTime=0

Matias.





2011/3/4 Gora Mohanty 

> On Fri, Mar 4, 2011 at 9:20 PM, Matias Alonso 
> wrote:
> > Hi everyone!
> >
> >
> >  I’m trying to index mails into solr through DHI (based on the
> > “example-DIH”). For this I´m using my personal email from gmail, but I
> can´t
> > index.
>
> Have not used the MailEntityProcessor with Gmail, but some
> points below:
>
> > Configuration in Data-config .xml:
> >
> > 
> >
> >  
> >
> > >
> >  user="m...@gmail.com"
>  ^ I presume that you have put in your actual
>email address here.
> [...]
> >  protocol="imap"/>
>  ^ Shouldn't this be imaps, at least as
> per http://wiki.apache.org/solr/MailEntityProcessor
>
> Regards,
> Gora
>


Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso
Thks Peter,

Yes, gmail gives me imaps (i understood that). So, I tried what you mention
but I had get the original mesange I posted.

Matias.




2011/3/4 Peter Sturge 

> Hi,
>
> You need to put your password in as well. You should use protocol="imap"
> unless your gmail is set for imaps (I don't believe the free gmail gives
> you
> this).
>
>  user="u...@mydomain.com"
>  password="userpwd"
>  host="imap.mydomain.com"
>  include=""
>  exclude=""
>  processor="MailEntityProcessor"
>  protocol="imap"
>   />
>
> HTH
> Peter
>
>
>
> On Fri, Mar 4, 2011 at 4:42 PM, Gora Mohanty  wrote:
>
> > On Fri, Mar 4, 2011 at 9:20 PM, Matias Alonso 
> > wrote:
> > > Hi everyone!
> > >
> > >
> > >  I’m trying to index mails into solr through DHI (based on the
> > > “example-DIH”). For this I´m using my personal email from gmail, but I
> > can´t
> > > index.
> >
> > Have not used the MailEntityProcessor with Gmail, but some
> > points below:
> >
> > > Configuration in Data-config .xml:
> > >
> > > 
> > >
> > >  
> > >
> > > > >
> > >  user="m...@gmail.com"
> > ^ I presume that you have put in your actual
> >email address here.
> > [...]
> > >  protocol="imap"/>
> >  ^ Shouldn't this be imaps, at least as
> > per http://wiki.apache.org/solr/MailEntityProcessor
> >
> > Regards,
> > Gora
> >
>


Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso

  
   
  


2011/3/4 Peter Sturge 

> Hi Matias,
>
> Can you post your data-config.xml? (with disquised names/credentials)
>
> Thanks,
> Peter
>
>
> On Fri, Mar 4, 2011 at 5:13 PM, Matias Alonso  >wrote:
>
> > Thks Peter,
> >
> > Yes, gmail gives me imaps (i understood that). So, I tried what you
> mention
> > but I had get the original mesange I posted.
> >
> > Matias.
> >
> >
> >
> >
> > 2011/3/4 Peter Sturge 
> >
> > > Hi,
> > >
> > > You need to put your password in as well. You should use
> protocol="imap"
> > > unless your gmail is set for imaps (I don't believe the free gmail
> gives
> > > you
> > > this).
> > >
> > > > >  user="u...@mydomain.com"
> > >  password="userpwd"
> > >  host="imap.mydomain.com"
> > >  include=""
> > >  exclude=""
> > >  processor="MailEntityProcessor"
> > >  protocol="imap"
> > >   />
> > >
> > > HTH
> > > Peter
> > >
> > >
> > >
> > > On Fri, Mar 4, 2011 at 4:42 PM, Gora Mohanty 
> wrote:
> > >
> > > > On Fri, Mar 4, 2011 at 9:20 PM, Matias Alonso <
> matiasgalo...@gmail.com
> > >
> > > > wrote:
> > > > > Hi everyone!
> > > > >
> > > > >
> > > > >  I’m trying to index mails into solr through DHI (based on the
> > > > > “example-DIH”). For this I´m using my personal email from gmail,
> but
> > I
> > > > can´t
> > > > > index.
> > > >
> > > > Have not used the MailEntityProcessor with Gmail, but some
> > > > points below:
> > > >
> > > > > Configuration in Data-config .xml:
> > > > >
> > > > > 
> > > > >
> > > > >  
> > > > >
> > > > > > > > >
> > > > >  user="m...@gmail.com"
> > > > ^ I presume that you have put in your actual
> > > >email address here.
> > > > [...]
> > > > >  protocol="imap"/>
> > > >  ^ Shouldn't this be imaps, at least as
> > > > per http://wiki.apache.org/solr/MailEntityProcessor
> > > >
> > > > Regards,
> > > > Gora
> > > >
> > >
> >
>


Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso
Hi Peter,

I test with deltaFetch="false", but doesn´t work :(
I'm using "DataImportHandler Development Console" to index (
http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport);
I'm working with "example-DIH".

thks...



2011/3/4 Peter Sturge 

> Hi Matias,
>
> I haven't seen it in the posts, but I may have missed it -- what is the
> import command you're sending?
> Something like:
> http://localhost:8983/solr/db/dataimport?command=full-import
>
> Can you also test it with deltaFetch="false". I seem to remember having
> some
> problems with delta in the MailEntityProcessor.
>
>
>
> On Fri, Mar 4, 2011 at 6:29 PM, Matias Alonso  >wrote:
>
> > 
> >  
> >>   user="myem...@gmail.com"
> >  password="mypassword"
> >  host="imap.gmail.com"
> >  fetchMailsSince="2011-01-01 00:00:00"
> >  deltaFetch="true"
> >  include=""
> >  exclude=""
> >  recurse="false"
> >
> >
> >
> folders="Recibidos,recibidos,RECIBIDOS,inbox.InBox,INBOX,Mail,MAIL,mail,CORREO,correo,Correo"
> >   includeContent="true"
> >  processAttachments="false"
> >  includeOtherUserFolders="false"
> >  includeSharedFolders="false"
> >  batchSize="100"
> >  processor="MailEntityProcessor"
> >  protocol="imaps" />
> >  
> > 
> >
> > 2011/3/4 Peter Sturge 
> >
> > > Hi Matias,
> > >
> > > Can you post your data-config.xml? (with disquised names/credentials)
> > >
> > > Thanks,
> > > Peter
> > >
> > >
> > > On Fri, Mar 4, 2011 at 5:13 PM, Matias Alonso  > > >wrote:
> > >
> > > > Thks Peter,
> > > >
> > > > Yes, gmail gives me imaps (i understood that). So, I tried what you
> > > mention
> > > > but I had get the original mesange I posted.
> > > >
> > > > Matias.
> > > >
> > > >
> > > >
> > > >
> > > > 2011/3/4 Peter Sturge 
> > > >
> > > > > Hi,
> > > > >
> > > > > You need to put your password in as well. You should use
> > > protocol="imap"
> > > > > unless your gmail is set for imaps (I don't believe the free gmail
> > > gives
> > > > > you
> > > > > this).
> > > > >
> > > > > > > > >  user="u...@mydomain.com"
> > > > >  password="userpwd"
> > > > >  host="imap.mydomain.com"
> > > > >  include=""
> > > > >  exclude=""
> > > > >  processor="MailEntityProcessor"
> > > > >  protocol="imap"
> > > > >   />
> > > > >
> > > > > HTH
> > > > > Peter
> > > > >
> > > > >
> > > > >
> > > > > On Fri, Mar 4, 2011 at 4:42 PM, Gora Mohanty 
> > > wrote:
> > > > >
> > > > > > On Fri, Mar 4, 2011 at 9:20 PM, Matias Alonso <
> > > matiasgalo...@gmail.com
> > > > >
> > > > > > wrote:
> > > > > > > Hi everyone!
> > > > > > >
> > > > > > >
> > > > > > >  I’m trying to index mails into solr through DHI (based on the
> > > > > > > “example-DIH”). For this I´m using my personal email from
> gmail,
> > > but
> > > > I
> > > > > > can´t
> > > > > > > index.
> > > > > >
> > > > > > Have not used the MailEntityProcessor with Gmail, but some
> > > > > > points below:
> > > > > >
> > > > > > > Configuration in Data-config .xml:
> > > > > > >
> > > > > > > 
> > > > > > >
> > > > > > >  
> > > > > > >
> > > > > > > > > > > > >
> > > > > > >  user="m...@gmail.com"
> > > > > > ^ I presume that you have put in your actual
> > > > > >email address here.
> > > > > > [...]
> > > > > > >  protocol="imap"/>
> > > > > >  ^ Shouldn't this be imaps, at least as
> > > > > > per http://wiki.apache.org/solr/MailEntityProcessor
> > > > > >
> > > > > > Regards,
> > > > > > Gora
> > > > > >
> > > > >
> > > >
> > >
> >
>


Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso
Hi Peter,

>From "DataImportHandler Development Console" I made a full-import, but
didn´t work.

Now, I execute "
http://localhost:8983/solr/mail/dataimport?command=full-import"; but nothing
happends; no index; no errors.

thks...

Matias.



2011/3/4 Peter Sturge 

> Hi Mataias,
>
>
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimportaccesses
> the dataimport handler, but you need to tell it to do something by
> sending a command:
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport
> ?command=full-import<
> http://localhost:8983/solr/db/dataimport?command=full-import>
>
> If you haven't already, have a look at:
>
>
> http://www.lucidimagination.com/search/out?u=http%3A%2F%2Fwiki.apache.org%2Fsolr%2FDataImportHandler
>
> It gives very thorough and useful advice on getting the DIH working.
>
> Peter
>
>
>
> On Fri, Mar 4, 2011 at 6:59 PM, Matias Alonso  >wrote:
>
> > Hi Peter,
> >
> > I test with deltaFetch="false", but doesn´t work :(
> > I'm using "DataImportHandler Development Console" to index (
> > http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport
> );
> > I'm working with "example-DIH".
> >
> > thks...
> >
> >
> >
> > 2011/3/4 Peter Sturge 
> >
> > > Hi Matias,
> > >
> > > I haven't seen it in the posts, but I may have missed it -- what is the
> > > import command you're sending?
> > > Something like:
> > > http://localhost:8983/solr/db/dataimport?command=full-import
> > >
> > > Can you also test it with deltaFetch="false". I seem to remember having
> > > some
> > > problems with delta in the MailEntityProcessor.
> > >
> > >
> > >
> > > On Fri, Mar 4, 2011 at 6:29 PM, Matias Alonso  > > >wrote:
> > >
> > > > 
> > > >  
> > > >> > >   user="myem...@gmail.com"
> > > >  password="mypassword"
> > > >  host="imap.gmail.com"
> > > >  fetchMailsSince="2011-01-01 00:00:00"
> > > >  deltaFetch="true"
> > > >  include=""
> > > >  exclude=""
> > > >  recurse="false"
> > > >
> > > >
> > > >
> > >
> >
> folders="Recibidos,recibidos,RECIBIDOS,inbox.InBox,INBOX,Mail,MAIL,mail,CORREO,correo,Correo"
> > > >   includeContent="true"
> > > >  processAttachments="false"
> > > >  includeOtherUserFolders="false"
> > > >  includeSharedFolders="false"
> > > >  batchSize="100"
> > > >  processor="MailEntityProcessor"
> > > >  protocol="imaps" />
> > > >  
> > > > 
> > > >
> > > > 2011/3/4 Peter Sturge 
> > > >
> > > > > Hi Matias,
> > > > >
> > > > > Can you post your data-config.xml? (with disquised
> names/credentials)
> > > > >
> > > > > Thanks,
> > > > > Peter
> > > > >
> > > > >
> > > > > On Fri, Mar 4, 2011 at 5:13 PM, Matias Alonso <
> > matiasgalo...@gmail.com
> > > > > >wrote:
> > > > >
> > > > > > Thks Peter,
> > > > > >
> > > > > > Yes, gmail gives me imaps (i understood that). So, I tried what
> you
> > > > > mention
> > > > > > but I had get the original mesange I posted.
> > > > > >
> > > > > > Matias.
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > 2011/3/4 Peter Sturge 
> > > > > >
> > > > > > > Hi,
> > > > > > >
> > > > > > > You need to put your password in as well. You should use
> > > > > protocol="imap"
> > > > > > > unless your gmail is set for imaps (I don't believe the free
> > gmail
> > > > > gives
> > > > > > > you
> > > > > > > this).
> > > > > > >
> > > > > > > > > > > > >  user="u...@mydomain.com"
> > > > > > &g

Re: Help -DIH (mail)

2011-03-04 Thread Matias Alonso
I´m using the trunk.

Thanks Peter for your preoccupation!

Matias.



2011/3/4 Peter Sturge 

> Hi Matias,
>
> What version of Solr are you using? Are you running any patches (maybe
> SOLR-2245)?
>
> Thanks,
> Peter
>
>
>
> On Fri, Mar 4, 2011 at 8:25 PM, Matias Alonso  >wrote:
>
> > Hi Peter,
> >
> > From "DataImportHandler Development Console" I made a full-import, but
> > didn´t work.
> >
> > Now, I execute "
> > http://localhost:8983/solr/mail/dataimport?command=full-import"; but
> > nothing
> > happends; no index; no errors.
> >
> > thks...
> >
> > Matias.
> >
> >
> >
> > 2011/3/4 Peter Sturge 
> >
> > > Hi Mataias,
> > >
> > >
> > >
> >
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimportaccesses
> > > the dataimport handler, but you need to tell it to do something by
> > > sending a command:
> > >
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport
> > > ?command=full-import<
> > > http://localhost:8983/solr/db/dataimport?command=full-import>
> > >
> > > If you haven't already, have a look at:
> > >
> > >
> > >
> >
> http://www.lucidimagination.com/search/out?u=http%3A%2F%2Fwiki.apache.org%2Fsolr%2FDataImportHandler
> > >
> > > It gives very thorough and useful advice on getting the DIH working.
> > >
> > > Peter
> > >
> > >
> > >
> > > On Fri, Mar 4, 2011 at 6:59 PM, Matias Alonso  > > >wrote:
> > >
> > > > Hi Peter,
> > > >
> > > > I test with deltaFetch="false", but doesn´t work :(
> > > > I'm using "DataImportHandler Development Console" to index (
> > > >
> > http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport
> > > );
> > > > I'm working with "example-DIH".
> > > >
> > > > thks...
> > > >
> > > >
> > > >
> > > > 2011/3/4 Peter Sturge 
> > > >
> > > > > Hi Matias,
> > > > >
> > > > > I haven't seen it in the posts, but I may have missed it -- what is
> > the
> > > > > import command you're sending?
> > > > > Something like:
> > > > > http://localhost:8983/solr/db/dataimport?command=full-import
> > > > >
> > > > > Can you also test it with deltaFetch="false". I seem to remember
> > having
> > > > > some
> > > > > problems with delta in the MailEntityProcessor.
> > > > >
> > > > >
> > > > >
> > > > > On Fri, Mar 4, 2011 at 6:29 PM, Matias Alonso <
> > matiasgalo...@gmail.com
> > > > > >wrote:
> > > > >
> > > > > > 
> > > > > >  
> > > > > >> > > > >   user="myem...@gmail.com"
> > > > > >  password="mypassword"
> > > > > >  host="imap.gmail.com"
> > > > > >  fetchMailsSince="2011-01-01 00:00:00"
> > > > > >  deltaFetch="true"
> > > > > >  include=""
> > > > > >  exclude=""
> > > > > >  recurse="false"
> > > > > >
> > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> folders="Recibidos,recibidos,RECIBIDOS,inbox.InBox,INBOX,Mail,MAIL,mail,CORREO,correo,Correo"
> > > > > >   includeContent="true"
> > > > > >  processAttachments="false"
> > > > > >  includeOtherUserFolders="false"
> > > > > >  includeSharedFolders="false"
> > > > > >  batchSize="100"
> > > > > >  processor="MailEntityProcessor"
> > > > > >  protocol="imaps" />
> > > > > >  
> > > > > > 
> > > > > >
> > > > > > 2011/3/4 Peter Sturge 
> > > > > >
> > > > > > > Hi Matias,
> > > > > > >
> > > > > > > Can you post your data-config.xml? (with disquised
> > > names/credentials)
> > &

Re: Help -DIH (mail)

2011-03-09 Thread Matias Alonso
Hi Peter,

When I execute the commands you mentioned, nothing happend.
Below I show you the comands executed and the answered of they.
Sorry, but I don´t know how to enable the log; my jre is by default.
Rememeber I´m running the example-DIH (trunk\solr\example\example-DIH\solr);
java -Dsolr.solr.home="./example-DIH/solr/" -jar start.jar.



Import:
http://localhost:8983/solr/mail/dataimport?command=status<
http://localhost:8983/solr/mail/dataimport?command=full-import>


-

0
15

-

-

data-config.xml


-

full-import<http://localhost:8983/solr/mail/dataimport?command=full-import>

idle


-

This response format is experimental.  It is likely to change in the future.





Status:
http://localhost:8983/solr/mail/dataimport?command=status<
http://localhost:8983/solr/mail/dataimport?command=full-import>



-

0
0

-

-

data-config.xml


-

status<http://localhost:8983/solr/mail/dataimport?command=full-import>

idle


-

This response format is experimental.  It is likely to change in the future.






Thank you for your help.

Matias.






2011/3/4 Peter Sturge 

> Can you try this:
>
> Issue a full import command like this:
>
> http://localhost:8983/solr/dataimport?command=full-import<
> http://localhost:8983/solr/db/dataimport?command=full-import>
>
> (There is no core name here - if you're using a core name (db?), then add
> that in between solr/ and /dataimport)
>
> then, run:
> http://localhost:8983/solr/dataimport?command=status<
> http://localhost:8983/solr/db/dataimport?command=full-import>
>
> This will show the results of the previous import. Has it been rolled-back?
> If so, there might be something in the log if it's enabled (see your jre's
> lib/logging.properties file).
> (you won't see any errors unless you run the status command - that's where
> they're stored)
>
> HTH
> Peter
>
>
>
>
> On Sat, Mar 5, 2011 at 12:46 AM, Matias Alonso  >wrote:
>
> > I´m using the trunk.
> >
> > Thanks Peter for your preoccupation!
> >
> > Matias.
> >
> >
> >
> > 2011/3/4 Peter Sturge 
> >
> > > Hi Matias,
> > >
> > > What version of Solr are you using? Are you running any patches (maybe
> > > SOLR-2245)?
> > >
> > > Thanks,
> > > Peter
> > >
> > >
> > >
> > > On Fri, Mar 4, 2011 at 8:25 PM, Matias Alonso  > > >wrote:
> > >
> > > > Hi Peter,
> > > >
> > > > From "DataImportHandler Development Console" I made a full-import,
> but
> > > > didn´t work.
> > > >
> > > > Now, I execute "
> > > > http://localhost:8983/solr/mail/dataimport?command=full-import"; but
> > > > nothing
> > > > happends; no index; no errors.
> > > >
> > > > thks...
> > > >
> > > > Matias.
> > > >
> > > >
> > > >
> > > > 2011/3/4 Peter Sturge 
> > > >
> > > > > Hi Mataias,
> > > > >
> > > > >
> > > > >
> > > >
> > >
> >
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimportaccesses
> > > > > the dataimport handler, but you need to tell it to do something by
> > > > > sending a command:
> > > > >
> > >
> http://localhost:8983/solr/mail/admin/dataimport.jsp?handler=/dataimport
> > > > > ?command=full-import<
> > > > > http://localhost:8983/solr/db/dataimport?command=full-import>
> > > > >
> > > > > If you haven't already, have a look at:
> > > > >
> > > > >
> > > > >
> > > >
> > >
> >
> http://www.lucidimagination.com/search/out?u=http%3A%2F%2Fwiki.apache.org%2Fsolr%2FDataImportHandler
> > > > >
> > > > > It gives very thorough and useful advice on getting the DIH
> working.
> > > > >
> > > > > Peter
> > > > >
> > > > >
> > > > >
> > > > > On Fri, Mar 4, 2011 at 6:59 PM, Matias Alonso <
> > matiasgalo...@gmail.com
> > > > > >wrote:
> > > > >
> > > > > > Hi Peter,
> > > > > >
> > > > > > I test with deltaFetch="false", but doesn´t work :(
> > > > > > I'm using "DataImportHandler Development Console" to index (
> > > > > >
> > > >
> > http://localhost:8983/solr/mail/admin/

Re: Help -DIH (mail)

2011-03-09 Thread Matias Alonso
1 11:52:03 org.apache.solr.search.SolrIndexSearcher warm
INFO: autowarming result for Searcher@1cee792 main

documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
09/03/2011 11:52:03 org.apache.solr.core.QuerySenderListener newSearcher
INFO: QuerySenderListener sending requests to Searcher@1cee792 main
09/03/2011 11:52:03 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=null path=null
params={start=0&event=newSearcher&q=solr&rows=10} hits=0 status=0 QTime=0
09/03/2011 11:52:03 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=null path=null
params={start=0&event=newSearcher&q=rocks&rows=10} hits=0 status=0 QTime=0
09/03/2011 11:52:03 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=null path=null
params={event=newSearcher&q=static+newSearcher+warming+query+from+solrconfig.xml}
hits=0 status=0 QTime=0
09/03/2011 11:52:03 org.apache.solr.core.QuerySenderListener newSearcher
INFO: QuerySenderListener done.
09/03/2011 11:52:03 org.apache.solr.core.SolrCore registerSearcher
INFO: [mail] Registered new searcher Searcher@1cee792 main
09/03/2011 11:52:03 org.apache.solr.search.SolrIndexSearcher close
INFO: Closing Searcher@9a18a0 main

fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}

filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}

queryResultCache{lookups=0,hits=0,hitratio=0.00,inserts=5,evictions=0,size=5,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}

documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
09/03/2011 11:52:03 org.apache.solr.handler.dataimport.SolrWriter
readIndexerProperties
INFO: Read dataimport.properties
09/03/2011 11:52:03 org.apache.solr.handler.dataimport.SolrWriter persist
INFO: Wrote last indexed time to dataimport.properties
09/03/2011 11:52:03 org.apache.solr.update.processor.LogUpdateProcessor
finish
INFO: {deleteByQuery=*:*,optimize=} 0 0
09/03/2011 11:52:03 org.apache.solr.handler.dataimport.DocBuilder execute
INFO: Time taken = 0:0:2.359



09/03/2011 11:54:58 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/dataimport params={command=status} status=0
QTime=0



Thks,

Matias.





2011/3/9 Peter Sturge 

> Hi,
>
> You've included some output in your message, so I presume something
> *did* happen when you ran the 'status' command (but it might not be
> what you wanted to happen :-)
>
> If you run:
> http://localhost:8983/solr/mail/dataimport?command=status
>
> and you get something like this back:
> idle
> 
> 
>
> It means that no full-import or delta-import has been run during the
> life of the JVM Solr session.
>
> You should try running:
>http://localhost:8983/solr/mail/dataimport?command=full-import
>
> Then run:
>   http://localhost:8983/solr/mail/dataimport?command=status
>
> to see the status of the full-import (busy, idle, error, rolled back etc.)
>
> You can enable java logging by editing your JRE's lib/logging.properties
> file.
>
> Something like this should give you some log files:
> handlers= java.util.logging.FileHandler
> .level= INFO
> java.util.logging.FileHandler.pattern = ./logs/mylogs%d.log
> java.util.logging.FileeHandler.level = INFO
> java.util.logging.FileHandler.limit = 50
> java.util.logging.FileHandler.count = 1
> java.util.logging.FileHandler.formatter = java.util.logging.SimpleFormatter
>
> NOTE: Make sure the 'logs' folder exists (in your $cwd) before you
> start, or you'll get an error.
>
> HTH
> Peter
>
>
>
>
> On Wed, Mar 9, 2011 at 12:47 PM, Matias Alonso 
> wrote:
> > Hi Peter,
> >
> > When I execute the commands you mentioned, nothing happend.
> > Below I show you the comands executed and the answered of they.
> > Sorry, but I don´t know how to enable the log; my jre is by default.
> > Rememeber I´m running the example-DIH
> (trunk\solr\example\example-DIH\solr);
> > java -Dsolr.solr.home="./example-DIH/solr/" -jar start.jar.
> >
> >
> >
> > Import:
> > http://localhost:8983/solr/mail/dataimport?command=status<
> > http://localhost:8983/solr/mail/dataimport?command=full-import>
> >
> > 
> > -
> > 
> > 0
> > 15
> > 
> > -
&

Re: Help -DIH (mail)

2011-03-09 Thread Matias Alonso
Log:
09/03/2011 11:54:58 org.apache.solr.core.SolrCore execute
INFO: [mail] webapp=/solr path=/dataimport params={command=status} status=0
QTime=0


XML

-

0
0

-

-

data-config.xml


status
idle

-

0
0
0
2011-03-09 11:52:01
-

Indexing completed. Added/Updated: 0 documents. Deleted 0 documents.

2011-03-09 11:52:03
2011-03-09 11:52:03
0
0:0:2.359

-

This response format is experimental.  It is likely to change in the future.




Thks,

Matias.

2011/3/9 Peter Sturge 

> Hi,
>
> When you ran the status command, what was the output?
>
>
> On Wed, Mar 9, 2011 at 2:55 PM, Matias Alonso 
> wrote:
> > Peter,
> >
> > You´re right; may be I expose wrong because of my english.
> > I done everything you told me. I think that no find the "folder" when
> index.
> > What you thinking about?
> > Below I show to you part of the log.
> >
> >
> >
> > 09/03/2011 11:52:01 org.apache.solr.core.SolrCore execute
> > INFO: [mail] webapp=/solr path=/dataimport params={command=full-import}
> > status=0 QTime=0
> > 09/03/2011 11:52:01 org.apache.solr.handler.dataimport.DataImporter
> > doFullImport
> > INFO: Starting Full Import
> > 09/03/2011 11:52:01 org.apache.solr.handler.dataimport.SolrWriter
> > readIndexerProperties
> > INFO: Read dataimport.properties
> > 09/03/2011 11:52:01 org.apache.solr.update.DirectUpdateHandler2 deleteAll
> > INFO: [mail] REMOVING ALL DOCUMENTS FROM INDEX
> > 09/03/2011 11:52:01 org.apache.solr.core.SolrDeletionPolicy onInit
> > INFO: SolrDeletionPolicy.onInit: commits:num=1
> >commit{dir=D:\Search
> >
> Plugtree\trunk\solr\example\example-DIH\solr\mail\data\index,segFN=segments_c,version=1298912662799,generation=12,filenames=[segments_c]
> > 09/03/2011 11:52:01 org.apache.solr.core.SolrDeletionPolicy updateCommits
> > INFO: newest commit = 1298912662799
> > 09/03/2011 11:52:01
> org.apache.solr.handler.dataimport.MailEntityProcessor
> > logConfig
> > INFO: user : myem...@gmail.com
> > pwd : mypass
> > protocol : imaps
> > host : imap.gmail.com
> > folders :
> >
> Recibidos,recibidos,RECIBIDOS,inbox.InBox,INBOX,Mail,MAIL,mail,CORREO,correo,Correo,Mail,mail,MAIL
> > recurse : false
> > exclude : []
> > include : []
> > batchSize : 100
> > fetchSize : 32768
> > read timeout : 6
> > conection timeout : 3
> > custom filter :
> > fetch mail since : Thu Mar 03 00:00:00 GFT 2011
> >
> > 09/03/2011 11:52:03
> org.apache.solr.handler.dataimport.MailEntityProcessor
> > connectToMailBox
> > INFO: Connected to mailbox
> > 09/03/2011 11:52:03 org.apache.solr.handler.dataimport.DocBuilder finish
> > INFO: Import completed successfully
> > 09/03/2011 11:52:03 org.apache.solr.update.DirectUpdateHandler2 commit
> > INFO: start
> >
> commit(optimize=true,waitFlush=false,waitSearcher=true,expungeDeletes=false)
> > 09/03/2011 11:52:03 org.apache.solr.core.SolrDeletionPolicy onCommit
> > INFO: SolrDeletionPolicy.onCommit: commits:num=2
> >commit{dir=D:\Search
> >
> Plugtree\trunk\solr\example\example-DIH\solr\mail\data\index,segFN=segments_c,version=1298912662799,generation=12,filenames=[segments_c]
> >commit{dir=D:\Search
> >
> Plugtree\trunk\solr\example\example-DIH\solr\mail\data\index,segFN=segments_d,version=1298912662800,generation=13,filenames=[segments_d]
> > 09/03/2011 11:52:03 org.apache.solr.core.SolrDeletionPolicy updateCommits
> > INFO: newest commit = 1298912662800
> > 09/03/2011 11:52:03 org.apache.solr.search.SolrIndexSearcher 
> > INFO: Opening Searcher@1cee792 main
> > 09/03/2011 11:52:03 org.apache.solr.update.DirectUpdateHandler2 commit
> > INFO: end_commit_flush
> > 09/03/2011 11:52:03 org.apache.solr.search.SolrIndexSearcher warm
> > INFO: autowarming Searcher@1cee792 main from Searcher@9a18a0 main
> >
> >
> fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
> > 09/03/2011 11:52:03 org.apache.solr.search.SolrIndexSearcher warm
> > INFO: autowarming result for Searcher@1cee792 main
> >
> >
> fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
> > 09/03/2011 11:52:03 org.apache.solr.search.SolrIndexSearcher warm
> > INFO: autowarming Searcher@1cee792 main from Searcher@9a18a0 main
> >
> >
> filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTim

email - DIH

2011-03-21 Thread Matias Alonso
Hi,


I’m using Data Import Handler for index emails.

The problem is that nota ll the emails was indexed When I do a full import.

Someone have any idea?


Regards,

-- 
Matias.


Re: email - DIH

2011-03-22 Thread Matias Alonso
Thank you very much for your answer Erick.


My apologies for the previous email; my problem is that I don´t speak
English very well and I´m new in the world of mailing list.


The problem is that I´m indexing emails throw Data import Handler using
Gmail with imaps; I do this for search on email list in the future. The
emails are indexed partiality and I can´t found the problem of why don´t
index all of the emails.



Below I show you de configuration of my DIH.






   







The date of my emails is later to “2010-01-01 00:00:00”.




I´ve done a full import and no errors were found, but in the status I saw
that was added 28 documents, and in the console, I found 35 messanges.

Below I show you the status screen, first, and then part of the console
output.



Status:





0

1







data-config.xml





status

idle





0

28

0

2011-03-22 15:55:12



Indexing completed. Added/Updated: 28 documents. Deleted 0 documents.



2011-03-22 15:55:20

2011-03-22 15:55:20

28

0:0:8.520





This response format is experimental.  It is likely to change in the future.







…”

Mar 22, 2011 3:55:14 PM
org.apache.solr.handler.dataimport.MailEntityProcessor connectToMailBox

INFO: Connected to mailbox

Mar 22, 2011 3:55:15 PM
org.apache.solr.handler.dataimport.MailEntityProcessor$FolderIterator next

INFO: Opened folder : inbox

Mar 22, 2011 3:55:15 PM
org.apache.solr.handler.dataimport.MailEntityProcessor$FolderIterator next

INFO: Added its children to list  :

Mar 22, 2011 3:55:15 PM
org.apache.solr.handler.dataimport.MailEntityProcessor$FolderIterator next

INFO: NO children :

Mar 22, 2011 3:55:16 PM
org.apache.solr.handler.dataimport.MailEntityProcessor$MessageIterator


INFO: Total messages : 35

Mar 22, 2011 3:55:16 PM
org.apache.solr.handler.dataimport.MailEntityProcessor$MessageIterator


INFO: Search criteria applied. Batching disabled

Mar 22, 2011 3:55:19 PM org.apache.solr.handler.dataimport.DocBuilder finish

INFO: Import completed successfully

“…



Regards,

Matias.





2011/3/22 Erick Erickson 

> Not unless you provide a lot more data. Have you
> inspected the Solr logs and seen any anomalies?
>
> Please review:
> http://wiki.apache.org/solr/UsingMailingLists
>
> Best
> Erick
>
> On Mon, Mar 21, 2011 at 3:56 PM, Matias Alonso 
> wrote:
> > Hi,
> >
> >
> > I’m using Data Import Handler for index emails.
> >
> > The problem is that nota ll the emails was indexed When I do a full
> import.
> >
> > Someone have any idea?
> >
> >
> > Regards,
> >
> > --
> > Matias.
> >
>


Re: email - DIH

2011-03-23 Thread Matias Alonso
Hi Gora,

I appreciate your help.

I´ve done what you said but if omit "fetchMailsSince" "full-import" doesn´t
work.

This´s the messenge on the console ..."SEVERE: Full Import
failed:org.apache.solr.handler.dataimport.DataImportHandlerException:
Invalid value for fetchMailSince:  Processing Document # 1"...

The email I use for this was created at the begin of this month.


Regards,
Matias.



2011/3/23 Gora Mohanty 

> On Tue, Mar 22, 2011 at 9:38 PM, Matias Alonso 
> wrote:
> [...]
> > The problem is that I´m indexing emails throw Data import Handler using
> > Gmail with imaps; I do this for search on email list in the future. The
> > emails are indexed partiality and I can´t found the problem of why don´t
> > index all of the emails.
> [...]
> > I´ve done a full import and no errors were found, but in the status I saw
> > that was added 28 documents, and in the console, I found 35 messanges.
> [...]
>
> > INFO: Total messages : 35
> >
> > Mar 22, 2011 3:55:16 PM
> > org.apache.solr.handler.dataimport.MailEntityProcessor$MessageIterator
> > 
> >
> > INFO: Search criteria applied. Batching disabled
> [...]
>
> The above seems to indicate that the MailEntityProcessor does find
> all 35 messages, but indexes only 28. Are you sure that all 35 are
> since 2010-01-01 00:00:00? Could you try without fetchMailsSince?
>
> Regards,
> Gora
>



-- 
Matias.


Re: email - DIH

2011-03-23 Thread Matias Alonso
Hi Gora,

Also, all the emails were received after that date.

Regards,

Matias.



2011/3/23 Gora Mohanty 

> On Tue, Mar 22, 2011 at 9:38 PM, Matias Alonso 
> wrote:
> [...]
> > The problem is that I´m indexing emails throw Data import Handler using
> > Gmail with imaps; I do this for search on email list in the future. The
> > emails are indexed partiality and I can´t found the problem of why don´t
> > index all of the emails.
> [...]
> > I´ve done a full import and no errors were found, but in the status I saw
> > that was added 28 documents, and in the console, I found 35 messanges.
> [...]
>
> > INFO: Total messages : 35
> >
> > Mar 22, 2011 3:55:16 PM
> > org.apache.solr.handler.dataimport.MailEntityProcessor$MessageIterator
> > 
> >
> > INFO: Search criteria applied. Batching disabled
> [...]
>
> The above seems to indicate that the MailEntityProcessor does find
> all 35 messages, but indexes only 28. Are you sure that all 35 are
> since 2010-01-01 00:00:00? Could you try without fetchMailsSince?
>
> Regards,
> Gora
>



-- 
Matias.