Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th
As Bradford is out of town this evening, I will take up the mantle of Person-on-Point. Contact me with questions re: tonight's gathering.

See you tonight!

-Nick
614.657.0267

On Mon, Sep 28, 2009 at 4:33 PM, Bradford Stephens <bradfordsteph...@gmail.com> wrote:

> Hello everyone!
> Don't forget that the Meetup is THIS Wednesday! I'm looking forward to hearing about Hive from the Facebook team ... and there might be a few other interesting talks as well. Here are the details in the wiki:
> http://wiki.apache.org/hadoop/PNW_Hadoop_%2B_Apache_Cloud_Stack_User_Group
>
> Cheers,
> Bradford
>
> On Mon, Sep 14, 2009 at 11:35 AM, Bradford Stephens <bradfordsteph...@gmail.com> wrote:
>
> > Greetings,
> >
> > It's time for another Hadoop/Lucene/Apache "Cloud" Stack meetup! This month it'll be on Wednesday, the 30th, at 6:45 pm.
> >
> > We should have a few interesting guests this time around -- someone from Facebook may be stopping by to talk about Hive :)
> >
> > We've had great attendance in the past few months, let's keep it up! I'm always amazed by the things I learn from everyone.
> >
> > We're back at the University of Washington, Allen Computer Science Center (not Computer Engineering)
> > Map: http://www.washington.edu/home/maps/?CSE
> >
> > Room: 303 -or- the Entry level. If there are changes, signs will be posted.
> >
> > More Info:
> >
> > The meetup is about 2 hours (and there's usually food): we'll have two in-depth talks of 15-20 minutes each, and then several "lightning talks" of 5 minutes. If no one offers, we'll just have general discussion. We'll then have discussion and 'social time'. Let me know if you're interested in speaking or attending. We'd like to focus on education, so every presentation *needs* to ask some questions at the end. We can talk about these after the presentations, and I'll record what we've learned in a wiki and share that with the rest of us.
> >
> > Contact: Bradford Stephens, 904-415-3009, bradfordsteph...@gmail.com
> >
> > Cheers,
> > Bradford
> > --
> > http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
>
> --
> http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
Re: Seattle / PNW Hadoop/Lucene/HBase Meetup, Wed Sep 30th
Hey PNW Clouders! I'd really like to chat further with the crew doing distributed Solr. Give me a ring or shoot me an email, let's do lunch!

-Nick

On Wed, Sep 30, 2009 at 2:10 PM, Nick Dimiduk wrote:

> As Bradford is out of town this evening, I will take up the mantle of Person-on-Point. Contact me with questions re: tonight's gathering.
>
> See you tonight!
>
> -Nick
> 614.657.0267
Sharded Index Creation Magic?
Hello!

I'm working with Solr-1.3.0 using a sharded index for distributed, aggregated search. I've successfully run through the example described in the DistributedSearch wiki page. I have built an index from a corpus of some 50 million documents in an HBase table and created 7 shards using org.apache.hadoop.hbase.mapred.BuildTableIndex. I can deploy any one of these shards to a single Solr instance and happily search the index after tweaking the schema appropriately. However, when I search across all deployed shards using the &shards= query parameter (http://host00:8080/solr/select?shards=host00:8080/solr,host01:8080/solr&q=body\%3A%3Aterm), I get a NullPointerException:

java.lang.NullPointerException
    at org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:421)
    at org.apache.solr.handler.component.QueryComponent.handleResponses(QueryComponent.java:265)
    at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:264)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)

Debugging into the QueryComponent.mergeIds() method reveals that sreq.responses (line 356) contains one response for each shard specified, each with the number of results returned by the independent queries. The problems begin at line 370, where the SolrDocument instance has only a score field. This breaks the following line, where the id is requested: lacking the designated ID field (from my schema), the document cannot be added to the results queue.

Because the example on the wiki works by loading the documents directly into Solr for indexing, I have come to the conclusion that there is some extra magic happening in that index generation process which my process lacks.

Thanks for the help!
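For readers following along, the failure described above can be illustrated with a small sketch. This is not Solr's QueryComponent code. It is a hypothetical Python stand-in (the function name merge_shard_ids, the dict-shaped shard responses, and the default key name "id" are all assumptions made for illustration) showing why a merge keyed on the unique ID field cannot proceed when a shard returns documents that carry only a score.

```python
def merge_shard_ids(shard_responses, unique_key="id", rows=10):
    """Merge per-shard hits into one list ranked by descending score.

    shard_responses maps a shard name to a list of dicts, each expected
    to hold the unique key and a "score". Hypothetical structure, not
    Solr's internal representation.
    """
    seen = set()
    merged = []
    for shard, docs in shard_responses.items():
        for doc in docs:
            # A document with only a score cannot be identified,
            # so it cannot be de-duplicated or fetched later.
            if unique_key not in doc:
                raise KeyError(
                    "shard %r returned a document without %r: %r"
                    % (shard, unique_key, doc)
                )
            key = doc[unique_key]
            if key in seen:
                continue  # same document reported by more than one shard
            seen.add(key)
            merged.append((doc["score"], key, shard))
    # Highest score first; ties broken by key so the order is stable.
    merged.sort(key=lambda hit: (-hit[0], str(hit[1])))
    return merged[:rows]


if __name__ == "__main__":
    ok = {
        "host00:8080/solr": [{"id": "doc-1", "score": 1.5}],
        "host01:8080/solr": [{"id": "doc-2", "score": 2.5}],
    }
    print(merge_shard_ids(ok))

    broken = {"host00:8080/solr": [{"score": 1.5}]}
    try:
        merge_shard_ids(broken)
    except KeyError as err:
        print("merge failed:", err)
```

In this sketch the missing key is reported explicitly, whereas the stack trace above suggests the real code dereferences the missing value and fails with a NullPointerException.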
Re: Sharded Index Creation Magic?
I do, but you raise an interesting point. I had named the field incorrectly. I'm a little puzzled as to why individual search worked with the broken field name, but now all is well!

On Tue, Jul 14, 2009 at 12:03 AM, Shalin Shekhar Mangar <shalinman...@gmail.com> wrote:

> On Tue, Jul 14, 2009 at 2:00 AM, Nick Dimiduk wrote:
>
> > Because the example on the wiki works by loading the documents directly into Solr for indexing, I have come to the conclusion that there is some extra magic happening in this index generation process which my process lacks.
>
> Do you have a uniqueKey defined in your schema.xml?
>
> --
> Regards,
> Shalin Shekhar Mangar.
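A quick sanity check along the lines of Shalin's question can be sketched as below. It assumes the usual schema.xml layout, with a uniqueKey element naming one of the declared field elements. The helper check_unique_key is hypothetical and is not part of Solr. It only inspects the schema text, so it cannot tell whether an index built outside Solr actually contains a stored field by that name. A plausible reason single-shard search still worked (an assumption, not confirmed in this thread) is that a non-distributed query never runs the ID-keyed merge step.

```python
import xml.etree.ElementTree as ET


def check_unique_key(schema_xml):
    """Return a list of problems with the uniqueKey declaration.

    An empty list means the schema names a uniqueKey that matches a
    declared field which is not explicitly marked stored="false".
    """
    root = ET.fromstring(schema_xml)
    key_el = next(root.iter("uniqueKey"), None)
    if key_el is None or not (key_el.text or "").strip():
        return ["no <uniqueKey> declared"]

    key = key_el.text.strip()
    fields = {f.get("name"): f for f in root.iter("field")}

    problems = []
    if key not in fields:
        problems.append("uniqueKey %r does not match any <field>" % key)
    elif fields[key].get("stored") == "false":
        # The merge needs the key value returned with each hit.
        problems.append("uniqueKey field %r is not stored" % key)
    return problems


if __name__ == "__main__":
    # Illustrative schema fragment; the field names are made up.
    schema = """
    <schema name="example" version="1.1">
      <fields>
        <field name="id" type="string" indexed="true" stored="true"/>
        <field name="body" type="text" indexed="true" stored="false"/>
      </fields>
      <uniqueKey>docid</uniqueKey>
    </schema>
    """
    for problem in check_unique_key(schema):
        print(problem)
```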
Re: Seattle Hadoop/Lucene/NoSQL Meetup; Wed Feb 24th, Feat. MongoDB
Reminder: this month's Seattle Hadoop Meetup is this Wednesday. Don't forget to RSVP!

On Tue, Feb 16, 2010 at 6:09 PM, Bradford Stephens <bradfordsteph...@gmail.com> wrote:

> Greetings,
>
> It's time for another awesome Seattle Hadoop/Lucene/Scalability/NoSQL Meetup!
>
> As always, it's at the University of Washington, Allen Computer Science building, Room 303 at 6:45pm. You can find a map here:
> http://www.washington.edu/home/maps/southcentral.html?cse
>
> Last month, we had a great talk from Steve McPherson of Razorfish on their usage of Hadoop. This month, we'll have Richard Kreuter from MongoDB talking about, well, MongoDB, as well as assorted discussion on the Hadoop ecosystem.
>
> If you can, please RSVP here (not required, but very nice):
> http://www.meetup.com/Seattle-Hadoop-HBase-NoSQL-Meetup/
>
> My cell # is 904-415-3009 if you have questions/get lost.
>
> Cheers,
> Bradford
>
> --
> http://www.drawntoscalehq.com -- Big Data for all. The Big Data Platform.
>
> http://www.roadtofailure.com -- The Fringes of Scalability, Social Media, and Computer Science
Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!
Not that I'm aware of.

2010/2/25 Tim Terlegård

> 2010/2/25 Bradford Stephens:
> > Thanks for coming, everyone! We had around 25 people. A *huge* success, for Seattle. And a big thanks to 10gen for sending Richard.
> >
> > Can't wait to see you all next month.
>
> Did anyone record the event?
>
> /Tim