Hi,
Thanks all. Also looking forward to working with you. The coming period
you will find me working mostly on deploying Nutchgora on a fairly big
scale with a focus on crawling lots of data. We use HBase as a backend
store. The goal is to create a webpage database as big as possible. This
means that at first the most effort will be put into the HBase DataStore
and the crawling logic of Nutchgora branch of Nutch. (Indexing and all
that will be attended to later). This is pretty low level and specific
stuff, although I predict that a lot of work still needs to be done in
these areas. I understand that things like project infrastructure i.e.
the build process also is important and that there are many improvements
to be made there too, but I'll try not to meddle with this just yet.
Though this is partly because I'm still pretty inexperienced when it
comes to these components.
Hope this gives an impression of my initial efforts.
Ferdy.
On 01/12/2012 11:05 AM, Henry Saputra wrote:
Welcome Ferdy! Looking forward to working with you
- Henry
On Mon, Jan 9, 2012 at 7:01 AM, Mattmann, Chris A (388J)
<[email protected]> wrote:
Hi Folks,
A while ago I nominated Ferdy Galema for the Gora PPMC and for committership.
I'm happy
to report that he's accepted and that he was VOTEd in by members of the Gora
community.
Welcome Ferdy! Feel free to say a bit about yourself.
Cheers,
Chris
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [email protected]
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++