Apache Hadoop Get Together Berlin October 2010 - this time with a huge Mahout focus

2010-09-15 Thread Isabel Drost

Hello,

this is to announce the next Apache Hadoop Get Together sponsored by
JTeam (http://www.jteam.nl) that will take place in newthinking store
in Berlin.

When: October 7th, 5p.m.
Where: Newthinking store Berlin

As always there will be slots of 30min each for talks on your Hadoop
topic. After each talk there will be a lot time to discuss. You can
order drinks directly at the bar in the newthinking store. If you like,
you can order pizza. We will go to Cafe Aufsturz after the event for
some beer and something to eat.

Talks scheduled so far:

Max Heimel: "Hidden Markov Models for Apache Mahout"

Abstract: In this talk I will present and discuss an implementation of
a powerful statistical tool called Hidden Markov Models for the Apache
Mahout project. Hidden Markov models allow to mathematically deduce the
structure of an underlying - and unobservable - process based on the
structure of the produced data. Hidden Markov Models are thus
frequently applied in pattern recognition to deduce structures that are
not directly observable. Examples for applications of Hidden Markov
Models include the recognition of syllables in speech recordings,
handwritten letter recognition and part-of-speech tagging.

Sebastian Schelter: Distributed Itembased Collaborative Filtering with
Apache Mahout"

Abstract: Recommendation Mining helps users find items they like. A
very popular way to implement this is by using Collaborative Filtering.
This talk will give an introduction to an approach called Itembased
Collaborative Filtering and explain Mahout's Map/Reduce based
implementation of it.

Please do indicate on Upcoming
 or on Xing

if you are coming so we can more safely plan capacities. Updates to the
event, a brief summary and videos will be posted on
http://isabel-drost.de/hadoop

JTeam is looking for Java developers and search enthusiasts. Check out
their jobs page (http://www.jteam.nl/Jobs/Jobs.html) for more info!

As always a big Thank You goes to newthinking store for providing the
venue for free for our event.

Looking forward to seeing you in Berlin as well,
Isabel


DataDevRoom at the 2011 edition of the FOSDEM

2010-12-07 Thread Isabel Drost
Hello,

We (Olivier, Nicolas and I) are organizing a Data Analytics DevRoom
that will take place during the next edition of the FOSDEM in Brussels
on Feb. 5. Here is the CFP:

  http://datadevroom.couch.it/CFP

You might be interested in attending the event and take the
opportunity to speak about your projects. 

Important Dates (all dates in GMT +2):

Submission deadline:  2010-12-17
Notification of accepted speakers: 2010-12-20
Publication of final schedule:  2011-01-10
Meetup: 2011-02-05

The event will comprise presentations on scalable data processing. We
invite you to submit talks on the topics: Information retrieval / Search
Large Scale data processing, Machine Learning, Text Mining, Computer
vision, Linked Open Data.

High quality, technical submissions are called for, ranging from
principles to practice. We are looking for presentations on the
implementation of the systems themselves, real world applications and
case studies.

Submissions should be based on free software solutions.

Looking forward to meeting you face to face in Brussels,
Isabel


CFP - Berlin Buzzwords 2011 - Search, Score, Scale

2011-01-25 Thread Isabel Drost
This is to announce the Berlin Buzzwords 2011. The second edition of the 
successful conference on scalable and open search, data processing and data 
storage in Germany, taking place in Berlin.

Call for Presentations Berlin Buzzwords
   http://berlinbuzzwords.de
  Berlin Buzzwords 2011 - Search, Store, Scale
6/7 June 2011

The event will comprise presentations on scalable data processing. We invite 
you 
to submit talks on the topics:

   * IR / Search - Lucene, Solr, katta or comparable solutions
   * NoSQL - like CouchDB, MongoDB, Jackrabbit, HBase and others
   * Hadoop - Hadoop itself, MapReduce, Cascading or Pig and relatives
   * Closely related topics not explicitly listed above are welcome. We are
 looking for presentations on the implementation of the systems themselves,
 real world applications and case studies.

Important Dates (all dates in GMT +2)
   * Submission deadline: March 1st 2011, 23:59 MEZ
   * Notification of accepted speakers: March 22th, 2011, MEZ.
   * Publication of final schedule: April 5th, 2011.
   * Conference: June 6/7. 2011

High quality, technical submissions are called for, ranging from principles to 
practice. We are looking for real world use cases, background on the 
architecture of specific projects and a deep dive into architectures built on 
top of e.g. Hadoop clusters.

Proposals should be submitted at http://berlinbuzzwords.de/content/cfp-0 no 
later than March 1st, 2011. Acceptance notifications will be sent out soon 
after 
the submission deadline. Please include your name, bio and email, the title of 
the talk, a brief abstract in English language. Please indicate whether you 
want 
to give a lightning (10min), short (20min) or long (40min) presentation and 
indicate the level of experience with the topic your audience should have (e.g. 
whether your talk will be suitable for newbies or is targeted for experienced 
users.) If you'd like to pitch your brand new product in your talk, please let 
us know as well - there will be extra space for presenting new ideas, awesome 
products and great new projects.

The presentation format is short. We will be enforcing the schedule rigorously.

If you are interested in sponsoring the event (e.g. we would be happy to 
provide 
videos after the event, free drinks for attendees as well as an after-show 
party), please contact us.

Follow @hadoopberlin on Twitter for updates. Tickets, news on the conference, 
and the final schedule are be published at http://berlinbuzzwords.de.

Program Chairs: Isabel Drost, Jan Lehnardt, and Simon Willnauer.

Please re-distribute this CfP to people who might be interested.

If you are local and wish to meet us earlier, please note that this Thursday 
evening there will be an Apache Hadoop Get Together (videos kindly sponsored by 
Cloudera, venue kindly provided for free by Zanox) featuring talks on Apache 
Hadoop in production as well as news on current Apache Lucene developments.

Contact us at:

newthinking communications 
GmbH Schönhauser Allee 6/7 
10119 Berlin, 
Germany 

Julia Gemählich
Isabel Drost 

+49(0)30-9210 596


signature.asc
Description: This is a digitally signed message part.


September Hadoop Get Together

2009-08-24 Thread Isabel Drost
I would like to announce the September-2009 Hadoop Get Together in newthinking 
store Berlin.



   When: 29. September 2009 at 5:00pm
   Where: newthinking store, Tucholskystr. 48, Berlin, Germany



As always there will be slots of 20min each for talks on your Hadoop topic. 
After each talk there will be a lot time to discuss. You can order drinks 
directly at the bar in the newthinking store. If you like, you can order 
pizza. There are quite a few good restaurants nearby, so we can go there 
after the official part.

Talks scheduled so far:

Thorsten Schuett, Solving Puzzles with MapReduce: MapReduce is most often used 
for data mining and filtering large datasets. In this talk we will show that 
it also useful for a completely different problem domain: solving puzzles. 
Based on MapReduce, we can implement massively parallel breadth-first and 
heuristic search. MapReduce will take care of the hard problems, like 
parallelization, disk and error handling, while we can concentrate on the 
puzzle. Throughout the talk we will use the sliding puzzle 
(http://en.wikipedia.org/wiki/Sliding_puzzle) as our example.

Thilo Götz, Text analytics on jaql: Jaql (JSON query language) is a query 
language for Javascript Object Notation that runs on top of Apache Hadoop. It 
was primarily designed for large scale analysis of semi-structured data. I 
will give an introduction to jaql and describe our experiences using it for 
text analytics tasks. Jaql is open source and available from
 http://code.google.com/p/jaql.

Uwe Schindler, Lucene 2.9 Developments: Numeric Search, Per-Segment- and
Near-Real-Time Search, new TokenStream API: Uwe Schindler presents some new 
additions to Lucene 2.9. In the first half he will talk about fast numerical 
and date range queries (NumericRangeQuery, formerly TrieRangeQuery) and their 
usage in geospatial search applications like the Publishing Network for 
Geoscientific & Environmental Data (PANGAEA). In the second half of his talk, 
Uwe will highlight various improvements to the internal search implementation 
for near-real-time search. Finally, he will present the new TokenStream API, 
based on AttributeSource/Attributes that make indexing more pluggable. Future
developments in the Flexible Indexing Area will make use of it. Uwe will
show a Tokenizer that uses custom attributes to index XML files into various
document fields based on XML element names as a possible use-case.

We would like to invite you, the visitor to also tell your Hadoop story, if 
you like, you can bring slides - there will be a beamer.

A big Thanks goes to the newthinking store for providing a room in the center 
of Berlin for us.

For further information and updates, please refer to: 
http://upcoming.yahoo.com/event/4314020/


Hope to see you soon in Berlin,
Isabel

-- 
QOTD: "Gort, klaatu nikto barada."   -- The Day the Earth Stood Still 
  |\  _,,,---,,_   Web:   
  /,`.-'`'-.  ;-;;,_  
 |,4-  ) )-,_..;\ (  `'-' 
'---''(_/--'  `-'\_) (fL)  IM:  



signature.asc
Description: This is a digitally signed message part.


Apache Hadoop Get Together: Next week Tuesday, newthinking store Berlin Germany

2009-09-22 Thread Isabel Drost

This is a friendly reminder that the next Apache Hadoop Get Together
takes place next week on Tuesday, 29th of September* at newthinking
store (Tucholskystr. 48, Berlin):

http://upcoming.yahoo.com/event/4314020/

   * Thorsten Schuett, Solving Puzzles with MapReduce.
   * Thilo Götz, Text analytics on jaql.
   * Uwe Schindler, Lucene 2.9 Developments.

Big thanks goes to newthinking store for providing the venue for free
and to Cloudera for sponsoring videos of the talks. Links to the videos
will be posted on , on the upcoming page
linked above, as well as on the Cloudera Blog soon after the event.

The 7th Get Together is scheduled for December, 16th. If you would like
to submit a talk or sponsor the event, please contact me.


Hope to see you in Berlin next week,

Isabel



* The event is scheduled right before the UIMA workshop in Potsdam,
which may be of interest to you if you are a UIMA user:

http://docs.google.com/View?id=dft23bqs_3c7qnzg6x


Apache Hadoop Get Together Berlin - December 2009

2009-11-10 Thread Isabel Drost

As announced at ApacheCon US, the next Apache Hadoop Get Together Berlin is 
scheduled for December 2009.

When: Wednesday December 16, 2009  at 5:00pm 
Where: newthinking store, Tucholskystr. 48, Berlin

As always there will be slots of 20min each for talks on your Hadoop topic. 
After each talk there will be a lot time to discuss. You can order drinks 
directly at the bar in the newthinking store. If you like, you can order 
pizza. We will go to Cafe Aufsturz after the event for some beer and 
something to eat.

Talks scheduled so far:

Richard Hutton (nugg.ad): "Moving from five days to one hour." - This talk 
explains how we made data processing scalable at nugg.ad. The company's core 
business is online advertisement targeting. Our servers receive 10,000 
requests per second resulting in data of 100GB per day.

As the classical data warehouse solution reached its limit, we moved to a 
framework built on top of Hadoop to make analytics speedy,data mining 
detailed and all of our lives easier. We will give an overview of our 
solution involving file system structures, scheduling, messaging and 
programming languages from the future.

Jörg Möllenkamp (Sun): "Hadoop on Sun"
Abstract: Hadoop is a well known technology inside of Sun. This talk want to 
show some interesting use cases of Hadoop in conjunction with Sun 
technologies. The first show case wants to demonstrate how Hadoop can used to 
load massive multicore system with up to 256 threads in a single system to 
the max. The second use case shows how several mechanisms integrated in 
Solaris can ease the deployment and operation of Hadoop even in non-dedicated 
environments. The last usecase will show the combination of the Sun Grid 
Engine and Hadoop. Talk may contain command-line demonstrations ;).

Nikolaus Pohle (nurago): "M/R for MR - Online Market Research powered by 
Apache Hadoop. Enable consultants to analyze online behavior for audience 
segmentation, advertising effects and usage patterns."

We would like to invite you, the visitor to also tell your Hadoop story, if 
you like, you can bring slides - there will be a beamer.

A big Thanks goes to the newthinking store for providing a room in the center 
of Berlin for us. Another big thanks goes to StudiVZ for sponsoring videos of 
the talks. Links to the videos will be posted here as well as on the StudiVZ 
blog.

Please do indicate on the following upcoming event if you are planning to 
attend to make planning (and booking tables at Aufsturz) easier:

http://upcoming.yahoo.com/event/4842528/


Looking forward to seeing you in Berlin,
Isabel

-- 
  |\  _,,,---,,_   Web:   
  /,`.-'`'-.  ;-;;,_  
 |,4-  ) )-,_..;\ (  `'-' 
'---''(_/--'  `-'\_) (fL)  IM:  



signature.asc
Description: This is a digitally signed message part.


Third Hadoop Get Together @ Berlin

2008-11-24 Thread Isabel Drost

The third German Hadoop get together is going to take place at 9th of December 
at newthinking store in Berlin:

http://upcoming.yahoo.com/event/1383706/?ps=6

You can order drinks directly at the bar in the newthinking store. As this Get 
Together takes place in December - Christmas time - there will be cookies as 
well. There are quite a few good restaurants nearby, so we can go there after 
the official part.

Stefan Groschupf offered to prepare a talk on his project katta. We are still 
looking for one or more interesting talks. We would like to invite you, the 
visitor to tell your Hadoop story. If you like, you can bring slides - there 
will be a beamer. Please send your proposal at [EMAIL PROTECTED]

There will be slots of 20min each for talks on your Hadoop topic. After each 
talk there will be time to discuss.

A big Thanks goes to the newthinking store for again providing a room in the 
center of Berlin for us.

Looking forward to seeing you in Berlin,
Isabel Drost

-- 
QOTD: It's not an optical illusion, it just looks like one.   -- Phil White 
  |\  _,,,---,,_   Web:   <http://www.isabel-drost.de>
  /,`.-'`'-.  ;-;;,_   VoIP:
 |,4-  ) )-,_..;\ (  `'-'  Tel: (+49) 30 6920 6101
'---''(_/--'  `-'\_) (fL)  IM:  



pgpI1Jphg3Gtu.pgp
Description: PGP signature


Apache Hadoop Get Together Berlin March 2010

2010-01-31 Thread Isabel Drost
Hello,

this is to announce the next Apache Hadoop Get Together Berlin:

  When: March 10th, 5p.m. 
  Where: Newthinking store Berlin 


Talks scheduled so far:

   * Bram Smeets (JTeam/ Amsterdam): Spatial Search.
   * Dragan Milosevic (zanox/ Berlin: Product Search and Reporting powered by 
 Hadoop.
   * Bob Schulze (eCircle/ Munich): Database and Table Design Tips with HBase.


A big Thanks goes to the newthinking store for providing a room in the center 
of 
Berlin for us. Another big thanks goes to Nokia Gate 5 for sponsoring videos of 
the talks. Links to the videos will be posted after the event.


More information as well as registration is available on upcoming or xing:

 http://upcoming.yahoo.com/event/5280014
 https://www.xing.com/events/apache-hadoop-march-2010-459305


Looking forward to seeing you in Berlin,
Isabel


signature.asc
Description: This is a digitally signed message part.


Call for presentations - Berlin Buzzwords - Summer 2010

2010-03-11 Thread Isabel Drost
Call for Presentations Berlin Buzzwords
 http://buzzwordsberlin.de
  Berlin Buzzwords 2010 - Search, Store, Scale
   7/8 June 2010


This is to announce the Berlin Buzzwords 2010. The first conference on scalable 
and open search, data processing and data storage in Germany, taking place in 
Berlin.

The event will comprise presentations on scalable data processing. We invite 
you 
to submit talks on the topics:

Information retrieval / Search - Lucene, Solr, katta or comparable solutions
NoSQL - like CouchDB, MongoDB, Jackrabbit, HBase and others
Hadoop - Hadoop itself, MapReduce, Cascading or Pig and relatives

Closely related topics not explicitly listed above are welcome. We are looking 
for presentations on the implementation of the systems themselves, real world 
applications and case studies. 

Important Dates (all dates in GMT +2):

Submission deadline: April 17th 2010, 23:59
Notification of accepted speakers: May 1st, 2010. 
Publication of final schedule: May 9th, 2010. 
Conference: June 7/8. 2010.

High quality, technical submissions are called for, ranging from principles to 
practice. We are looking for real world use cases, background on the 
architecture of specific projects and a deep dive into architectures built on 
top of e.g. Hadoop clusters. 

Proposals should be submitted at http://berlinbuzzwords.de/content/cfp no later 
than April 17th, 2010. Acceptance notifications will be sent out on May 1st. 
Please include your name, bio and email, the title of the talk, a brief 
abstract 
in English language. Please indicate whether you want to give a short (30min) 
or 
long (45min) presentation and indicate the level of experience with the topic 
your audience should have (e.g. whether your talk will be suitable for newbies 
or is targeted for experienced users.)

The presentation format is short: either 30 or 45 minutes including questions. 
We will be enforcing the schedule rigorously. 

If you are interested in sponsoring the event (e.g. we would be happy to 
provide 
videos after the event, free drinks for attendees as well as an after-show 
party), please contact us. 

Follow @hadoopberlin on Twitter for updates. News on the conference will be 
published on our website at http://berlinbuzzwords.de

Program Chairs: Isabel Drost, Jan Lehnardt, and Simon Willnauer.

Schedule and further updates on the event will be published on 
http://berlinbuzzwords.de Please re-distribute this CfP to people who might be 
interested.

Contact us at: 
newthinking communications GmbH
Schönhauser Allee 6/7
10119 Berlin, Germany
Andreas Gebhard 
Isabel Drost 
+49(0)30-9210 596


signature.asc
Description: This is a digitally signed message part.


Re: Call for presentations - Berlin Buzzwords - Summer 2010

2010-03-11 Thread Isabel Drost
On 11.03.2010 Isabel Drost wrote:
> Call for Presentations Berlin Buzzwords

It should have been http://berlinbuzzwords.de of course...

Isabel


signature.asc
Description: This is a digitally signed message part.


Berlin Buzzwords - early registration extended

2010-04-08 Thread Isabel Drost

Hello,

we would like to invite everyone interested in data storage, analysis and 
search 
to join us for two days on June 7/8th in Berlin for an in-depth, technical, 
developer-focused conference located in the heart of Europe. Presentations will 
range from beginner friendly introductions on the hot data analysis topics up 
to 
in-depth technical presentations of scalable architectures.

Our intention is to bring together users and developers of data storage, 
analysis and search projects. Meet members of the development team working on 
projects you use. Get in touch with other developers you may know only from 
mailing list discussions. Exchange ideas with those using your software and get 
their feedback while having a drink in one of Berlin's many bars.

Early bird registration has been extended until April 17th - so don't wait too 
long. Tickets are available at: http://berlinbuzzwords.de/content/tickets

If you would like to submit a talk yourself: Conference submission is open for 
little more than one week. More details are available online in the call for 
presentations:

http://berlinbuzzwords.de/content/call-presentations-open

Looking forward to meeting you in the beautiful, vibrant city of Berlin this 
summer for a conference packed with high profile speakers, awesome talks and 
lots of interesting discussions.

Isabel


signature.asc
Description: This is a digitally signed message part.


Speakers and Schedule for Berlin Buzzwords 2010 - Search, Store and Scale 7th/8th 2010

2010-05-10 Thread Isabel Drost
Hi folks,

we proudly present the Berlin Buzzwords talks and presentations.
As promised there are tracks specific to the three tags search, store
and scale. We have a fantastic mixture of developers and users of open
source software projects that make scaling data processing today
possible.

There is Steve Loughran, Aaron Kimball and Stefan Groschupf from the
Apache Hadoop community. We have Grant Ingersoll, Robert Muir and the
"Generics Policeman" Uwe Schindler from the Lucene community.

For those interested in NoSQL databases there is Mathias Stearn from
MongoDB, Jan Lehnardt from CouchDB and Eric Evans, the guy who coined
the term NoSQL one year ago.

We have just published the initial version of the schedule here:

http://berlinbuzzwords.de/content/schedule-published

It seems like we are having a fantastic set of talks and speakers for
Buzzwords.

Visit us at http://berlinbuzzwords.de and register for the conference -
looking forward to seeing you in Berlin this summer!

If you like the event, please tell your friends - help spread the word
on Berlin Buzzwords.

Thanks to Jan Lehnardt, Simon Willnauer and newthinking communications
for co-organising the event.


Isabel


Berlin Buzzwords 2014: CfP is open

2014-01-23 Thread Isabel Drost-Fromm
I'm super happy to announce that the call for submissions for Berlin
Buzzwords 2013 is open. For those who don't know the conference - in
my "absolutely objective opinion" the event is the most exciting
conference on storing, processing and searching large amounts of
digital data for engineers.

The 5th edition of Berlin Buzzwords will take place on May 25-28,
2014 at Kulturbrauerei Berlin.

Berlin Buzzwords is looking for speakers who submit talks on the
following topics:

* Information Retrieval / Search i.e. Lucene, Solr, katta, ElasticSearch or
comparable solutions

* NoSQL and SQL i.e. CouchDB, MongoDB, Jackrabbit, Hbase and others

* Large Data Processing i.e. Hadoop itself, MapReduce, Cascading, Pig,
Spark and friends

Closely related topics not explicity listed above are welcome as well.

The Call for Submissions will be open until February 9! Be part of
Berlin Buzzwords and submit your session idea. Please register here:
.

Looking forward to lots of interesting proposals - and looking forward to
meeting all of you in Berlin later this year (did I mention that Berlin
rocks in summer?)


Isabel

PS: As always, any help with spreading the word is highly welcome.

PS2: One final hint - even though speakers of course get a complimentary
conference pass make sure to still check out our ticket page in
particular if you'd like to bring your children to the conference - we
do provide child day care on a donation basis but need your registration
for capacity planning: http://berlinbuzzwords.de/tickets



Call for Presentations FOSS Backstage open

2018-01-11 Thread Isabel Drost-Fromm
Hi,

As announced on Berlin Buzzwords we (that is Isabel Drost-Fromm, Stefan
Rudnitzki as well as the eventing team over at newthinking communications GmbH)
are working on a new conference in summer in Berlin. The name of this new
conference will be "FOSS Backstage". Backstage comprises all things
FOSS governance, open collaboration and how to build and manage communities
within the open source space.


Submission URL: https://foss-backstage.de/call-papers 

The event will comprise presentations on all things FOSS governance,
decentralised decision making, open collaboration. We invite you to submit talks
on the topics: FOSS project governance, collaboration, community management.
Asynchronous/ decentralised decision making.  Vendor neutrality in FOSS,
sustainable FOSS, cross team collaboration.  Dealing with poisonous people.
Project growth and hand-over. Trademarks. Strategic licensing.  While it's
primarily targeted at contributions from FOSS people, we would love to also
learn more on how typical FOSS collaboration models work well within
enterprises. Closely related topics not explicitly listed above are welcome. 

Important Dates (all dates in GMT +2)

Submission deadline: February 18th, 2018.

Conference: June, 13th/14th, 2018


High quality talks are called for, ranging from principles to practice. We are
looking for real world case studies, background on the social architecture of
specific projects and a deep dive into cross community collaboration.
Acceptance notifications will be sent out soon after the submission deadline.
Please include your name, bio and email, the title of the talk, a brief abstract
in English language.

We have drafted the submission form to allow for regular talks, each 45 min in
length. However you are free to submit your own ideas on how to support the
event: If you would like to take our attendees out to show them your favourite
bar in Berlin, please submit this offer through the CfP form.  If you are
interested in sponsoring the event (e.g. we would be happy to provide videos
after the event, free drinks for attendees as well as an after-show party),
please contact us.

Schedule and further updates on the event will be published soon on the event
web page.

Please re-distribute this CfP to people who might be interested.

 Contact us at:
 newthinking communications GmbH
 Schoenhauser Allee 6/7
 10119 Berlin, Germany
 i...@foss-backstage.de


Looking forward to meeting you all in person in summer :) I would love to see 
all those
tracks filled with lots of valuable talks on the Apache Way, on how we work,
on how the incubator works, on how being a 501(c3) influences how people get 
involved
and projects are being run, on how being a member run organisation is different,
on merit for life, on growing communities, on things gone great - and things
gone entirely wrong in the ASF's history, on how to interact with Apache
projects as a corporation and everything else you can think of.


Isabel


-- 
Sorry for any typos: Mail was typed in vim, written in mutt, via ssh (most 
likely involving some kind of mobile connection only.)