Luciano, Now I can properly volunteer myself as a mentor.
Thank you,
William

On Mon, Aug 20, 2018 at 7:37 PM, William Colen <co...@apache.org> wrote:

> Luciano, I am very happy to see this incubation proposal here. I have been
> following this project for a while. Knowing the reputation of the
> committers involved and of B2W, I know it can be very successful.
> I am waiting for board approval to join the IPMC. Once approved, I would
> happily volunteer myself as a mentor.
>
> On Mon, Aug 20, 2018 at 5:45 PM, Luciano Resende <luckbr1...@gmail.com>
> wrote:
>
>> We are still looking for one more mentor before we put this proposal to a
>> vote, any volunteers?
>>
>> On Wed, Aug 15, 2018 at 12:13 PM Luciano Resende <luckbr1...@gmail.com>
>> wrote:
>>
>> > We would like to start a discussion on accepting Marvin-AI as an Apache
>> > Incubator project.
>> >
>> > The proposal is available at the incubator wiki, and also copied below:
>> > https://wiki.apache.org/incubator/Marvin-AI
>> >
>> > As part of the initial due diligence, we have done a preliminary name
>> > search and the results are available on the JIRA below:
>> >
>> > https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-144
>> >
>> > We are also looking for two additional mentors.
>> >
>> >
>> > Thanks in advance for your time reviewing and providing feedback.
>> >
>> > ===
>> >
>> > = Marvin-AI =
>> >
>> > == Abstract ==
>> >
>> > Marvin-AI is an open-source artificial intelligence (AI) platform that
>> > helps data scientists prototype and productionalize complex solutions
>> > with a scalable, low-latency, language-agnostic, and standardized
>> > architecture while simplifying the process of exploration and modeling.
>> >
>> > == Proposal ==
>> >
>> > Marvin helps non-experienced developers create industry-grade AI
>> > applications.
>> > It has three core components: a development environment to be used
>> > during data exploration and hypothesis validation (Toolbox), a library
>> > which should be extended to create Marvin engines, and a Scala
>> > application server which interprets engines (Engine Executor).
>> > A basic premise of Marvin is that it should be language-agnostic, able
>> > to interpret engines implemented in different programming languages.
>> >
>> > == Background ==
>> >
>> > The Marvin AI project was initiated as an internal project at B2W
>> > Digital (Brazil), the largest e-commerce company in Latin America.
>> > Nowadays, it is used by all data scientists within the B2W team.
>> > Oftentimes, data scientists don't have an extensive background in
>> > software engineering, yet are in charge of creating AI applications
>> > that need to scale to high throughput and provide millisecond-level
>> > response times. At B2W, Marvin AI plays an important role in this
>> > process, abstracting advanced software engineering procedures and
>> > allowing data scientists to focus on their knowledge domain.
>> >
>> > == Rationale ==
>> >
>> > With recent advances in computer architecture and a corresponding
>> > increase in the amount of data generated by always-connected devices,
>> > AI algorithms offer a solution to problems that have long troubled
>> > modern corporations. Since AI developers come from various fields, such
>> > as statistics, physics, and math, there exists a strong need for
>> > platforms which enable them to move from prototypes to enterprise
>> > applications. Although some tools claim to offer this service, in
>> > reality, there is no reliable open-source solution.
>> >
>> > == Initial Goals ==
>> >
>> > The initial goals will most likely be to merge the existing codebase
>> > into a single repository, migrate it to Apache, and then integrate with
>> > the Apache development process.
>> > Furthermore, we plan for incremental development and releases, as per
>> > Apache guidelines.
>> >
>> > == Current Status ==
>> >
>> > === Meritocracy ===
>> >
>> > Marvin already works under principles of meritocracy. Today, Marvin
>> > already has some contributors from other institutions. Although there
>> > is no formal process defined to become a committer, contributors who
>> > make major changes/improvements to the platform are naturally granted
>> > write access to the repository.
>> >
>> >
>> > === Community ===
>> >
>> > Acceptance into the Apache foundation would substantially boost both
>> > Marvin's user and developer communities. The current community includes
>> > a few experienced developers who have either academic or professional
>> > experience with AI. The community is largely composed of data
>> > scientists working at B2W and other organizations such as Cloudera,
>> > MIT, Qume Labs, Laguro.com, and CBYK. Also, there is a meetup group of
>> > hundreds of users who meet regularly to exchange ideas about Marvin
>> > and, more generally, AI.
>> >
>> > Reference to the group: https://www.meetup.com/marvin-ai/members/
>> >
>> > === Core Developers ===
>> >
>> > The core developers for Marvin are listed in the contributors list and
>> > initial PPMC below. These lists include B2W employees, MIT students,
>> > UFSCAR researchers, independent contributors, and some employees of
>> > other companies like Cloudera, Qume Labs, Laguro.com, and CBYK.
>> >
>> > === Alignment ===
>> >
>> > The initial committers strongly believe that by being part of the
>> > Apache Software Foundation, Marvin AI will be part of a comprehensive
>> > suite for AI applications that can process big data and enable
>> > enterprises to extract value from their data lakes.
>> > Also, we hope that integrating with other Apache projects, such as
>> > Apache Spark and Apache Hadoop, will foster additional collaboration
>> > between these projects, furthering the already existing integration
>> > points and expanding the community of contributors.
>> >
>> >
>> > == Known Risks ==
>> >
>> > === Orphaned products ===
>> >
>> > Given the current maturity of Marvin and how well it has been received
>> > at technical conferences, the risk of the project being abandoned is
>> > minimal. AI is not academia-exclusive anymore, and as enterprises start
>> > to add data-science pipelines to their applications, demand for Marvin
>> > will only increase.
>> >
>> > === Inexperience with Open Source ===
>> >
>> > Marvin AI has been an open-source project since October 2017. The
>> > project was started in a company where open-source culture is
>> > foundational. B2W Digital runs the largest e-commerce business in Latin
>> > America on top of open-source projects.
>> >
>> > === Reliance on Salaried Developers ===
>> >
>> > Marvin AI receives substantial efforts from salaried developers -- a
>> > few of whom were hired by companies to work exclusively on the project
>> > -- but the majority devote "after-hours" or spare time to this project.
>> > Some developers are graduate students who contribute in their free time
>> > at school.
>> >
>> > === Relationships with Other Apache Products ===
>> >
>> > Marvin integrates with several Apache products, such as Hadoop (HDFS)
>> > and Spark. Marvin shares some similar features with PredictionIO,
>> > specifically the model application server and a design pattern that was
>> > inspired by DASE. Despite these similarities, Marvin caters to a
>> > different clientele (data scientists), and for that reason, it includes
>> > many critical features that are not provided by PredictionIO.
>> >
>> > === An Excessive Fascination with the Apache Brand ===
>> >
>> > While the ASF brand will undoubtedly help Marvin become a successful
>> > project, Marvin is already gaining traction at companies around the
>> > globe.
>> >
>> > == Documentation ==
>> >
>> > http://www.marvin-ai.org
>> >
>> >
>> > == Initial Source ==
>> >
>> > The current codebase is available at http://github.com/marvin-ai. This
>> > is practically the same code that will be migrated to the Apache
>> > Foundation, the notable difference being that the multiple repositories
>> > will be merged into a single repository (if necessary).
>> >
>> > These are the main repositories and a very simplified explanation of
>> > each one:
>> >
>> > '''Main repositories'''
>> >
>> >  * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps in
>> > the creation of new ML engines
>> >  * marvin-ai/marvin-engine-executor - Component responsible for
>> > interpreting, serving, and managing Marvin engines
>> >  * marvin-ai/marvin-public-engines - Marvin engine examples to help new
>> > Marvin users build engines
>> >  * marvin-ai/marvin-platform-book - Documentation in GitHub book site
>> > format
>> >
>> > '''Secondary repositories (Experimental and Initial)'''
>> >  * marvin-ai/marvin-vagrant-dev - Development environment that uses
>> > VirtualBox and Vagrant, for users who are not on Mac or Linux
>> >  * marvin-ai/marvin-paper - Source code (LaTeX format) of the first
>> > Marvin paper, published at the PAPIS.io conference in Boston
>> >  * marvin-ai/marvin-cluster-admin - Admin module responsible for
>> > managing the Marvin cluster
>> >  * marvin-ai/marvin-automl - AutoML module that helps data scientists
>> > build machine learning models with a very simple visual interface
>> >
>> >
>> > == External Dependencies ==
>> >
>> > It is very likely that all our dependencies are using either the Apache
>> > or MIT license.
>> > Upon acceptance to the incubator, we would begin a thorough analysis of
>> > all transitive dependencies to verify this fact and introduce license
>> > checking into the build and release process.
>> >
>> > == Required Resources ==
>> >
>> > === Mailing lists ===
>> >
>> >  * priv...@marvin-ai.incubator.apache.org (with moderated
>> > subscriptions)
>> >  * d...@marvin-ai.incubator.apache.org
>> >  * comm...@marvin-ai.incubator.apache.org
>> >
>> >
>> > === Git Repositories ===
>> >
>> >  * https://git-wip-us.apache.org/repos/asf/incubator-marvin-ai.git
>> >
>> > === Issue Tracking ===
>> >
>> >  * JIRA (MARVIN)
>> >
>> > == Initial Committers ==
>> >
>> >  * Lucas Bonatto Miguel <lucasbona...@gmail.com> - Qume Labs
>> > (California - USA)
>> >  * Daniel Takabayashi <daniel.takabaya...@gmail.com> - B2W Digital (São
>> > Paulo - BR) / Laguro.com (California - USA)
>> >  * Bruno Piraja <bruno.pir...@b2wdigital.com> - B2W Digital (São Paulo
>> > - BR)
>> >  * Zhang Yifei <zhang.yi...@b2wdigital.com> - B2W Digital (São Paulo -
>> > BR)
>> >  * Harrison Wang <hwang...@mit.edu> - MIT (USA)
>> >  * Brody West <bro...@mit.edu> - MIT (USA)
>> >  * Rafael Novello <rafael.nove...@b2wdigital.com> - B2W Digital (São
>> > Paulo - BR)
>> >  * Willian Leite <willian.le...@cbyk.com.br> - CBYK (São Paulo - BR)
>> >  * Danilo Nunes <nunesdan...@gmail.com> - Qume Labs (California - USA)
>> >  * Alan Silva <alan.si...@cloudera.com> - Cloudera (USA)
>> >  * Jeremy Elster <jeremy.els...@b2wdigital.com> - B2W Digital (São
>> > Paulo - BR)
>> >
>> >
>> > == Sponsors ==
>> >
>> > === Champion ===
>> >
>> >  * Luciano Resende - (lresende)
>> >
>> > === Nominated Mentors ===
>> >
>> >  * Luciano Resende - (lresende)
>> >
>> > === Sponsoring Entity ===
>> > We would like to propose that the Apache Incubator sponsor this
>> > project.
>> >
>> > --
>> > Luciano Resende
>> > http://twitter.com/lresende1975
>> > http://lresende.blogspot.com/
>> >
>>
>>
>> --
>> Luciano Resende
>> http://twitter.com/lresende1975
>> http://lresende.blogspot.com/
>>
>