+1 On 2018/08/23 10:26:43, "Tan,Zhongyi" <tanzhon...@baidu.com> wrote: > +1 (not binding) > > 在 2018/8/23 上午7:47, "Willem Jiang" <willem.ji...@gmail.com> 写入: > > >+1 (binding) > > > > > >Willem Jiang > > > >Twitter: willemjiang > >Weibo: 姜宁willem > > > >On Wed, Aug 22, 2018 at 1:43 AM, Luciano Resende <luckbr1...@gmail.com> > >wrote: > > > >> After the initial discussion, please vote on the acceptance of Marvin-AI > >> Project for incubation at the Apache Incubator. The full proposal is > >> available at the end of this message and on the wiki at : > >> > >> https://wiki.apache.org/incubator/Marvin-AI > >> > >> Please cast your votes: > >> > >> [ ] +1, bring Marvin-AI into Incubator > >> [ ] +0, I don't care either way > >> [ ] -1, do not bring Marvin-AI into Incubator, because... > >> > >> The vote is open for the next 72 hours and only votes from the > >> Incubator PMC are binding. > >> > >> === > >> > >> = Marvin-AI = > >> > >> == Abstract == > >> > >> Marvin-AI is an open-source artificial intelligence (AI) platform that > >> helps data scientists, prototype and productionalize complex solutions > >>with > >> a scalable, low-latency, language-agnostic, and standardized > >>architecture > >> while simplifies the process of exploration and modeling. > >> > >> == Proposal == > >> > >> Marvin helps non-experienced developers create industry-grade AI > >> applications. It has three core components: a development environment > >>to > >> be used during data exploration and hypothesis validation (Toolbox), a > >> library which should be extended to create Marvin engines, and a Scala > >> application server which interprets engines (Engine Executor). > >> A basic premise of Marvin is that it should be language-agnostic, able > >>to > >> interpret engines implemented in different programming languages. > >> > >> == Background == > >> > >> The Marvin AI project was initiated as an internal project at B2W > >>Digital > >> (Brazil), the largest e-commerce company in Latin America. Nowadays, it > >>is > >> used by all data scientists within the B2W team. Oftentimes, data > >> scientists don't have an extensive background in software engineering, > >>yet > >> are in charge of creating AI applications that need to scale to high > >> throughput and provide millisecond-level response times. At B2W, Marvin > >>AI > >> plays an important role in this process, abstracting advanced software > >> engineering procedures, allowing data scientists to focus on their > >> knowledge domain. > >> > >> == Rationale == > >> > >> With recent advances in computer architecture and a corresponding > >>increase > >> in the amount of data generated by always-connected devices, AI > >>algorithms > >> offer a solution to problems that have long troubled modern > >>corporations. > >> Since AI developers come from various fields, such as statistics, > >>physics, > >> and math, there exists a strong need for platforms which enable them to > >> move from prototypes to enterprise applications. Although some tools > >>claim > >> to offer this service, in reality, there is no reliable open-source > >> solution. > >> > >> == Initial Goals == > >> > >> The initial goals will most likely be to merge the existing codebase > >>into a > >> single repository, migrate it to Apache, and then integrate with the > >>Apache > >> development process. Furthermore, we plan for incremental development > >>and > >> releases, as per Apache guidelines. > >> > >> == Current Status == > >> > >> === Meritocracy === > >> > >> Marvin already works under principles of meritocracy. Today, Marvin > >>already > >> has some contributors that are part of other institutions. Although > >>there > >> is no formal process defined to become a committer, contributors that > >>make > >> major changes/improvements to the platform are naturally granted write > >> access to the repository. > >> > >> > >> === Community === > >> > >> Acceptance into the Apache foundation would substantially boost both > >> Marvin's user and developer communities. The current community includes > >>a > >> few experienced developers that have either academic or professional > >> experience with AI. The community is largely comprised of data > >>scientists > >> working at B2W and other companies such as Cloudera, MIT, Qume Labs, > >> Laguro.com, and CBYK. Also, there is a meetup group of hundreds of > >>users > >> who meet regularly to exchange ideas about Marvin and, more generally, > >>AI. > >> > >> Reference to the group: https://www.meetup.com/marvin-ai/members/ > >> > >> === Core Developers === > >> > >> The core developers for Marvin are listed in the contributor's list and > >> initial PPMC below. These lists include B2W employees, MIT students, > >>UFSCAR > >> researchers, independent contributors, and some employees of other > >> companies like Cloudera, Qume Labs, Laguro.com, and CBYK. > >> > >> === Alignment === > >> > >> The initial committers strongly believe that by being part of the Apache > >> Software Foundation, Marvin AI will be part of a comprehensive suite > >>for AI > >> applications that can process big data and enable enterprises to extract > >> value from their data lakes. Also, we hope that by integrating with > >>other > >> Apache projects such as Apache Spark, Apache Hadoop; that this will > >>foster > >> additional collaboration between these projects furthering the already > >> existing integration points and expanding the community of contributors. > >> > >> > >> == Known Risks == > >> > >> === Orphaned products === > >> > >> Given the current maturity of Marvin and how well it has been received > >>at > >> technical conferences, the risk of the project being abandoned is > >>minimal. > >> AI is not academia-exclusive anymore, and as enterprises start to add > >> data-science pipelines to their applications, demand for Marvin will > >>only > >> increase. > >> > >> === Inexperience with Open Source === > >> > >> Marvin AI has been an open-source project since October 2017. The > >>project > >> was started in a company where open-source culture is foundational. B2W > >> Digital runs the largest e-commerce in Latin America on top of > >>open-source > >> projects. > >> > >> === Reliance on Salaried Developers === > >> > >> Marvin AI receives substantial efforts from salaried developers -- a > >>few of > >> which were hired by companies to work exclusively for the project -- but > >> the majority devote "after-hours" or spare time to this project. Some > >> developers are graduate students that contribute in their free time at > >> school. > >> > >> === Relationships with Other Apache Products === > >> > >> Marvin integrates with several Apache products, such as Hadoop (HDFS) > >>and > >> Spark. Marvin shares some similar features with PredictionIO, > >>specifically > >> the model application server and a design pattern that was inspired by > >>the > >> DASE. Despite these similarities, Marvin is catered towards a different > >> clientele (data scientists), and for that reason, it includes many > >>critical > >> features that are not provided by PredictionIO. > >> > >> === An Excessive Fascination with the Apache Brand === > >> > >> While the ASF brand will undoubtedly help Marvin become a successful > >> project, Marvin is already gaining traction at companies around the > >>globe. > >> > >> == Documentation == > >> > >> http://www.marvin-ai.org > >> > >> > >> == Initial Source == > >> > >> The current codebase is available at http://github.com/marvin-ai. This > >>is > >> practically the same code that will be migrating to the Apache > >>Foundation, > >> the notable difference being that the multiple repositories will be > >>merged > >> into a single repository (if necessary). > >> > >> These are the main repositories and a very simplified explanation about > >> each one: > >> > >> '''Main repositories''' > >> > >> * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps in > >> the > >> creation of new ML engines > >> * marvin-ai/marvin-engine-executor - Component responsible for > >> interpreting, serving and managing Marvin engines > >> * marvin-ai/marvin-public-engines - Marvin engine examples to help new > >> Marvin users to build engines > >> * marvin-ai/marvin-platform-book - Documentation in GitHub book site > >> format > >> > >> '''Secondary repositories (Experimental and Initial)''' > >> * marvin-ai/marvin-vagrant-dev - Development environment that uses > >> VirtualBox and vagrant to non mac and Linux users; > >> * marvin-ai/marvin-paper - Source code (latex format) of the first > >>Marvin > >> paper published in PAPIS.io conference in Boston. > >> * marvin-ai/marvin-cluster-admin - Admin module responsible to manage > >> Marvin cluster; > >> * marvin-ai/marvin-automl - AutoML module responsible to help data > >> scientist to build machine learning models with a very simple visual > >> interface; > >> > >> > >> == External Dependencies == > >> > >> It is very likely that all our dependencies are using either the Apache > >>or > >> MIT license. Upon acceptance to the incubator, we would begin a thorough > >> analysis of all transitive dependencies to verify this fact and > >>introduce > >> license checking into the build and release process. > >> > >> == Required Resources == > >> > >> === Mailing lists === > >> > >> * priv...@marvin.incubator.apache.org (with moderated subscriptions) > >> * d...@marvin.incubator.apache.org > >> * comm...@marvin.incubator.apache.org > >> > >> > >> === Git Repositories === > >> > >> * https://git-wip-us.apache.org/repos/asf/incubator-marvin.git > >> > >> === Issue Tracking === > >> > >> * JIRA (MARVIN) > >> > >> == Initial Committers == > >> > >> * Lucas Bonatto Miguel <lucasbona...@gmail.com> - Qume Labs > >>(California - > >> USA) > >> * Daniel Takabayashi <daniel.takabaya...@gmail.com> - B2W Digital (São > >> Paulo - BR) / Laguro.com (California - USA) > >> * Bruno Piraja <bruno.pir...@b2wdigital.com> - B2W Digital (São Paulo - > >> BR) > >> * Zhang Yifei <zhang.yi...@b2wdigital.com> - B2W Digital (São Paulo - > >>BR) > >> * Harrison Wang <hwang...@mit.edu> - MIT (USA) > >> * Brody West <bro...@mit.edu> - MIT (USA) > >> * Rafael Novello <rafael.nove...@b2wdigital.com> - B2W Digital (São > >>Paulo > >> - BR) > >> * Willian Leite <willian.le...@cbyk.com.br> - CBYK (São Paulo - BR) > >> * Danilo Nunes <nunesdan...@gmail.com> - Qume Labs (California - USA) > >> * Alan Silva <alan.si...@cloudera.com> Cloudera (USA) > >> * Jeremy Elster <jeremy.els...@b2wdigital.com> - B2W Digital (São > >>Paulo - > >> BR) > >> > >> > >> == Sponsors == > >> > >> === Champion === > >> > >> * Luciano Resende - (lresende) > >> > >> === Nominated Mentors === > >> > >> * Luciano Resende - (lresende) > >> * Jim Jagielski - (jim) > >> * William Colen - (colen) > >> > >> === Sponsoring Entity === > >> We would like to propose the Apache Incubator to sponsor this project. > >> > >> > >> -- > >> Luciano Resende > >> http://twitter.com/lresende1975 > >> http://lresende.blogspot.com/ > >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org >
--------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org