date:20240227

[gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Michał Górny

Hello,

Given the recent spread of the "AI" bubble, I think we really need to
look into formally addressing the related concerns.  In my opinion,
at this point the only reasonable course of action would be to safely
ban "AI"-backed contribution entirely.  In other words, explicitly
forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
create ebuilds, code, documentation, messages, bug reports and so on for
use in Gentoo.

Just to be clear, I'm talking about our "original" content.  We can't do
much about upstream projects using it.


Rationale:

1. Copyright concerns.  At this point, the copyright situation around
generated content is still unclear.  What's pretty clear is that pretty
much all LLMs are trained on huge corpora of copyrighted material, and
all fancy "AI" companies don't give shit about copyright violations.
In particular, there's a good risk that these tools would yield stuff we
can't legally use.

2. Quality concerns.  LLMs are really great at generating plausibly
looking bullshit.  I suppose they can provide good assistance if you are
careful enough, but we can't really rely on all our contributors being
aware of the risks.

3. Ethical concerns.  As pointed out above, the "AI" corporations don't
give shit about copyright, and don't give shit about people.  The AI
bubble is causing huge energy waste.  It is giving a great excuse for
layoffs and increasing exploitation of IT workers.  It is driving
enshittification of the Internet, it is empowering all kinds of spam
and scam.


Gentoo has always stood out as something different, something that
worked for people for whom mainstream distros were lacking.  I think
adding "made by real people" to the list of our advantages would be
a good thing — but we need to have policies in place, to make sure shit
doesn't flow in.

Compare with the shitstorm at:
https://github.com/pkgxdev/pantry/issues/5358

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Arsen Arsenović

Michał Górny  writes:

> Hello,
>
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.
>
> Just to be clear, I'm talking about our "original" content.  We can't do
> much about upstream projects using it.
>
>
> Rationale:
>
> 1. Copyright concerns.  At this point, the copyright situation around
> generated content is still unclear.  What's pretty clear is that pretty
> much all LLMs are trained on huge corpora of copyrighted material, and
> all fancy "AI" companies don't give shit about copyright violations.
> In particular, there's a good risk that these tools would yield stuff we
> can't legally use.
>
> 2. Quality concerns.  LLMs are really great at generating plausibly
> looking bullshit.  I suppose they can provide good assistance if you are
> careful enough, but we can't really rely on all our contributors being
> aware of the risks.
>
> 3. Ethical concerns.  As pointed out above, the "AI" corporations don't
> give shit about copyright, and don't give shit about people.  The AI
> bubble is causing huge energy waste.  It is giving a great excuse for
> layoffs and increasing exploitation of IT workers.  It is driving
> enshittification of the Internet, it is empowering all kinds of spam
> and scam.
>
>
> Gentoo has always stood out as something different, something that
> worked for people for whom mainstream distros were lacking.  I think
> adding "made by real people" to the list of our advantages would be
> a good thing — but we need to have policies in place, to make sure shit
> doesn't flow in.
>
> Compare with the shitstorm at:
> https://github.com/pkgxdev/pantry/issues/5358

+1.  All I've seen from "generatative" (read: auto-plagiarizing) A"I" is
spam and theft, and have the full intention of blocking it where-ever my
vote counts.
-- 
Arsen Arsenović


signature.asc
Description: PGP signature

[gentoo-dev] Last rites: sci-chemistry/nmrglue

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-27)
# Effectively unmaintained in Gentoo.  Still depends on dev-python/nose,
# on top of that tests are restricted, so we don't even know if it
# works at all.  No reverse dependencies.
# Removal on 2024-03-28.  Bug #878725.
sci-chemistry/nmrglue

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

[gentoo-dev] Last rites: sci-biology/biopandas

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-27)
# Still depends on dev-python/nose.  No reverse dependencies.
# Removal on 2024-03-28.  Bug #878721.
sci-biology/biopandas

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Kenton Groombridge

On 24/02/27 03:45PM, Michał Górny wrote:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.
> 
> Just to be clear, I'm talking about our "original" content.  We can't do
> much about upstream projects using it.
> 
> 
> Rationale:
> 
> 1. Copyright concerns.  At this point, the copyright situation around
> generated content is still unclear.  What's pretty clear is that pretty
> much all LLMs are trained on huge corpora of copyrighted material, and
> all fancy "AI" companies don't give shit about copyright violations.
> In particular, there's a good risk that these tools would yield stuff we
> can't legally use.
> 
> 2. Quality concerns.  LLMs are really great at generating plausibly
> looking bullshit.  I suppose they can provide good assistance if you are
> careful enough, but we can't really rely on all our contributors being
> aware of the risks.
> 
> 3. Ethical concerns.  As pointed out above, the "AI" corporations don't
> give shit about copyright, and don't give shit about people.  The AI
> bubble is causing huge energy waste.  It is giving a great excuse for
> layoffs and increasing exploitation of IT workers.  It is driving
> enshittification of the Internet, it is empowering all kinds of spam
> and scam.
> 
> 
> Gentoo has always stood out as something different, something that
> worked for people for whom mainstream distros were lacking.  I think
> adding "made by real people" to the list of our advantages would be
> a good thing — but we need to have policies in place, to make sure shit
> doesn't flow in.
> 
> Compare with the shitstorm at:
> https://github.com/pkgxdev/pantry/issues/5358
> 
> -- 
> Best regards,
> Michał Górny
> 

I completely agree.

Your rationale hits the most important concerns I have about these
technologies in open source. There is a significant opportunity for
Gentoo to set the example here.

-- 
Kenton Groombridge
Gentoo Linux Developer, SELinux Project


signature.asc
Description: PGP signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Alex Boag-Munroe

On Tue, 27 Feb 2024 at 15:21, Kenton Groombridge  wrote:
>
> On 24/02/27 03:45PM, Michał Górny wrote:
> > Hello,
> >
> > Given the recent spread of the "AI" bubble, I think we really need to
> > look into formally addressing the related concerns.  In my opinion,
> > at this point the only reasonable course of action would be to safely
> > ban "AI"-backed contribution entirely.  In other words, explicitly
> > forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> > create ebuilds, code, documentation, messages, bug reports and so on for
> > use in Gentoo.
> >
> > Just to be clear, I'm talking about our "original" content.  We can't do
> > much about upstream projects using it.
> >
> >
> > Rationale:
> >
> > 1. Copyright concerns.  At this point, the copyright situation around
> > generated content is still unclear.  What's pretty clear is that pretty
> > much all LLMs are trained on huge corpora of copyrighted material, and
> > all fancy "AI" companies don't give shit about copyright violations.
> > In particular, there's a good risk that these tools would yield stuff we
> > can't legally use.
> >
> > 2. Quality concerns.  LLMs are really great at generating plausibly
> > looking bullshit.  I suppose they can provide good assistance if you are
> > careful enough, but we can't really rely on all our contributors being
> > aware of the risks.
> >
> > 3. Ethical concerns.  As pointed out above, the "AI" corporations don't
> > give shit about copyright, and don't give shit about people.  The AI
> > bubble is causing huge energy waste.  It is giving a great excuse for
> > layoffs and increasing exploitation of IT workers.  It is driving
> > enshittification of the Internet, it is empowering all kinds of spam
> > and scam.
> >
> >
> > Gentoo has always stood out as something different, something that
> > worked for people for whom mainstream distros were lacking.  I think
> > adding "made by real people" to the list of our advantages would be
> > a good thing — but we need to have policies in place, to make sure shit
> > doesn't flow in.
> >
> > Compare with the shitstorm at:
> > https://github.com/pkgxdev/pantry/issues/5358
> >
> > --
> > Best regards,
> > Michał Górny
> >
>
> I completely agree.
>
> Your rationale hits the most important concerns I have about these
> technologies in open source. There is a significant opportunity for
> Gentoo to set the example here.
>
> --
> Kenton Groombridge
> Gentoo Linux Developer, SELinux Project

A thousand times yes.

[gentoo-dev] Last rites: app-misc/binwalk

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-27)
# Unmaintained upstream.  Already carries a few patches.
# Depends on dev-python/nose.
# Removal on 2024-03-28.  Bug #878693.
app-misc/binwalk

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

[gentoo-dev] Last rites: app-admin/salt, dev-python/pytest-salt-factories, dev-python/boto

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-27)
# dev-python/boto is dead, with last release in 2018.  It has been
# replaced by dev-python/boto3.  It carries a ton of patches and still
# depends on dev-python/nose.
#
# app-admin/salt is its only remaining reverse dependency.  The ebuild
# is of very low quality.
#
# Removal on 2024-03-28.  Bug #888235.
app-admin/salt
dev-python/pytest-salt-factories
dev-python/boto

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Marek Szuba


On 2024-02-27 14:45, Michał Górny wrote:


In my opinion, at this point the only reasonable course of action
would be to safely ban "AI"-backed contribution entirely.  In other
words, explicitly forbid people from using ChatGPT, Bard, GitHub
Copilot, and so on, to create ebuilds, code, documentation, messages,
bug reports and so on for use in Gentoo.


I very much support this idea, for all the three reasons quoted.

2. Quality concerns.  LLMs are really great at generating plausibly 
looking bullshit.  I suppose they can provide good assistance if you

are careful enough, but we can't really rely on all our contributors
being aware of the risks.


https://arxiv.org/abs/2211.03622


3. Ethical concerns.


...yeah. Seeing as we failed to condemn the Russian invasion of Ukraine 
in 2022, I would probably avoid quoting this as a reason for banning 
LLM-generated contributions. Even though I do, as mentioned above, very 
much agree with this point.


--
Marecki


OpenPGP_signature.asc
Description: OpenPGP digital signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Sam James

Marek Szuba  writes:

> On 2024-02-27 14:45, Michał Górny wrote:
>
>> In my opinion, at this point the only reasonable course of action
>> would be to safely ban "AI"-backed contribution entirely.  In other
>> words, explicitly forbid people from using ChatGPT, Bard, GitHub
>> Copilot, and so on, to create ebuilds, code, documentation, messages,
>> bug reports and so on for use in Gentoo.
>
> I very much support this idea, for all the three reasons quoted.
>
>> 2. Quality concerns.  LLMs are really great at generating plausibly
>> looking bullshit.  I suppose they can provide good assistance if you
>> are careful enough, but we can't really rely on all our contributors
>> being aware of the risks.
>
> https://arxiv.org/abs/2211.03622
>
>> 3. Ethical concerns.
>
> ...yeah. Seeing as we failed to condemn the Russian invasion of
> Ukraine in 2022, I would probably avoid quoting this as a reason for
> banning LLM-generated contributions. Even though I do, as mentioned
> above, very much agree with this point.

That's not a technical topic and we had an extended discussion about
what to do in -core, which included the risks of making life difficult
for Russian developers and contributors.

I don't think that's a helpful intervention here, sorry.

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Andreas K. Huettel

Am Dienstag, 27. Februar 2024, 15:45:17 CET schrieb Michał Górny:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.

Fully agree and support this.

> 
> Just to be clear, I'm talking about our "original" content.  We can't do
> much about upstream projects using it.
[...] or implementing it.

So, also, no objections against someone (a real person, by his own mental
means) packaging AI software for Gentoo.


-- 
Andreas K. Hüttel
dilfri...@gentoo.org
Gentoo Linux developer
(council, toolchain, base-system, perl, libreoffice)

signature.asc
Description: This is a digitally signed message part.

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Ionen Wolkens

On Tue, Feb 27, 2024 at 03:45:17PM +0100, Michał Górny wrote:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.

+1 from me, a clear stance before it really start hitting Gentoo sounds
good.
-- 
ionen


signature.asc
Description: PGP signature

Re: [gentoo-dev] 2024-02-26-debianutils-drops-installkernel-dep: add news item v2

2024-02-27 Thread Hank Leininger

On 2024-02-27, andrewammerlaan wrote:

> Until recently, sys-apps/debianutils was in turn pulled in by
> app-misc/ca-certificates, an essential package installed on many
> systems. This is no longer the case.[2]. As a result many users may find
> that sys-apps/debianutils and therefore sys-kernel/installkernel are no
> longer part of the dependency graph and will therefore be cleaned up by
> "emerge --depclean".

Sorry for speaking up late: I (mis)read the second sentence differently
from others in this thread, apparently.

"This is no longer the case." might apply to the first part of the
previous sentence, "was in turn pulled in by".

Or it might apply to the second part, "an essential package installed on
many systems."

I think what's meant is the former, it is no longer pulled in. But
someone reading this cold could be forgiven for reading that as
"ca-certificates is no longer an essential package".

Unfortunately my recommendation would be to restore the mention of a
dependency, in some form or fashion, which seems to be something that
was removed due to earlier feedback in this thread.

Maybe:

Until recently, sys-apps/debianutils was in turn pulled in by
app-misc/ca-certificates, an essential package installed on many
systems. That package no longer depends on sys-apps/debianutils.  As a
result many users may find that sys-apps/debianutils and therefore
sys-kernel/installkernel are no longer part of the dependency graph and
will therefore be cleaned up by "emerge --depclean".

Thanks,

Hank

signature.asc
Description: Digital signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Rich Freeman

On Tue, Feb 27, 2024 at 9:45 AM Michał Górny  wrote:
>
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.

> 1. Copyright concerns.

I do think it makes sense to consider some of this.

However, I feel like the proposal is redundant with the existing
requirement to signoff on the DCO, which says:

>>> By making a contribution to this project, I certify that:

>>> 1. The contribution was created in whole or in part by me, and
>>> I have the right to submit it under the free software license
>>> indicated in the file; or

>>> 2. The contribution is based upon previous work that, to the best of
>>> my knowledge, is covered under an appropriate free software license,
>>> and I have the right under that license to submit that work with
>>> modifications, whether created in whole or in part by me, under the
>>> same free software license (unless I am permitted to submit under a
>>> different license), as indicated in the file; or

>>> 3. The contribution is a license text (or a file of similar nature),
>>> and verbatim distribution is allowed; or

>>> 4. The contribution was provided directly to me by some other person
>>> who certified 1., 2., 3., or 4., and I have not modified it.

Perhaps we ought to just re-advertise the policy that already exists?

> 2. Quality concerns.

As far as quality is concerned, I again share the concerns you raise,
and I think we should just re-emphasize what many other industries are
already making clear - that individuals are responsible for the
quality of their contributions.  Copy/pasting it blindly from an AI is
no different from copy/pasting it from some other random website, even
if it is otherwise legal.

> 3. Ethical concerns.

I think it is best to just avoid taking a stand on this.  Our ethics
are already documented in the Social Contract.

I think everybody agrees that what is right and wrong is obvious and
clear and universal.  Then we're all shocked to find that large
numbers of people have a universal perspective different from our own.
Even if 90% of contributors agree with a particular position, if we
start lopping off parts of our community 10% at a time we'll probably
find ourselves alone in a room sooner or later.  We can't make every
hill the one to die on.

> I think adding "made by real people" to the list of our advantages
> would be a good thing

Somehow I doubt this is going to help us steal market share from the
numerous other popular source-based Linux distros.  :)

To be clear, I don't think it is a bad idea to just reiterate that we
aren't looking for help from people who want to create scripts that
pipe things into some GPT API and pipe the output into a forum, bug,
issue, PR, or commit.  I've seen other FOSS projects struggling with
people trying to be "helpful" in this way.  I just don't think any of
this actually requires new policy.  If we find our policy to be
inadequate I think it is better to go back to the core principles and
better articulate what we're trying to achieve, rather than adjust it
to fit the latest fashions.

-- 
Rich

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Matthias Maier

On Tue, Feb 27, 2024, at 08:45 CST, Michał Górny  wrote:

> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.

+1


> 2. Quality concerns.  LLMs are really great at generating plausibly
> looking bullshit.  I suppose they can provide good assistance if you are
> careful enough, but we can't really rely on all our contributors being
> aware of the risks.

This is my main concern, but all of the other points are valid as well.


Best,
Matthias

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Roy Bamford

On 2024.02.27 14:45, Michał Górny wrote:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on
> for
> use in Gentoo.
> 
> Just to be clear, I'm talking about our "original" content.  We can't
> do
> much about upstream projects using it.
> 
> 
> Rationale:
> 
> 1. Copyright concerns.  At this point, the copyright situation around
> generated content is still unclear.  What's pretty clear is that
> pretty
> much all LLMs are trained on huge corpora of copyrighted material, and
> all fancy "AI" companies don't give shit about copyright violations.
> In particular, there's a good risk that these tools would yield stuff
> we
> can't legally use.
> 
> 2. Quality concerns.  LLMs are really great at generating plausibly
> looking bullshit.  I suppose they can provide good assistance if you
> are
> careful enough, but we can't really rely on all our contributors being
> aware of the risks.
> 
> 3. Ethical concerns.  As pointed out above, the "AI" corporations
> don't
> give shit about copyright, and don't give shit about people.  The AI
> bubble is causing huge energy waste.  It is giving a great excuse for
> layoffs and increasing exploitation of IT workers.  It is driving
> enshittification of the Internet, it is empowering all kinds of spam
> and scam.
> 
> 
> Gentoo has always stood out as something different, something that
> worked for people for whom mainstream distros were lacking.  I think
> adding "made by real people" to the list of our advantages would be
> a good thing — but we need to have policies in place, to make sure
> shit
> doesn't flow in.
> 
> Compare with the shitstorm at:
> https://github.com/pkgxdev/pantry/issues/5358
> 
> -- 
> Best regards,
> Michał Górny
> 
> 

Michał,

An excellent piece of prose setting out the rationale.
I fully support it.

-- 
Regards,

Roy Bamford
(Neddyseagoon) a member of
elections
gentoo-ops
forum-mods
arm64

pgp0BJ299ipp6.pgp
Description: PGP signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Sam James

Michał Górny  writes:

> Hello,
>
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.
>
> Just to be clear, I'm talking about our "original" content.  We can't do
> much about upstream projects using it.
>

I agree with the proposal, just some thoughts below.

I'm a bit worried this is slightly performative - which is not a dig at
you at all - given we can't really enforce it, and it requires honesty,
but that's also not a reason to not try ;)

>
> Rationale:
>
> 1. Copyright concerns.  At this point, the copyright situation around
> generated content is still unclear.  What's pretty clear is that pretty
> much all LLMs are trained on huge corpora of copyrighted material, and
> all fancy "AI" companies don't give shit about copyright violations.
> In particular, there's a good risk that these tools would yield stuff we
> can't legally use.
>

It also makes risk for anyone basing products or tools on Gentoo if
we're not confident about the integrity / provenance of our work.

> 2. Quality concerns.  LLMs are really great at generating plausibly
> looking bullshit.  I suppose they can provide good assistance if you are
> careful enough, but we can't really rely on all our contributors being
> aware of the risks.
>
> 3. Ethical concerns.  As pointed out above, the "AI" corporations don't
> give shit about copyright, and don't give shit about people.  The AI
> bubble is causing huge energy waste.  It is giving a great excuse for
> layoffs and increasing exploitation of IT workers.  It is driving
> enshittification of the Internet, it is empowering all kinds of spam
> and scam.
>
>
> Gentoo has always stood out as something different, something that
> worked for people for whom mainstream distros were lacking.  I think
> adding "made by real people" to the list of our advantages would be
> a good thing — but we need to have policies in place, to make sure shit
> doesn't flow in.
>
> Compare with the shitstorm at:
> https://github.com/pkgxdev/pantry/issues/5358


signature.asc
Description: PGP signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Ulrich Mueller

> On Tue, 27 Feb 2024, Rich Freeman wrote:

> On Tue, Feb 27, 2024 at 9:45 AM Michał Górny  wrote:
>> 
>> Given the recent spread of the "AI" bubble, I think we really need to
>> look into formally addressing the related concerns.

First of all, I fully support mgorny's proposal.

>> 1. Copyright concerns.

> I do think it makes sense to consider some of this.

> However, I feel like the proposal is redundant with the existing
> requirement to signoff on the DCO, which says:

 By making a contribution to this project, I certify that:

 1. The contribution was created in whole or in part by me, and
 I have the right to submit it under the free software license
 indicated in the file; or

 2. The contribution is based upon previous work that, to the best of
 my knowledge, is covered under an appropriate free software license,
 and I have the right under that license to submit that work with
 modifications, whether created in whole or in part by me, under the
 same free software license (unless I am permitted to submit under a
 different license), as indicated in the file; or

 3. The contribution is a license text (or a file of similar nature),
 and verbatim distribution is allowed; or

 4. The contribution was provided directly to me by some other person
 who certified 1., 2., 3., or 4., and I have not modified it.

I have been thinking about this aspect too. Certainly there is some
overlap with our GLEP 76 policy, but I don't think that it is redundant.

I'd rather see it as a (much needed) clarification how to deal with AI
generated code. All the better if the proposal happens to agree with
policies that are already in place.

Ulrich

signature.asc
Description: PGP signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Kenton Groombridge

On 24/02/27 07:07PM, Ulrich Mueller wrote:
> > On Tue, 27 Feb 2024, Rich Freeman wrote:
> 
> > On Tue, Feb 27, 2024 at 9:45 AM Michał Górny  wrote:
> >> 
> >> Given the recent spread of the "AI" bubble, I think we really need to
> >> look into formally addressing the related concerns.
> 
> First of all, I fully support mgorny's proposal.
> 
> >> 1. Copyright concerns.
> 
> > I do think it makes sense to consider some of this.
> 
> > However, I feel like the proposal is redundant with the existing
> > requirement to signoff on the DCO, which says:
> 
>  By making a contribution to this project, I certify that:
> 
>  1. The contribution was created in whole or in part by me, and
>  I have the right to submit it under the free software license
>  indicated in the file; or
> 
>  2. The contribution is based upon previous work that, to the best of
>  my knowledge, is covered under an appropriate free software license,
>  and I have the right under that license to submit that work with
>  modifications, whether created in whole or in part by me, under the
>  same free software license (unless I am permitted to submit under a
>  different license), as indicated in the file; or
> 
>  3. The contribution is a license text (or a file of similar nature),
>  and verbatim distribution is allowed; or
> 
>  4. The contribution was provided directly to me by some other person
>  who certified 1., 2., 3., or 4., and I have not modified it.
> 
> I have been thinking about this aspect too. Certainly there is some
> overlap with our GLEP 76 policy, but I don't think that it is redundant.
> 
> I'd rather see it as a (much needed) clarification how to deal with AI
> generated code. All the better if the proposal happens to agree with
> policies that are already in place.
> 
> Ulrich

This is my interpretation of it as well, especially when it comes to
para. 2:

>>> 2. The contribution is based upon previous work that, to the best of
>>> my knowledge, is covered under an appropriate free software license,
>>> [...]

It is extremely difficult (if not impossible) to verify this with some of
these tools, and that's assuming that the user of these tools knows
enough about how they work where this is a concern to them. I would
argue it's best to stay away from these tools at least until there is more
clear and concise legal interpretation of their usage in relation to
copyright.

-- 
Kenton Groombridge
Gentoo Linux Developer, SELinux Project


signature.asc
Description: PGP signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Peter Böhm

Am Dienstag, 27. Februar 2024, 18:50:15 CET schrieb Roy Bamford:
> On 2024.02.27 14:45, Michał Górny wrote:
> > Hello,
> > 
> > [...]
> > 
> > Gentoo has always stood out as something different, something that
> > worked for people for whom mainstream distros were lacking.  I think
> > adding "made by real people" to the list of our advantages would be
> > a good thing — but we need to have policies in place, to make sure
> > shit
> > doesn't flow in.
> > 
> > Compare with the shitstorm at:
> > https://github.com/pkgxdev/pantry/issues/5358
> 
> Michał,
> 
> An excellent piece of prose setting out the rationale.
> I fully support it.

I would like to add the following:

Last year we had a chatbot in our Gentoo forum that posted 76 posts on 
2024-12-19. An inexperienced moderator (me) then asked his colleagues on the 
basis of which forum rules we can ban this chatbot:

"Do we have a rule somewhere that an AI and a chatbot are not allowed to log 
in? I have read our Guideĺines ( https://forums.gentoo.org/viewtopic-t-525.html 
) and found no such prohibition. On what basis could we even block 
a chatbot ?"

The answer from two experienced colleagues was that this is already covered by 
our forum rules, because chatbots usually cannot (yet) fulfill the requirements 
of a forum post and therefore violate our Guideĺines.

To be honest, I asked myself at the time what would happen if we had a clearly 
recognizable AI as a user that made (reasonably) sensible posts. We would then 
have no chance of banning this AI user without an explicit prohibition. I 
would be much more comfortable if we clearly communicated that we do not 
accept an AI as a user.

Yes, I would also be very happy to see this proposal implemented.

-- 
Best regards,
Peter (aka pietinger)

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Eli Schwartz

On 2/27/24 9:45 AM, Michał Górny wrote:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.


No constructive or valuable contributions will fall afoul of the new ban.

Seems reasonable to me.


-- 
Eli Schwartz


OpenPGP_0x84818A6819AF4A9B.asc
Description: OpenPGP public key


OpenPGP_signature.asc
Description: OpenPGP digital signature

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Oskari Pirhonen

On Tue, Feb 27, 2024 at 15:45:17 +0100, Michał Górny wrote:
> Hello,
> 
> Given the recent spread of the "AI" bubble, I think we really need to
> look into formally addressing the related concerns.  In my opinion,
> at this point the only reasonable course of action would be to safely
> ban "AI"-backed contribution entirely.  In other words, explicitly
> forbid people from using ChatGPT, Bard, GitHub Copilot, and so on, to
> create ebuilds, code, documentation, messages, bug reports and so on for
> use in Gentoo.
> 
> Just to be clear, I'm talking about our "original" content.  We can't do
> much about upstream projects using it.
> 

I agree.

But for the sake of discussion:

What about cases where someone, say, doesn't have an excellent grasp of
English and decides to use, for example, ChatGPT to aid in writing
documentation/comments (not code) and puts a note somewhere explicitly
mentioning what was AI-generated so that someone else can take a closer
look?

I'd personally not be the biggest fan of this if it wasn't in something
like a PR or ml post where it could be reviewed before being made final.
But the most impportant part IMO would be being up-front about it.

> 
> Rationale:
> 
> 1. Copyright concerns.  At this point, the copyright situation around
> generated content is still unclear.  What's pretty clear is that pretty
> much all LLMs are trained on huge corpora of copyrighted material, and
> all fancy "AI" companies don't give shit about copyright violations.
> In particular, there's a good risk that these tools would yield stuff we
> can't legally use.
> 

I really dislike the lack of audit trail for where the bits and pieces
come from. Not to mention the examples from early on where Copilot was
filling in incorrect attribution.

- Oskari

signature.asc
Description: PGP signature

Re: [gentoo-dev] Last rites: app-admin/salt, dev-python/pytest-salt-factories, dev-python/boto

2024-02-27 Thread Patrick McLean

The salt ebuild has been refactored to remove the tests and modules that 
require dev-python/boto, and the mask has been removed.


Salt has a lot of users, and it would be doing them a disservice to 
remove it from the tree.


On 2024-02-27 07:42, Michał Górny wrote:

# Michał Górny  (2024-02-27)
# dev-python/boto is dead, with last release in 2018.  It has been
# replaced by dev-python/boto3.  It carries a ton of patches and still
# depends on dev-python/nose.
#
# app-admin/salt is its only remaining reverse dependency.  The ebuild
# is of very low quality.
#
# Removal on 2024-03-28.  Bug #888235.
app-admin/salt
dev-python/pytest-salt-factories
dev-python/boto

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

2024-02-27 Thread Michał Górny

On Tue, 2024-02-27 at 21:05 -0600, Oskari Pirhonen wrote:
> What about cases where someone, say, doesn't have an excellent grasp of
> English and decides to use, for example, ChatGPT to aid in writing
> documentation/comments (not code) and puts a note somewhere explicitly
> mentioning what was AI-generated so that someone else can take a closer
> look?
> 
> I'd personally not be the biggest fan of this if it wasn't in something
> like a PR or ml post where it could be reviewed before being made final.
> But the most impportant part IMO would be being up-front about it.

I'm afraid that wouldn't help much.  From my experiences, it would be
less effort for us to help writing it from scratch, than trying to
untangle whatever verbose shit ChatGPT generates.  Especially that
a person with poor grasp of the language could have trouble telling
whether the generated text is actually meaningful.

-- 
Best regards,
Michał Górny

signature.asc
Description: This is a digitally signed message part

[gentoo-dev] Last rites: app-misc/rmlint

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-28)
# The project is not really actively maintained upstream, and it still
# depends on dev-python/nose.  There are other tools with similar
# functionality.
# Removal on 2024-03-29.  Bug #878695.
app-misc/rmlint

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

[gentoo-dev] Last rites: dev-python/nose

2024-02-27 Thread Michał Górny

# Michał Górny  (2024-02-28)
# Nosetests have been abandoned in 2015.  Upstream (while technically
# still around) has refused to accept any patches since, and we have
# already had to fork it, to keep it somewhat working.  All
# the remaining reverse dependencies were finally ported or last rited.
# Removal on 2024-03-29.  Bug #822414.
dev-python/nose

-- 
Best regards,
Michał Górny



signature.asc
Description: This is a digitally signed message part

[gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

[gentoo-dev] Last rites: sci-chemistry/nmrglue

[gentoo-dev] Last rites: sci-biology/biopandas

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

[gentoo-dev] Last rites: app-misc/binwalk

[gentoo-dev] Last rites: app-admin/salt, dev-python/pytest-salt-factories, dev-python/boto

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] 2024-02-26-debianutils-drops-installkernel-dep: add news item v2

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

Re: [gentoo-dev] Last rites: app-admin/salt, dev-python/pytest-salt-factories, dev-python/boto

Re: [gentoo-dev] RFC: banning "AI"-backed (LLM/GPT/whatever) contributions to Gentoo

[gentoo-dev] Last rites: app-misc/rmlint

[gentoo-dev] Last rites: dev-python/nose

26 matches

Site Navigation

Mail list logo

Footer information