0 votes
ago in Meta by

LLM use appears to have a bunch of ethical and practical problems; the most pressing seems to be plagiarism, which looks excessive even for code and even when not baited:

Video clip of what appears to be a lawyer live-demoing Co-Pilot plagiarism

I don't know anything about law, but the lawyer at one point says: "This is a copyright infringement." I've also found this: https://www.twobirds.com/en/insights/2025/landmark-ruling-of-the-munich-regional-court-(gema-v-openai)-on-copyright-and-ai-training It seems to discuss "fair use" in the context of AI model training.

Also see this high-profile incident: https://www.pcgamer.com/software/ai/microsoft-uses-plagiarized-ai-slop-flowchart-to-explain-how-github-works-removes-it-after-original-creator-calls-it-out-careless-blatantly-amateuristic-and-lacking-any-ambition-to-put-it-gently/

Also this field study, which appears to put the plagiarism rate at at least 2-5%: https://dl.acm.org/doi/10.1145/3543507.3583199

This article mentions a study that apparently puts the plagiarism rate at a minimum of 8–15% for the easily detectable kind: https://www.theatlantic.com/technology/2026/01/ai-memorization-research/685552/

I don't know what this means legally, but at least morally and ethically this seems sad.

Apparently, some people in the Clojure space already spoke out against LLMs, but I wasn't able to find any anti-LLM policy. If there were such a policy, I would expect it to be mentioned in the obvious places.

I simply wanted to suggest that perhaps the project may want to adopt such a policy.

Here are some other projects that have already done so: Asahi Linux, elementaryOS, Forgejo, Gedit, Gentoo, GIMP, GoToSocial, Löve2D, Loupe, NetBSD, postmarketOS, Qemu, RedoxOS, Servo, stb libraries, Zig.

My deepest apologies if there already is such a policy, or if I'm asking in the wrong space.

2 Answers

+1 vote
ago by

My feeling here, as a contributor to several Contrib libraries that are under the Clojure CLA (and a one-time contributor to core!), is that the CLA already covers this from a legal standpoint:

"You covenant, represent, warrant and agree that:

  • each contribution that you submit is and shall be an original work of authorship and you can legally grant the rights set out in this RHCA;"

(plus various other clauses that cover copyright ownership/grant/etc)

I think any public declarations that "we don't accept AI-generated contributions" are performative.

If you don't have a legally-binding contributors' license agreement? Sure, then you probably need a stated policy about contributions -- but if you're concerned enough about AI-generated contributions to think that, you probably ought to have a legally-binding contributors' license agreement...

ago by
Something I should have mentioned in my answer (but wanted to confirm details before I added it):

The US Copyright Office has ruled that AI-generated output -- text, code, images -- cannot be copyrighted and the courts have agreed. Which means that you cannot legally submit AI-generated code to a project that requires you to have copyright assignment authority -- as Clojure does.
ago by
Hmmm, interesting. Does that mean that certain open source projects are not _allowed_ to use LLM generated content, even if they wanted to?

Will we end up with two parallel infrastructures?

One built on licenses that are guaranteed to have zero autocomplete used in the creation of the software (given some bureaucrat's definition of autocomplete), and another built on the public domain?

Would you end up with a "FreeThis" and a "FreeThat", etc. - just with different names, that have most of the same features but don't force you to use Vim or notepad.exe to write your code in order to satisfy some random person's definition of "too much automation"?
ago by
I should take that back. Vim users are launching missiles at me with three and a half keystrokes right now...
ago by
If you have a copyright-assigning contributors' license agreement, I don't believe you can legally submit AI-generated code -- since you don't own and cannot transfer copyright on that code.

I suspect the big legal question which we'll need the courts to figure out when a challenge arises is: at what point does "auto-complete" on "your" code become "AI-generated" and therefore non-copyrightable?

I don't think anyone knows the answer to that yet.

If I write the code and have Claude fix a bug in it that amounts to a few lines that I could have been inspired to write myself based on, say, a code fragment I found on StackOverflow as a fix for a similar bug... Is that still copyrightable (i.e., does that small bug fix inclusion invalidate my copyright as a whole on the rest of the code I personally wrote)? What is "fair use" in the context of AI-generated code fragments?
ago by
Mmhmm, very interesting.
0 votes
ago by

Hi Ellie, I'm one of these Clojure users who has vocally favored the use of LLMs in the Clojure community so I've been looking forward to contributing to a conversation about this. Thanks for bringing it up.

I'd like to address each of your points.

LLM use appears to have a bunch of ethical and practical problems

Information technology has a bunch of ethical and practical problems. That isn't new. One of the major ethical and practical problems with information technology today is copyright and patent law.

I don't know anything about law, but the lawyer at one point says: "This is a copyright infringement."

I'm okay with laws that prevent merchants from lying about the provenance of their goods, but I take offense at the notion that people are not free to transact honestly with one another using the information they already possess. And I reject the notion that people are not sovereign owners of the information they possess. I take offense at those who try to impose constraints on my sovereignty in that regard.

Also this field study, which appears to put the plagiarism rate at at least 2-5%

Impersonation will increase with AI. And people who pretend to accomplish things they didn't actually accomplish deserve to be found out. Fraud is illegal, and cheating is punishable in most environments. But that should not be construed as restricting the reuse of publicly available information about how the world works. Reusing public information and repurposing it in some honest way that is useful in a transaction of mutual interest between people - that freedom must not be curtailed.

I don't know what this means legally, but at least morally and ethically this seems sad.

The legality of copyright and patent law under a liberal system where the subjects of a state are taken to be sovereign agents, with inalienable rights, has always been on a precarious crash course. Human individual sovereignty is simply not compatible with artificially imposed information embargoes for the sake of (supposedly) temporary economic monopolies.

And this crash course was always destined to explode when AI arrived. These two worlds cannot coexist. And we've known this for decades: all byte streams can be reinterpreted; you can't actually own bytes; a universally coherent patent system in time and space is not mathematically definable without letting one attacker patent the whole thing.

An AI future was never going to be compatible with this artificial information monopoly system that we greedily invented a few hundred years ago, under the newfound powers of the panoptical state.

Apparently, some people in the Clojure space already spoke out against LLMs, but I wasn't able to find any anti-LLM policy.

I would encourage people in the technology community to debate these topics further before jumping to conclusions.

But if we look at where things are going with AI, I think it's fair to say that some percentage of their outputs will be BS for a very long time.

But there's a silver lining to that: people will still be needed to filter that BS for a very long time.

So I think we'll develop human filtration systems to channel and filter out the noise coming out of these generators.

And the larger and more important a software project is, and the more risk there is to changing a given piece of code, the more human filtration we'll want in that pipeline.

For the Linux kernel, one would hope, so many human eyeballs have reviewed a given piece of code before it's committed that it shouldn't even matter whether a human wrote it - many humans agree the code is the right direction. That's what matters.

So for projects like Clojure, you can have rules like "humans have to see this first," but that's already so obvious - everybody knows Rich would never let a branch of code enter core without his full agreement, even if an alien came from space and handed it to him.

For projects with very distributed control, where the direction of the project needs to be some principled philosophy that can survive any future leader's opinion on the project's direction, I can see the point in creating more abstract rules of engagement for how human filtration systems will limit the rate of BS leaking into a codebase.

Clojure isn't one of these distributed-control projects. It's a collection of cool technology bits from a guy we trust not to let slop in, whether from humans or computers. That's already the value proposition. If I were to propose that Rich let more LLM-generated content into Clojure, do we all not already know what the answer would be? Are these anti-LLM policy documents symbols of political solidarity around group grievances regarding climate, plagiarism, and slop? Or are you really worried Clojure might end up with slop in it?

Ultimately, going forward, given the avalanche of code that LLMs are about to create for us, I think having human-oriented slop filtration systems is going to be a necessary component of most open source ecosystems - so I'm not against Clojure having slop-prevention systems. But Clojure IMO is one of the least likely to ever have that happen anyway. It already has the strongest possible slop filtration system: all Clojure core code changes must transact through the mind of a single person named Rich Hickey. That's already the contract.

As a proponent of using more LLMs to help us explore the boundaries of what is possible, I'm also in favor of communities like Clojure adopting tools, policies, and procedures to constrain the rate of change. So I wouldn't be mad at seeing policies from Clojure around it. I would just caution everyone against producing policy documents "Against AI" that won't even mean anything in two years, when everyone has moved on and it's normal. I would frame it as preserving Clojure code quality, as opposed to some sense in which we can turn back time and somehow not have code being generated by LLMs. That's not going to happen, folks.

Anyway. I have some strong opinions on these topics, so I very much appreciate it when folks bring them up, giving us all the opportunity to think about these things a little more deeply - so thank you for posting this. I think it would be unhelpful for the Clojure community to "go to war with AI," but I'm totally in favor of arguing and having debates about pushing this stuff in the right direction, and some of that will be policy docs and guidelines and whatnot. Just my 2 cents; take it with a grain of salt.

ago by
edited ago by
I thank you for your opinions. I would like to keep my response short and focused:

1. If Clojure doesn't allow LLM code contributions for the Clojure core contributors, does that impact you as a user? (My apologies if you're actually one of those contributors.)

2. While copyright has its issues, people who are in favor of dropping it typically don't propose an alternative. In the absence of copyright, wouldn't all code be CC0? Even if you personally like that, what would that do to the FOSS ecosystem? I personally doubt it would be in anybody's interest as a whole. The chardet incident seems to show this isn't purely hypothetical.

3. Isn't there a difference between pure knowledge and e.g. valuing somebody's concrete work with as little as attribution? There seems to be some nuance here occasionally missed.

4. A ban is reversible. Having tons of LLM code in your code base that you derive from may not be. Wouldn't banning it for now perhaps be the measured choice to allow more debate?

5. "[...] some sense in which we can turn back time and somehow not have code being generated by LLMs. That's not going to happen, folks." Plenty of FOSS projects seem to be willing to put that to the test, so perhaps Clojure might consider joining them?

I don't know if that makes a ban a good idea, but I hope my input helps others figure that out.
ago by
No, I think those questions make sense. Things to consider...

I'll definitely respond soon, but I'd like to take some time to really consider your questions and give some others the opportunity to chime in.

Again, thanks for bringing this up!
ago by
> 1. If Clojure doesn't allow LLM code contributions for the Clojure core contributors, does that impact you as a user? (My apologies if you're actually one of those contributors.)

Does Clojure advertise some method by which you can be guaranteed to have your code contributed into core? Just because you're a human doesn't mean you should be able to contribute to projects. Clojure already doesn't accept drive-by PRs. It's already highly insulated from that noise. If you bring a sloppy patch, regardless of how you contrived it, you deserve to have it rejected quickly. If you had an LLM contrive it and didn't do your due diligence to vet it, you deserve negative feedback for that, or be blocked if you persist. And if literal bots start slinging PR slop at your repo, by all means block that - we'll have to start dealing with bot spam at a societal level if they start flooding our inboxes and ringing our phones like that ending scene from that Lawnmower Man movie lol. But I mean, I don't get the feeling Clojure core is super at risk here.

> 2. While copyright has its issues, people who are in favor of dropping it typically don't propose an alternative.

So before copyright and patents, artists and engineers did exist. They mostly received payment via direct commission - someone would ask them to make the piece of art or engineering beforehand, and then they would go make it. This still happens even with copyright today, where publishers give authors or artists an "advance" on their work. But then the publisher mostly takes your ownership rights. And that's the thing: most artists and engineers these days get paid that way - salary. 99% of all art and engineering is produced by people who don't actually hold the copyright. And many of the companies holding them are only doing so defensively, to protect themselves from legal predators.

And even today, most musicians for instance will tell you that they make more money touring than they do off royalties. The distribution and publishing companies have monopolized the system and are essentially gatekeeping innovation instead of incentivizing it. The opposite of what it advertises as its benefit.

> In the absence of copyright, wouldn't all code be CC0? Even if you personally like that, what would that do to the FOSS ecosystem?

So copyleft was an immoral attack against an immoral system - a sort of fighting fire with fire.

It was a "legal hack" over the copyright system - "Oh, you think you can invent a contract that gives you the right to tell me what I do with information within the confines of my own living room? Well, watch me create a contract that gives ME the right to tell YOU what to do in the confines of YOUR living room."

And it was a clever hack, because if you want to believe the first contract is morally acceptable, you have to accept that the second contract is morally acceptable.

But neither contract was ever morally acceptable. Copyleft was just a hacky bandaid over that original sin of greed and avarice.

> I personally doubt it would be in anybody's interest as a whole. The chardet incident seems to show this isn't purely hypothetical.

The chardet incident is a perfect example of how copyleft is just as nonsensical as copyright. It just doesn't make sense. It was never going to make sense. It only made sense in a few people's imaginations, starting a few hundred years ago, and given how unnatural it is I doubt it will stay there forever in the future. The liberal order of things is based on "natural law," and copyright and patents are fundamental perversions of that.

> 3. Isn't there a difference between pure knowledge and e.g. valuing somebody's concrete work with as little as attribution? There seems to be some nuance here occasionally missed.

I don't follow. What do you mean here?

If you're asking whether I think information has more value if it comes from a human, I'd say absolutely yes, for so many reasons. First and foremost, that they actually _cared_ makes it 100 times more important to me. And if someone uses AI as a _substitute_ for the care that I expect from their art or engineering, especially when that substitution is being concealed - I'm likely to block them. I don't have time for any of that, and if y'all want to go to war against that kind of slop, I'll join you.

> 4. A ban is reversible. Having tons of LLM code in your codebase that you derive from may not be. Wouldn't banning it for now perhaps be the measured choice to allow more debate?

Yeah, it's probably reversible. Probably not a big deal. I just think it's premature at the present juncture. It wouldn't be prudent. Nobody would actually think that Rich would have let LLM slop in anyway, so it would just end up looking like a performative gesture. Which is fine, maybe... Again, certain kinds of slop I'm down to go to war with. But going to war with "AI"... I mean, there's farming and carpentry and lots of other pursuits that won't push the world in this direction - but developing more code is not one of them... So "technology projects against AI"... It's just too much cognitive dissonance; people might think it's silly... Be a farmer or something. I still might one day. Might be fun. There's some degree of anti-AI psychosis going around right now. Pro-AI psychosis too, but the anti-AI variant is growing. If shit gets super scary one day, I may join the ranks of the anti-AI crazies too, but I think this'll plateau in like 2 or 3 years, and then there'll be this constant 1% slop factor that requires humans in the loop for most important things for another decade or maybe even a century.

> Plenty of FOSS projects seem to be willing to put that to the test, so perhaps Clojure might consider joining them?

Maybe. I think it'd be silly but at the end of the day it's just a document, right?

> I don't know if that makes a ban a good idea, but I hope my input helps others figure that out.

And honestly I don't know if a ban is a bad idea. If it's becoming a problem for the core team and they're getting harassed and inundated by bot slop then yeah, maybe it would be necessary. I could imagine that becoming more of a problem in the future, to where you'd have to throttle that.

Anyway, it's an interesting debate.
ago by
edited ago by
"But then the publisher mostly takes your ownership rights."

This no longer works without copyright: the store (e.g. Amazon) could sell the book without paying the writer or publisher. This is kind of what many feel chardet is demonstrating.

"Isn't there a difference between pure knowledge and e.g. valuing somebody's concrete work with as little as attribution?"

What I meant is that LLMs break this attribution contract, too. LLMs break everything the contributor may have asked for via the license; see chardet. This will make people leave the FOSS ecosystem if it becomes the norm, and the question is how bad that effect will be. This is why LLMs are so disliked by some FOSS people: it's not just copyright, it's the moral contract.

And once you add an LLM plagiarized item to your project's code, you may be part of those who broke those moral contracts.

"Nobody would actually think that Rich would have let LLM slop in anyway"

True, but I doubt he'd know who might be using AI autocomplete without saying so.

A no-LLM policy might disincentivize LLM contributors beyond the obvious slop.
ago by
> might be using AI autocomplete without saying so

Yeah, if we can't use autocomplete I feel like we're starting to lose our grip on reality.

And I don't mean to be too offensive when I say that - we're living in very interesting times and we all have to process these changes - I too am having to recalibrate my intuition.

But just listen to what you're saying...

And walk out the consequences of your strategy over the coming years...
...