A judge has dismissed the majority of claims in a copyright lawsuit filed by developers against GitHub, Microsoft, and OpenAI.
A judge has dismissed the majority of claims in a copyright lawsuit filed by developers against GitHub, Microsoft, and OpenAI.
The lawsuit was initiated by a group of developers in 2022 and originally made 22 claims against the companies, alleging copyright violations related to the AI-powered GitHub Copilot coding assistant.
Judge Jon Tigar’s ruling, unsealed last week, leaves only two claims standing: one accusing the companies of an open-source license violation and another alleging breach of contract. This decision marks a substantial setback for the developers who argued that GitHub Copilot, which uses OpenAI’s technology and is owned by Microsoft, unlawfully trained on their work.
...
Despite this significant ruling, the legal battle is not over. The remaining claims regarding breach of contract and open-source license violations are likely to continue through litigation.
The judge also noted that the cited study itself mentions that GitHub Copilot “rarely emits memorised code in benign situations.”
"Rarely" is not zero. This looks like it's opening a loophole to copying open source code with strong copyleft licenses like the GPL:
Find OSS code you want to copy
Set up conditions for Copilot to reproduce code
Copy code into your commercial product
When sued, just claim Copilot generated the code
Depending on how good your lawyers are, 2 is optional. And bingo! All the OSS code you want without those pesky restrictive licenses.
In fact, I wonder if there's a way to automate step 2. Some way to analyze an OSS GitHub repo to generate inputs for Copilot that will then regurgitate that same repo.
It doesn't work like that. A copy is a copy. Only if you can make it credible that you independently produced the same code, can you get through with that. Hence, clean room implementations. It's not strictly necessary but deters lawsuits.
Apparently there's some confusion here what the judge ruled. This particular part is about claims under the DMCA, not copyright infringement. The relevant sections can be seen here: https://www.copyright.gov/title17/92chap12.html
[edit: link fixed. The claim was that "copyright management information" was removed; prohibited under these sections.]
Immediately lose the case because nobody is claiming that when copilot does emit copyrighted code verbatim it is magically stripped of copyright protections.