Policy·Europe

FSF Urges Open Access for LLM Training Inputs

Global AI Watch · Editorial Team··3 min read·The Register
FSF Urges Open Access for LLM Training Inputs

The Free Software Foundation (FSF) has publicly urged Anthropic to disclose its training data used in developing large language models (LLMs), citing copyright infringement related to the unauthorized use of works under fair use provisions. This appeal follows Anthropic's recent settlement, wherein they established a $1.5 billion fund to remedy claims from copyright holders after facing a class-action lawsuit. The FSF specifically referenced their materials in the datasets used by Anthropic, advocating for transparency and sharing of model training inputs following the GNU Free Documentation License. The implications of this move underscore the growing debate surrounding data sovereignty in AI development. By pressing for open access to training datasets and full disclosure of training methodologies, the FSF aims to foster a culture of collaboration and fairness within the AI sector. This push could stimulate national AI strategies that prioritize ethical practices and assert more control over proprietary technologies, potentially reducing foreign dependency but also raising questions about compliance with existing intellectual property frameworks.

Free Daily Briefing

Top AI intelligence stories delivered each morning.

Subscribe Free →
SourceThe RegisterRead original

Explore Trackers