Training Is Everything: Artificial Intelligence, Copyright, and Fair Training
To learn how to behave, the current revolutionary generation of AIs must be trained on vast quantities of published images, written works, and sounds, many of which fall within the core subject matter of copyright law. To some, the use of copyrighted works as training sets for AI is merely a transitory and non-consumptive use that does not materially interfere with owners' content or copyrights protecting it. Companies that use such content to train their AI engine often believe such usage should be considered "fair use" under United States law (sometimes known as "fair dealing" in other countries). By contrast, many copyright owners, as well as their supporters, consider the incorporation of copyrighted works into training sets for AI to constitute misappropriation of owners' intellectual property, and, thus, decidedly not fair use under the law. This debate is vital to the future trajectory of AI and its applications. In this article, we analyze the arguments in favor of, and against
To learn how to behave, the current revolutionary generation of AIs must be trained on vast quantities of published images, written works, and sounds, many of which fall within the core subject matter of copyright law. To some, the use of copyrighted works as training sets for AI is merely a transitory and non-consumptive use that does not materially interfere with owners' content or copyrights protecting it. Companies that use such content to train their AI engine often believe such usage should be considered "fair use" under United States law (sometimes known as "fair dealing" in other countries). By contrast, many copyright owners, as well as their supporters, consider the incorporation of copyrighted works into training sets for AI to constitute misappropriation of owners' intellectual property, and, thus, decidedly not fair use under the law. This debate is vital to the future trajectory of AI and its applications. In this article, we analyze the arguments in favor of, and against, viewing the use of copyrighted works in training sets for AI as fair use. We call this form of fair use "fair training". We identify both strong and spurious arguments on both sides of this debate. In addition, we attempt to take a broader perspective, weighing the societal costs (e.g., replacement of certain forms of human employment) and benefits (e.g., the possibility of novel AI-based approaches to global issues such as environmental disruption) of allowing AI to make easy use of copyrighted works as training sets to facilitate the development, improvement, adoption, and diffusion of AI. Finally, we suggest that the debate over AI and copyrighted works may be a tempest in a teapot when placed in the wider context of massive societal challenges such as poverty, equality, climate change, and loss of biodiversity, to which AI may be part of the solution.
Executive Summary
This article examines the debate surrounding the use of copyrighted works as training sets for artificial intelligence (AI) and whether such use constitutes 'fair use' under US law. The authors analyze arguments for and against 'fair training' and consider the broader societal implications of allowing AI to use copyrighted works. They weigh the potential benefits, such as novel AI-based approaches to global issues, against the costs, including job replacement and intellectual property misappropriation. The article concludes that the debate may be overstated in the context of larger societal challenges, but still requires careful consideration of the legal and ethical implications.
Key Points
- ▸ The use of copyrighted works as training sets for AI is a crucial aspect of AI development
- ▸ The debate surrounding 'fair training' is contentious, with proponents arguing it is a transitory and non-consumptive use, while opponents consider it misappropriation of intellectual property
- ▸ The article considers the broader societal implications of allowing AI to use copyrighted works, including potential benefits and costs
Merits
Encourages AI Innovation
Allowing AI to use copyrighted works as training sets could facilitate the development of novel AI-based approaches to global issues, such as environmental disruption and climate change
Demerits
Intellectual Property Misappropriation
The use of copyrighted works without permission or compensation could be seen as misappropriation of intellectual property, potentially harming content creators and owners
Expert Commentary
The article provides a nuanced analysis of the complex issues surrounding the use of copyrighted works as training sets for AI. The authors' consideration of the broader societal implications is particularly noteworthy, as it highlights the need for a balanced approach that weighs the potential benefits of AI innovation against the potential costs and risks. Ultimately, the debate surrounding 'fair training' requires careful consideration of the legal, ethical, and social implications, and policymakers, industry leaders, and stakeholders must work together to develop solutions that promote innovation while protecting intellectual property rights.
Recommendations
- ✓ Policymakers should conduct a comprehensive review of existing copyright laws and regulations to ensure they are adapted to the AI landscape
- ✓ Companies developing AI systems should prioritize transparency and accountability in their use of copyrighted works, and consider alternative approaches that respect intellectual property rights