In a world where technology advances at an unprecedented pace, copyright issues are shifting to unexpected battlefields.
The recent legal clash involving Meta’s artificial intelligence (AI) model, LLaMA, and a group of authors, including comedian Sarah Silverman, has sparked fierce debates about whether training AIs with copyrighted material is the new frontier for copyright disputes.
Meta’s defense: Training AI as fair use
Meta is at the center of a legal maelstrom as it faces accusations of training its LLaMA AI model using copyrighted works without permission.
Sarah Silverman and authors Christopher Golden and Richard Kadrey have asserted that Meta’s LLaMA and OpenAI’s ChatGPT were trained on datasets obtained illicitly from “shadow library” sites, such as Bibliotik and Library Genesis.
Meta, disputing these claims, emphasized the “transformative” nature of using texts to train its AI and pointed to Authors Guild v. Google, Inc. as precedent. In that 2015 ruling, the 2nd Circuit found that Google’s copying of books to build a search tool was a transformative fair use, underscoring the idea that copying which serves a new and distinct purpose can qualify as fair use under copyright law.
Meta is leveraging this logic, suggesting that using copyrighted texts to train an AI and generate unique outputs is similarly transformative.
A broader legal challenge for the tech world
This isn’t an isolated incident. As AI becomes integral to more industries, its methods, especially data sourcing and training, are under heightened scrutiny. Both OpenAI and Google have faced related legal challenges, and earlier in 2023, Getty Images sued Stability AI for allegedly training its image generation tool on copyrighted images.
Joseph Saveri and Matthew Butterick, representing the three authors against Meta, highlight a growing sentiment among creators: many authors and publishers are apprehensive about AIs producing content reminiscent of copyrighted materials.
The human touch in a digital age
Despite the rising reliance on AI for various tasks, the human element remains irreplaceable, especially in fields that rely on emotional connection and intuition. This sentiment is echoed in industries outside the legal realm.
According to experts in branding and marketing, while AI can analyze data and predict market trends, the human ability to forge genuine connections with audiences delivers lasting impact; empathy, values, and emotions can’t be algorithmically determined or replicated.
Where to draw the line?
As AI integration deepens, the line between transformative use and copyright infringement grows blurrier. Companies assert that training AIs is transformative, as the AI produces original outputs distinct from the training data. However, content creators are alarmed at how easily their copyrighted materials can be accessed and used without consent.
This legal challenge poses fundamental questions: At what point does using copyrighted material for AI training cross the line? Can the industry establish clear guidelines for AI training that respect copyright laws while promoting innovation?
The collision of AI advancement and copyright law is inevitable, and this case is just the tip of the iceberg. As Sarah Silverman and other authors challenge the tech world’s giants, the outcome of these lawsuits will shape the future of AI training and potentially redefine the boundaries of copyright in the digital age. As AI technologies continue to evolve, industries and legal systems will need to find a balance between protecting the rights of content creators and fostering innovation.