{"id":2872,"date":"2024-10-03T01:10:08","date_gmt":"2024-10-03T01:10:08","guid":{"rendered":"https:\/\/usatrustedlawyers.com\/blog\/meta-hit-with-class-action-for-allegedly-using-pirated-books-to-train-ai-models\/"},"modified":"2024-10-03T01:10:08","modified_gmt":"2024-10-03T01:10:08","slug":"meta-hit-with-class-action-for-allegedly-using-pirated-books-to-train-ai-models","status":"publish","type":"post","link":"https:\/\/usatrustedlawyers.com\/blog\/meta-hit-with-class-action-for-allegedly-using-pirated-books-to-train-ai-models\/","title":{"rendered":"Meta Hit With Class Action for Allegedly Using Pirated Books to Train AI Models"},"content":{"rendered":"\n<div data-v-1b187944=\"\">\n<div data-v-1b187944=\"\">\n<p>Facebook parent Meta Platforms Inc. is the latest target of litigation aimed at Big Tech companies that allegedly use copyright-protected books to train their artificial intelligence models without the authors&#8217; consent.<\/p>\n<\/div>\n<p>  <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Lieff Cabraser\u00a0Heimann &amp; Bernstein and Cowan, DeBaets, Abrahams &amp; Sheppard filed a class action on behalf of\u00a0lead plaintiff Christopher Farnsworth, author of the\u00a0&#8220;Nathaniel Cade&#8221; fiction series, against Meta on Tuesday, claiming it stole &#8220;hundreds of thousands&#8221; of\u00a0copyrighted books from a pirated online collection to build its large language model set, &#8220;Llama.&#8221; The complaint, filed in the U.S. District Court for the Northern District of California\u00a0in San Jose\u00a0alleged copyright infringement under\u00a017 U.S. Code \u00a7 501. Counsel has not yet appeared for the defendant.<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Meta first launched its flagship LLM family, then stylized as\u00a0LLaMA, in Feb. 2023\u00a0in the Big Tech race to compete with the debut of OpenAI&#8217;s trailblazing generative AI chatbot, ChatGPT, in Nov. 2022. Meta released &#8220;Llama 2\u2033\u00a0for commercial use in July 2023 and its latest iteration, &#8220;Llama 3,&#8221; to build its AI assistant &#8220;Meta AI&#8221; on April 18, 2024.<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>According to the complaint, Meta downloaded and copied almost 200,000 copyrighted books from &#8220;Books3,&#8221; a library of copyrighted works\u00a0scraped by developer Shawn Presser from\u00a0the pirated book website Bibliotik.\u00a0&#8220;Books3\u2033 is part of &#8220;The Pile,&#8221; an open-source online dataset\u00a0hosted by nonprofit EleutherAI that was specifically designed to train large language models. LLMs are conditioned to simulate human communication by ingesting and processing massive quantities of data that effectively &#8220;teach&#8221; it to generate predictive written responses. The\u00a0complaint claims that Meta publicly disclosed it used data from Books3 to train its LLMs in a Feb. 2023 <a href=\"https:\/\/arxiv.org\/pdf\/2302.13971\" rel=\"nofollow noopener\" target=\"_blank\">research paper<\/a>.<\/p>\n<\/div>\n<p> <!---->  <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Meta and the plaintiff&#8217;s counsel did not immediately respond to requests for comment.<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>&#8220;These platforms are operating on the principle &#8216;move fast and break things and pay for it later,'&#8221; said\u00a0Sullivan &amp; Worcester partner Mike Palmisciano, who specializes in transactional intellectual property matters. &#8220;Let&#8217;s develop these products, become kind of essential in the marketplace, and then figure out how we go from there.&#8221;<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>This is not the first time Meta has faced allegations of\u00a0stealing copyrighted material from Books3 for AI training purposes. A coalition of writers including\u00a0comedian Sarah Silverman sued both Meta and OpenAI\u00a0in California federal court in July 2023 on similar claims of copyright infringement. The Associated Press <a href=\"https:\/\/apnews.com\/article\/ai-copyright-lawsuit-zuckerberg-deposition-sarah-silverman-meta-df4dec4aef8924d38d258212e0654a3d\" rel=\"nofollow noopener\" target=\"_blank\">reported<\/a> on Sept. 27 that Meta&#8217;s CEO, Mark Zuckerberg, will be deposed as part of the class action against Meta.<\/p>\n<\/div>\n<p> <!----> <!---->  <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Lieff Cabraser, along with co-counsel at Susman Godfrey, are also representing the plaintiffs in a class action filed in August that accuses the AI startup Anthropic of misappropriating the texts on Books3 to train its own LLM collection, &#8220;Claude.&#8221;<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Palmisciano said that these types of copyright infringement claims will continue to escalate until a regulatory solution or court judgment &#8220;sets the guidelines for what&#8217;s permissible in the AI context.&#8221;<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>&#8220;I think the fair use argument that&#8217;s being made in the defense is hard to square with decades of case law on copyright fair use,&#8221; he said. &#8220;That being said, I would assume at some point we will get a \u2026 Supreme Court ruling on what constitutes fair use in the AI context and whether this type of large dataset ingestion is transformative in a way that protects the providers.&#8221;<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>Until the high court rules on the fair use issue, Palmisciano predicts that companies targeted by the litigation will continue to reach one-off settlements and monetary agreements.<\/p>\n<\/div>\n<p> <!----> <!----> <!----> <!----><\/p>\n<div data-v-1b187944=\"\">\n<p>&#8220;That seems to be what a lot of early financing for platforms like OpenAI is earmarked toward,&#8221; he said. &#8220;They have their tech development, of course, but they&#8217;re also reaching these really expensive and extensive licensing agreements for content that they&#8217;ve already ingested into their platform.&#8221;<\/p>\n<\/p><\/div>\n<p> <!----> <!----> <!----> <!----><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Facebook parent Meta Platforms Inc. is the latest target of litigation aimed at Big Tech companies that allegedly use copyright-protected books to train their artificial intelligence models without [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2873,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[715,2097,3110,650,1067,1290,199,3678,660],"class_list":["post-2872","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-lawyers","tag-action","tag-allegedly","tag-books","tag-class","tag-hit","tag-meta","tag-models","tag-pirated","tag-train"],"_links":{"self":[{"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/posts\/2872","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/comments?post=2872"}],"version-history":[{"count":0,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/posts\/2872\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/media\/2873"}],"wp:attachment":[{"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/media?parent=2872"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/categories?post=2872"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/usatrustedlawyers.com\/blog\/wp-json\/wp\/v2\/tags?post=2872"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}