{"id":1515,"date":"2024-02-12T11:03:41","date_gmt":"2024-02-12T11:03:41","guid":{"rendered":"https:\/\/parrottcliff.com\/?p=1515"},"modified":"2025-01-03T09:19:12","modified_gmt":"2025-01-03T09:19:12","slug":"is-there-a-chink-in-the-ai-dragons-armour","status":"publish","type":"post","link":"https:\/\/parrottcliff.com\/index.php\/2024\/02\/12\/is-there-a-chink-in-the-ai-dragons-armour\/","title":{"rendered":"Is There A Chink In The AI Dragon\u2019s Armour?"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1515\" class=\"elementor elementor-1515\">\n\t\t\t\t<div class=\"elementor-element elementor-element-u94fs0b e-flex e-con-boxed e-con e-parent\" data-id=\"u94fs0b\" data-element_type=\"container\" data-e-type=\"container\" data-settings=\"{&quot;background_background&quot;:&quot;classic&quot;}\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t<div class=\"elementor-element elementor-element-fe6a729 e-con-full e-flex e-con e-child\" data-id=\"fe6a729\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-c7345d2 elementor-widget elementor-widget-heading\" data-id=\"c7345d2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Is There A Chink In The AI Dragon\u2019s Armour?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-9290cc1 e-flex e-con-boxed e-con e-parent\" data-id=\"9290cc1\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-8d4de57 elementor-widget elementor-widget-text-editor\" data-id=\"8d4de57\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>L<\/strong>ately, we\u2019ve been hearing a lot about the inevitable AI revolution. It\u2019s going to be a bigger disruption than the Industrial Revolution! Many people are going to be made redundant! The big Hollywood studios will be able to make movies with just a series of magical prompts and whallah, an instant blockbuster. And all this, without paying those pesky artists! Well, maybe and then again, maybe not.<\/p><p>Will this seemingly unstoppable AI Dragon swoop down upon us, laying waste to jobs, human creativity, and the way we work forever? Like Tolkien\u2019s dreaded Smaug, maybe there\u2019s a scale missing near its heart. A weak spot where a miracle shot from a skilled bowman could take it down.<\/p><p>Whether the metaphorical bowman is considered a hero, or an annoying obstruction, all depends on which camp you\u2019re in. Are you in the one that\u2019s gleefully anticipating the oncoming AI Revolution? Or, instead of a revolution would you prefer a slower paced evolution? I place myself squarely in the latter camp.<\/p><h5>A Quick Back Story<\/h5><p>I\u2019ve been teaching myself Blender using one of the best teaching resources, YouTube. Perusing the multitude of quality tutorials, I came upon one that showed excerpts from the Blender Conference 2023. In one particular video entitled, \u201cAI, The Commons and the Limits of Copyright\u201d a dude named Paul Keller from an EU think tank called \u201cOpen Future\u201d mentioned the \u2018against\u2019 and \u2018for\u2019 camps in his talk.<a href=\"https:\/\/youtu.be\/QEJfOEk4WZ0?feature=shared\"> https:\/\/youtu.be\/QEJfOEk4WZ0?feature=shared<\/a><\/p><p>Mr. Keller\u2019s first camp consisted of people who feel that using other\u2019s copyrighted material to train generative AI on Large Language Models (LLM\u2019s), is tantamount to theft. On one of his slides, he quoted Naomi Klein, \u201cIt\u2019s daylight robbery.\u201d<\/p><p>In the opposing camp to which Mr. Keller placed himself, were the people who feel it\u2019s unjust to lock down AI technology \u201cthrough means of copyright.\u201d (@06:32 in the video). Mr. Keller\u2019s argues that copyright law is unfit to solve the issue.<\/p><p>His position is that copyright attaches to the outmoded concept of copying and making content available for sale, or otherwise. He posits that \u201cpeople in the know\u201d realise that to train AI, they just need to \u201c<strong><em>make a copy once, very early in the process<\/em><\/strong>, then the model learns from this copy, and there\u2019s no more copying.\u201d<\/p><p>His statement really struck a chord in me. They just have to copy it once, very early on in the process, then whallah &#8211; like magic the rest of the process isn\u2019t tainted? Really? What sort of thinking is going on in that so-called, Think Tank?<\/p><p>The question that kept rolling around in my head was whether the actions of tech giants taking copyrighted material without permission, to train their Generative AI (GenAI), is an unfair infringement of copyright? Then in December 2023, came news of a new lawsuit filed in New York State that may answer the question.<\/p><h5>New York Times v. OpenAI &amp; Microsoft<\/h5><p>Before the New York Times <strong><em>(NYT)<\/em><\/strong> filed their case, there were a number of US copyright cases previously filed that deal with the same or similar issue. The more notable are, \u201c<em>Getty Images v Stability AI, Authors Guild (George R.R. Martin) v OpenAI<\/em>\u201d, and \u201c<em>Sarah Silverman v OpenAI<\/em>\u201d. The slew of these AI copyright infringement cases are working their way through the litigation process. Similar to Tolkien\u2019s \u201cBard the Bowman\u201d shooting the legendary Black Arrow into the chink in Smaug\u2019s armour, each case has the potential to cripple the fledgeling AI industry. Or if AI wins one of them, it could set a precedent for the other ongoing cases. The stakes are high for the claimants and the rest of humanity.<\/p><p>But it&#8217;s NYT\u2019s case against OpenAI and Microsoft that caught my attention and, to me, it looks like the one that may have a good chance at hindering, if not stop altogether, massive wholesale copyright infringement. What must the NYT prove in court in order to win?<\/p><h5>Rules of the Game<\/h5><p>Without getting too caught up legal jargon, there are just two <em><strong>elements<\/strong><\/em> the NYT needs to prove by the \u2018preponderance of the evidence\u2019<a id=\"#ffn1\" class=\"footnote\" href=\"fn1\">1<\/a>. And under the US Federal statute <em>17 USC \u00a7501<\/em>, the two elements to win a copyright infringement case are:<\/p><p>The NYT is the owner of a valid copyright; and<br \/>OpenAI and Microsoft copied the original expression from the copyrighted work.<\/p><p>And that\u2019s it! Seems like a cakewalk, right? Well, the NYT is smart enough not to pop the champagne too early because copyright law can be a bit murky. There are defences that are available to OpenAI and Microsoft. But let\u2019s first go through the story of what happened, as provided by the NYT\u2019s attorneys in their complaint. To download and review yourself, please click on this link to download the <a href=\"https:\/\/parrottcliff.com\/wp-content\/uploads\/2024\/02\/NYT_Complaint_Dec2023.pdf\"><em><strong>Complaint<\/strong><\/em><\/a>.<\/p><h5>The Players<\/h5><p>As mentioned, it\u2019s the New York Times (NYT) filing the case so, they are the \u201cPlaintiff\u201d. The claim is against OpenAI, also known as \u201cChatGPT,\u201d making them a \u201cDefendant\u201d. Microsoft is also included as a Defendant. Why is Microsoft part of this? Because it\u2019s their Azure cloud servers that are the sole computing services for OpenAI. They designed the training of the GenAI by using the entire internet to scour content. It\u2019s alleged that they collaborated with OpenAI to take NYT\u2019s content, plus other people\u2019s content. The NYT claims that Microsoft\u2019s \u201cBing Chat\u201d service has created synthetic search results using its content.<\/p><h5>The Accusation<\/h5><p>The NYT\u2019s states that the Defendants together unlawfully used millions of its copyrighted materials to train their Generative AI (GenAI) tool via large-language-models (LLM\u2019s). They copied and stored the content in their computers and are currently still doing so without its permission, nor are they willing to fairly compensate the NYT for it.<\/p><p>The point the NYT makes is that it spent many years and a massive amount of investment creating their content and the Defendants are trying to get a free ride using it. Their content is the result nearly a 100 years of work, made by thousands of journalists, and costing hundreds of millions of dollars every year. To make these stories, some journalists had to be in harm\u2019s way, or even lost their lives.<\/p><h5>Proof?<\/h5><p>In its Complaint, the NYT claims that OpenAI outputs thousands of verbatim copies of its articles. It claims there are also close summaries to its articles and that OpenAI mimics the NYT\u2019s expressive style. There were many examples of verbatim copying in the NYT\u2019s Complaint. I\u2019ve included one of them so you can read for yourself.<\/p><p><img fetchpriority=\"high\" decoding=\"async\" class=\"aligncenter size-full wp-image-2741\" src=\"https:\/\/parrottcliff.com\/wp-content\/uploads\/2024\/02\/NYT_Complaint_Dec2023.jpg\" alt=\"\" width=\"869\" height=\"882\" srcset=\"https:\/\/parrottcliff.com\/wp-content\/uploads\/2024\/02\/NYT_Complaint_Dec2023.jpg 869w, https:\/\/parrottcliff.com\/wp-content\/uploads\/2024\/02\/NYT_Complaint_Dec2023-296x300.jpg 296w, https:\/\/parrottcliff.com\/wp-content\/uploads\/2024\/02\/NYT_Complaint_Dec2023-768x779.jpg 768w\" sizes=\"(max-width: 869px) 100vw, 869px\" \/><\/p><p>One of the kickers in copyright law is that if it\u2019s judged that a Defendant breached a copyright \u201c<strong>knowingly and wilfully<\/strong>\u201d, the damages are dramatically increased. Which is what the NYT\u2019s claims against the Defendants, they did it knowingly and wilfully. In fact, back in April 2023, the NYT confronted the Defendants and tried to enter into negotiations for fair compensation but to no avail.<\/p><p>The NYT\u2019s admits it allows search engines to index their content so people can find it in order to attract paying customers. They claim that the Defendants used this as an open invitation to take whatever content they wanted and then had the audacity to provide the same or very similar content as the NYT. Not only are the Defendants taking content without permission, they\u2019re competing with the NYT with their own content and losing paying customers.<\/p><p>The NYT claims that OpenAI attributes wrong information to the NYT\u2019s articles. ChatGPT sometimes glitches and lies, or gives misinformation, stating falsely that the info came from the NYT. In the complaint, it gives several examples of this glitch, which the industry refer to as \u201challucinations.\u201d This false attribution has hurt the NYT\u2019s reputation for integrity.<\/p><h5>The \u201cFair Use\u201d Defence<\/h5><p>The Defendants admit to using the NYT\u2019s unlicensed content to train their GenAI models but they claim that what they did falls under the \u201c<strong>Fair Use Doctrine<\/strong>\u201d (<em>17 USC \u00a7107<\/em>). This doctrine covers a lot of common sense issues such as, a plaintiff can\u2019t claim copyright on every day words and phrases. OpenAI\u2019s defence is that it\u2019s using the NYT\u2019s content for a \u201c<em>transformative<\/em>\u201d purpose. What is a \u201c<em>transformative use<\/em>\u201d? Sounds like alchemy, doesn\u2019t it?<\/p><p>Basically, the Defendants claim that what they did with the NYT\u2019s content was to transform and change it so it has a new meaning, expression, or message so it\u2019s different from the original work.<\/p><p>In a case where the \u201cfair use by transforming\u201d defence was upheld, very little of the original content was used and there was an element of parodying the original content, mocking and making it into new \u2018transformed content\u2019 (<em>Campbell v. Acuff-Rose Music, 510 U.S. 569 (1994)<\/em>). From the case law I read, it appears that transforming content by parodying it, was a common theme.<\/p><p>On the other hand, there\u2019s case law where this \u201cnew transformative use\u201d was rejected. In the case of <em>Warner Bros. v RDR Books, 575 F. Supp. 2d 513 (S.D.N.Y. 2008)<\/em>, a Harry Potter Dictionary was claimed to breach copyright. This claim was upheld because because it used original content verbatim.<\/p><h5>Is There An Opening For An Arrow?<\/h5><p>Looking at the NYT\u2019s complaint against OpenAI and Microsoft, out of the 7 counts, the one that looks like it could really hit the mark is the <span style=\"text-decoration: underline;\">verbatim copying<\/span> of its articles. And with millions of articles it claims were knowingly and wilfully pilfered by the Defendants, the damages could be in the billions. Now, that\u2019s a Black Arrow into the heart.<\/p><p>Not only are the NYT asking for damages, they are entitled to ask the court to demand the Defendants to destroy ChatGPT\u2019s LLM models and training sets that contain its content. It\u2019s a big if but, <em><strong>IF<\/strong> <\/em>this is the result, it would curtail AI\u2019s onslaught and in my opinion, rightly so.<\/p><p>Getting back to Mr. Keller\u2019s Blender Conference 2023 remarks, by holding OpenAI and other tech giants accountable for their \u201cdaylight robbery,\u201d it will stop this \u201c<em>copying just once<\/em>\u201d practice. If these well funded AI developers want to use content to train their models, then make them pay for it like everyone else.<\/p><p>The content owners have a copyright and, contrary to what Mr. Keller &#8216;thinks&#8217; in his think tank, copyright law is exactly what needed here and is the correct tool to administer justice. Nothing\u2019s novel here, there are people who want to take other\u2019s property without paying for it and then make billions from it. That\u2019s what I call, good old fashioned corporate greed.<\/p><p>It\u2019s up to lawyers, judges, and juries now. Hopefully there\u2019s a <strong>Bard the Bowman<\/strong> attorney that will plant an arrow in the heart of the AI Dragon. And it\u2019s plaintiff\u2019s the size of the New York Times and Getty Images that can battle these AI behemoths.<\/p><p>I will keep an eye on this slew of AI infringement cases and will follow up on this article as they progress. And in the name of full disclosure, and to rub its nose in it, I have indeed\u00a0 used AI to create the cover artwork for this article &#8211; haha!<\/p><h6>Footnotes<\/h6><ol><li id=\"fn1\u201d\">In criminal law, the burden of proof is usually, \u201cBeyond a reasonable doubt.\u201d But in certain civil cases, the burden is \u201cBy a preponderance of the evidence\u201d. Meaning that the plaintiff has to prove to a judge\/jury that there is a greater than 50% chance their claim is true.<a href=\"\u201c#ffn1\u201d\">back<\/a><\/li><\/ol>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-3522f3d e-flex e-con-boxed e-con e-parent\" data-id=\"3522f3d\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-a0e325e elementor-widget elementor-widget-text-editor\" data-id=\"a0e325e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><a href=\"https:\/\/parrottcliff.com\"><img decoding=\"async\" class=\"wp-image-1517 alignleft\" src=\"https:\/\/parrottcliff.com\/wp-content\/uploads\/2023\/12\/Caricature-02-400x400-1.png\" alt=\"\" width=\"118\" height=\"125\" \/>Parrott.Cliff<\/a> is the website and source for blog articles for the award winning animator, writer, director and producer, Clifford Parrott. He resides on Ireland&#8217;s west coast where he tries to surf as much as possible, and also help run Magpie 6 Media with his wife Christina, in that order.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>OpenAI and US Copyright Clash<\/p>\n","protected":false},"author":1,"featured_media":2719,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[11,1],"tags":[24,25,34,26,27,36,32,31,30,28,29,33,35],"class_list":["post-1515","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai_tech","category-blog","tag-ai","tag-ai-generated","tag-big-tech","tag-copyright","tag-copyright-infringement","tag-davey-v-goliath","tag-federal-court","tag-lawsuit","tag-microsoft","tag-new-york-times","tag-openai","tag-tech","tag-tech-giants"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/posts\/1515","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/comments?post=1515"}],"version-history":[{"count":38,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/posts\/1515\/revisions"}],"predecessor-version":[{"id":2908,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/posts\/1515\/revisions\/2908"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/media\/2719"}],"wp:attachment":[{"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/media?parent=1515"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/categories?post=1515"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/parrottcliff.com\/index.php\/wp-json\/wp\/v2\/tags?post=1515"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}