{"id":77894,"date":"2024-07-05T09:00:38","date_gmt":"2024-07-05T09:00:38","guid":{"rendered":"https:\/\/www.cryptocabaret.com\/?p=77894"},"modified":"2024-07-05T09:00:38","modified_gmt":"2024-07-05T09:00:38","slug":"openai-wants-new-york-times-to-show-how-original-its-copyrighted-articles-are","status":"publish","type":"post","link":"https:\/\/www.cryptocabaret.com\/?p=77894","title":{"rendered":"OpenAI Wants New York Times to Show How Original Its Copyrighted Articles Are"},"content":{"rendered":"<p><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/www.cryptocabaret.com\/wp-content\/uploads\/2024\/07\/openai-logo.jpg\" alt=\"openai logo\" width=\"300\" height=\"188\" class=\"alignright size-full wp-image-246884\" srcset=\"https:\/\/www.cryptocabaret.com\/wp-content\/uploads\/2024\/07\/openai-logo.jpg 693w, https:\/\/torrentfreak.com\/images\/openai-logo-300x188.jpg 300w\" sizes=\"(max-width: 300px) 100vw, 300px\">Starting last year, various rightsholders began filing lawsuits against companies that develop AI models.<\/p>\n<p>The list of complainants includes record labels, book authors, visual artists, and newspapers, including the New York Times. These rightsholders all object to the presumed use of their work to train AI models without proper compensation.<\/p>\n<p>The New York Times lawsuit targets <a href=\"https:\/\/en.wikipedia.org\/wiki\/OpenAI#Controversies\">OpenAI and Microsoft<\/a> and is steadily moving forward. OpenAI recently indicated that it would like to consolidate this case with a similar lawsuit <a href=\"https:\/\/torrentfreak.com\/newspapers-sue-openai-for-copyright-infringement-and-fake-news-240501\/\">filed by other newspapers<\/a>, but the Times objects to the proposal.<\/p>\n<p>While these issues are fought out in court, both parties have also moved into discovery. That basically allows one party to request evidence from the other, to properly support or refute the copyright infringement claims that form the basis of the lawsuit. <\/p>\n<h2>OpenAI Seeks NYT \u2018Source\u2019 Material<\/h2>\n<p>In its quest for evidence, OpenAI is particularly interested in the copyrights of the New York Times\u2019 works. This includes copyrighted news articles, which are often based on a variety of information gathered by its journalists. <\/p>\n<p>For example, discovery requests no. 10-12 read as follows:<\/p>\n<blockquote>\n<p><em>NO. 10: Documents sufficient to identify the expressive, original, and human-authored content of each of Your Asserted Works. <\/em><\/p>\n<p>NO. 11: Documents sufficient to identify the non-expressive, non-original, or non-human-authored content of each of Your Asserted Works. <\/p>\n<p>NO. 12: Documents sufficient to show each and every written work that informed the preparation of each of Your Asserted Works, regardless of its length, format, or medium.<\/p>\n<\/blockquote>\n<p>The New York Times is not happy with this approach. The company has refused to share reporter\u2019s notes and other information, as this would be overbroad and too burdensome. In addition, the Times pointed out that much of the information sought by OpenAI is protected by the reporter\u2019s privilege. <\/p>\n<h2>OpenAI Files Motion to Compel<\/h2>\n<p>The refusal has created a dispute between the parties and OpenAI has urged the court to weigh in. Ideally, the AI company wants the court to compel the Times to cooperate.<\/p>\n<p>OpenAI suggests that the \u2018source\u2019 information will help to determine what parts of the articles are \u2018original\u2019 and worthy of copyright protection, which may help it to counter the copyright infringement claims. <\/p>\n<p>\u201c[T]he Times cannot pursue a claim for infringement over any part of a copyrighted work that is not original to the Times, as would be the case if the Times copied another\u2019s work or elements in the public domain,\u201d OpenAI writes. <\/p>\n<p>In its complaint, the Times described how it invests enormous amounts of time and expertise on its articles, which are sometimes the results of months or years of in-depth investigations. OpenAI would like to know what this claim entails.<\/p>\n<p>\u201cHaving chosen to put directly at issue how the Times created the works at issue\u2014including the methods, time, labor, and investment\u2014OpenAI has a right to discovery into the same,\u201d OpenAI writes.    <\/p>\n<p>In preparation for its defense, OpenAI further wants to know what portions of the copyrighted articles are \u201cexpressive, original, human-authored content\u201d, and what parts are  \u201cnon-expressive, non-original, or non-human-authored content.\u201d <\/p>\n<h2>New York Times Refuses to Comply<\/h2>\n<p>Responding to the motion to compel, the Times makes it clear that the company doesn\u2019t intend to give in. It stresses that its articles are copyrightable, whether they include third-party material or not. <\/p>\n<p>\u201cOpenAI claims that the reporters\u2019 notes underlying the asserted works may shed light on whether The Times\u2019s news articles are really original, expressive content\u2014but that is not how copyright law works. The expressive nature of a work is determined by reference to the work itself. <\/p>\n<p>\u201cMoreover, even in the improbable case that a reporter\u2019s notes show that 90% of an article comprises verbatim quotes from the author\u2019s original sources, that article would still be protected by copyright,\u201d the Times adds.<\/p>\n<p>In addition, the newspaper reiterates that the discovery requests are overbroad, and invade the reporter\u2019s privilege. Although OpenAI stressed that it\u2019s not seeking to identify any confidential sources, its discovery request could have a chilling effect. <\/p>\n<p>If journalistic outfits are required to disclose all source material for every copyrighted article, it may severely impact their ability or willingness to bring copyright lawsuits against potential infringers. <\/p>\n<p>But perhaps that\u2019s precisely what OpenAI tried to achieve here, the newspaper notes. <\/p>\n<p>\u201cIndeed, given the wildly improper scope of this request, one has to wonder if a chilling effect is exactly what OpenAI, who appears to have stolen from millions of content creators, is hoping for,\u201d the Times writes.<\/p>\n<p><em>\u2014<\/em><\/p>\n<p>A copy of OpenAI\u2019s request to compel The New York Times to share the requested information is available <a href=\"https:\/\/storage.courtlistener.com\/recap\/gov.uscourts.nysd.612697\/gov.uscourts.nysd.612697.152.0.pdf\">here (pdf)<\/a>. This also includes other disputed requests. The New York Times\u2019 response can be found <a href=\"https:\/\/storage.courtlistener.com\/recap\/gov.uscourts.nysd.612697\/gov.uscourts.nysd.612697.156.0.pdf\">here (pdf)<\/a>.<\/p>\n<p><sub>* For the purpose of record keeping, the referenced court filings are the sources we relied on for this article. No AI assistance was involved. Human labor was required to select and organize some of the arguments put forward by the parties, while intentionally excluding others to add focus.<\/sub> <\/p>\n<p>From: <a href=\"https:\/\/torrentfreak.com\/\">TF<\/a>, for the latest news on copyright battles, piracy and more.<\/p>\n<p class=\"wpematico_credit\"><small>Powered by <a href=\"http:\/\/www.wpematico.com\" target=\"_blank\" rel=\"noopener\">WPeMatico<\/a><\/small><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Starting last year, various rightsholders began filing lawsuits against companies that develop AI models. The list of complainants includes record labels, book authors, visual artists, and newspapers, including the New York Times. These rightsholders all object to the presumed use of their work to train AI models without proper compensation. The New York Times lawsuit [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":77895,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[308],"tags":[],"class_list":["post-77894","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-torrent"],"_links":{"self":[{"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/posts\/77894","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=77894"}],"version-history":[{"count":0,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/posts\/77894\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=\/wp\/v2\/media\/77895"}],"wp:attachment":[{"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=77894"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=77894"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cryptocabaret.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=77894"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}