OpenAI Is Faulted by Media for Using Articles To Train ChatGPT (bloomberg.com) 89

Major news outlets have begun criticizing OpenAI and its ChatGPT software, saying the lab is using their articles to train its artificial intelligence tool without paying them. From a report: "Anyone who wants to use the work of Wall Street Journal journalists to train artificial intelligence should be properly licensing the rights to do so from Dow Jones," Jason Conti, general counsel for News Corp's Dow Jones unit, said in a statement provided to Bloomberg News. "Dow Jones does not have such a deal with OpenAI." Conti added: "We take the misuse of our journalists' work seriously, and are reviewing this situation." The news groups' concerns arose when the computational journalist Francesco Marconi posted a tweet last week saying their work was being used to train ChatGPT. Marconi said he asked the chatbot for a list of news sources it was trained on and received a response naming 20 outlets including the WSJ, New York Times, Bloomberg, Associated Press, Reuters, CNN and TechCrunch.
  • by dmay34 ( 6770232 ) on Monday February 20, 2023 @10:15AM (#63308089)

    I understand why these companies and their artists/writers are concerned and upset, but OpenAI has at least 3 very strong fair use claims in their favor.

    1) Educational, the articles were used to train the AI. The Output is not a copy.
    2) The use of the articles is different than originally intended, to train an AI instead of sale of publication.
    3) The output is decidedly transformative.

    Basically, if the publishers win, then ANY use of any copyrighted material is in violation. Artists would not be able to save a copy of a photo to create transformative art from.

    • Re: (Score:2, Interesting)

      by AmiMoJo ( 196126 )

      (1) Does educational use get them an exemption from copyright rules? I don't think teachers are allowed to copy entire articles just because they are using them to educate children, but correct me if I'm wrong.

      Of course the first hurdle is getting a court to agree that it is education, and not some kind of engineering process. Education generally only applies to humans, with even animals being trained rather than educated.

      (2) I don't see the relevance of that. If anything it's an argument in the publisher's favor…

      • Re: (Score:3, Insightful)

        by Immerman ( 2627577 )

        It doesn't really matter *what* they use the articles for. Copyright prevents people from redistributing the original or derivative works, or (sometimes) displaying/performing the work publicly. That's it.

        And the AI systems aren't doing any of that.

        If you can't point to a derivative work as say "look, this part right here is obviously copied from this piece of my work", then it's all but guaranteed to NOT be considered a derivative work.

        More conceptual derivatives - "inspired by", "done in the style of", etc.…

        • It does matter what they use the articles for. Specific purposes are exempted in the law, like education and parody.

          "It educated an AI" sounds like a legal contortion that will only work with a crooked judge, though. When I download a movie off the Pirate Bay, it "educates" my laptop on how to display it. The problem is my laptop is an inanimate object and it cannot be educated.

          • by Immerman ( 2627577 ) on Monday February 20, 2023 @11:12AM (#63308247)

            They are, but those are only relevant if you have otherwise infringed the copyright.

            Exactly how is letting an AI "learn" from your article infringing copyright?

            If you can point at an AI-generated work and say "this part right here was clearly copied from this work of mine", *then* you can make a copyright infringement claim on that particular AI-generated work. Until then you've got nothing.

          • Re: (Score:2, Insightful)

            by AmiMoJo ( 196126 )

            If it were allowed it would seem to open the floodgates. No point paying for an expensive celebrity voice actor, just "educate" an AI with their voice and get it to produce a "derivative" work.

            The only time that would work is for parody. Maybe we will see AI voices used there soon.

            • by Tx ( 96709 )

              If it were allowed it would seem to open the floodgates. No point paying for an expensive celebrity voice actor, just "educate" an AI with their voice and get it to produce a "derivative" work.

              Current copyright laws weren't written with this technology in mind though. You can't currently copyright the sound of your voice or your artistic style. Trying to stop it at the training phase, which is what we're talking about, saying "don't use our copyrighted works to train your AI", idk, I'm not a lawyer, so we'll see…

            • Re: (Score:2, Insightful)

              by drinkypoo ( 153816 )

              If it were allowed it would seem to open the floodgates. No point paying for an expensive celebrity voice actor, just "educate" an AI with their voice and get it to produce a "derivative" work.

              In the US we have the right of publicity [cornell.edu], and you get to control use of your likeness for non-fair-use purposes. So no.

              • Right of publicity is not copyright, and copyright laws cannot be used to directly support it. Also according to your own link only half of the states have such a law. I would bet the number which protect voice specifically is much smaller.
        Copyright also prevents you from "copying" an asset and storing it on your internal hard drive. E.g. if you download a movie for personal use, that is making a copy, still illegal even if you don't further distribute (in most jurisdictions).

          Which is what ChatGPT is doing in some form here, they are copying the work for commercial use without paying for the privilege and storing it in a highly compressed format (given the output from ChatGPT, it does 'reproduce' exact copies…

          • We're in the digital age - you cannot read this comment without having first copied it onto your computer.

            And copyright (as enforced) doesn't restrict *copying*, it restricts *distribution* (including public performance, where you're "distributing" it into other people's brains). If you want to print stills from Disney images as posters for your walls, you're fine. Try to sell one of those posters (or give it away) and Disney can come down on you.

            And yes, if ChatGPT makes an exact copy of something (or cle

            • by guruevi ( 827432 )

              Correct, but the EULA of Slashdot, NYT, etc. probably states that I can only do this 'copying' for personal purposes and not commercial reasons; I can't copy and paste the data, put it in a chatbot, and make it appear as if it was coming up with this stuff all by itself. Even just quoting or reciting from memory, I can't do commercially, pass off as my own, or reproduce without attribution unless some fair use clause applies.

              If I access NYT in the library, they often will have some sort of agreement that th

      • by dmay34 ( 6770232 ) on Monday February 20, 2023 @11:14AM (#63308251)

        The court precedent for #2 is Google books. Google was copying the books in their entirety, letting users search the books, showing the users clips of the books that matched their search criteria, and then offered to sell them the books.

        The courts found that all of this was fair use.

      • (1) Does educational use get them an exemption from copyright rules? I don't think teachers are allowed to copy entire articles just because they are using them to educate children, but correct me if I'm wrong.

        Here we get into the rather rubbery definition of "copy". If I read a news article on the Web, then do the contents of my computer screen constitute a copy? What about what's in my browser cache?

        Given that ChatGPT was likely trained by being given URLs, I suspect that any copyright claim here is not only dead in the water, it's dead before it even gets wet.

      • by Rei ( 128717 )

        What copyrightable material, under copyright, can ChatGPT produce verbatim? I add that adjective because basic facts are not copyrightable. Please be specific.

        I'll note that Google Books scanning in copyrighted books, en masse, without permission, and showing blurbs and even whole pages to users without compensation, was deemed by the courts to be transformative and fair use.

        • The legal standard is substantial similarity, not verbatim. Don't feed the troll.
        GitHub Co-Pilot and ChatGPT both have reproduced my own code (which is a small set of open source code in a very niche field) to a level that most would consider outright plagiarism (including mistakes). As long as you are precise enough with your search terms, you can basically get to a plagiarized version of the 'source material', which I'm assuming is what NYT and co. are going to have to prove in court…

          • by cowdung ( 702933 )

            I can believe this.

            Because Neural Nets are a form of lossy "compression" in one sense.

            So at times it could reproduce code verbatim (or text from an article) or paraphrase without giving credit (plagiarism).

            So this is a bit of a legal minefield.

            It will be interesting to see how it can be resolved.
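            The "lossy compression" point can be illustrated with a toy model. The sketch below is a bigram Markov chain, not how GPT actually works internally, but it shows the mechanism by which a statistical model of word sequences can end up regurgitating rare training text verbatim (the training sentence here is made up for illustration):

```python
from collections import defaultdict

# Toy illustration: a bigram "language model" trained on a single
# niche sentence. Because every word has exactly one observed
# successor, generation reproduces the training text verbatim --
# a crude analogue of an LLM memorizing rare training data.
training_text = "the quarterly ledger showed an unexplained shortfall of forty cents"
words = training_text.split()

successors = defaultdict(list)
for prev, nxt in zip(words, words[1:]):
    successors[prev].append(nxt)

def generate(start, max_len=20):
    out = [start]
    while out[-1] in successors and len(out) < max_len:
        # With only one training sentence there is one choice per word,
        # so this deterministic pick reproduces the source exactly.
        out.append(successors[out[-1]][0])
    return " ".join(out)

print(generate("the"))  # reproduces the training sentence verbatim
```

            With a large and varied corpus each word has many observed successors and the output becomes a blend; it is precisely the niche, rarely-duplicated material that a model is most likely to reproduce nearly verbatim.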

          • by Rei ( 128717 )

            We're not talking about GitHub Co-Pilot, we're talking about ChatGPT. Let's see it. Come on, how hard is it to be specific?

            • by guruevi ( 827432 )

              GitHub Co-Pilot and ChatGPT both

              Perhaps learn to read.

              • by Rei ( 128717 )

                And I'LL repeat, so YOU learn to read: we're NOT talking about GitHub Co-Pilot, so stop trying to introduce it into the conversation. Rather than trying this straw-man / red-herring approach, you have a simple task: present ACTUAL EXAMPLES of CHATGPT (not GitHub Co-Pilot) reproducing copyrighted material in a non-transformative manner.

        • by Rei ( 128717 )

          The legal standard is how substantially transformative it is, the amount of reproduced materials, the goals of reproduction, and a number of other factors.

          But let's see some examples. A claim has been made, let's see them.

      • (1) Does educational use get them an exemption from copyright rules? I don't think teachers are allowed to copy entire articles just because they are using them to educate children, but correct me if I'm wrong.

        Yes, they can [university...fornia.edu], if the entire article is needed for an educational purpose.

      • "Does educational use get them an exemption from copyright rules?"

        I'd have to think not. Otherwise, textbooks wouldn't cost hundreds of dollars since students would/could scan them and make free copies without consequence.

    • by DarkOx ( 621550 )

      Of course, technically speaking it's not the same thing, but if a professor assigned their article as class reading I wonder if they would be upset.

      Would they be upset if students, or anyone else for that matter, accepted and explored their ideas? We generally cite facts and opinions from other authors, but we don't generally cite broad, widely reproduced ideas and concepts. You would not put a citation after the sentence 'Many people believe the ability to set interest rates is an important monetary tool.' You…

      • by cowdung ( 702933 )

        But what part of GPT makes sure that it isn't paraphrasing, or that it limits itself to "general knowledge"?

        Quite to the contrary it very well may copy niche information or expression so the argument could be made that it can violate copyright and/or anti-plagiarism standards.

        This is an area where lawmakers will need to legislate, and in the present climate, where politicians see tech companies "not doing enough about X or Y", I don't see it going well for tech companies.

    • by Joce640k ( 829181 ) on Monday February 20, 2023 @10:59AM (#63308195) Homepage

      I wonder where those journalists got their writing ideas from?

      Was it from reading other journalists work?

      I'm sure they didn't learn in a vacuum.

      • Exactly. If I learn what certain financial terms mean by reading the Financial Times, I don't owe the publisher for training me.

      • Apparently, I owe tons of publishers extra money for reading their books, journals, and websites and using that accumulated knowledge in my career. Go figure. Wish I knew earlier; I wouldn't have read so much.

        Also a cautionary tale for teachers if they use any online resources or books to teach students.

    • Speaking of education, if someone wants to learn a skill, they need a teacher who is usually paid, unless they're intelligent enough to deduce the process themselves.

      An AI that doesn't know anything needs training because it can't teach itself, thus it also needs a teacher, which are in this case the writers of articles.

      • Speaking of education, if someone wants to learn a skill, they need a teacher who is usually paid, unless they're intelligent enough to deduce the process themselves.

        If that teacher makes you read a load of history books and you go on to become a history teacher, do you have to pay him/her and all the book authors royalties for all the things they taught you?

        Does everybody pay royalties to their university after they get a job and start earning money?

    • by cfulmer ( 3166 )

      You don't even need to get to fair use. Copyright protects a set of exclusive rights: the right to copy, to prepare derivative works, to distribute and to display publicly. If you're not doing one of those, then you're not infringing. Further, copyright only protects *expression* -- the underlying *ideas* are not protected. Derivative works are things like translations, screenplays, and so on. ChatGPT isn't doing that, AND it's not re-using the original expression.

      • copyright only protects *expression* -- the underlying *ideas* are not protected. Derivative works are things like translations, screenplays, and so on.

        Derivative works are defined by a combination of their origin, and recognizable elements from it. If ChatGPT is trained on something, and then produces something literally indistinguishable from that thing, then there is an argument to be made that it's violating copyright. And it's actually capable of doing that, because unlike the image-generating diffusion models, it's producing text output.

      • Comment removed based on user account deletion
    • An LLM is essentially an algorithm trained on a corpus. There are dozens of publicly available corpora that have taken samples from all kinds of copyright sources, which have existed for decades, e.g. COCA & BNC (which includes the NY Times & Reuters, BTW). I don't see anyone try to sue the people who compile & encode corpora.

      What's more, from what I understand, unlike a corpus, LLMs don't retain the copies of the texts in the corpus, only the resulting processing parameters after training. I
      • by cowdung ( 702933 )

        The problem is that modern language models such as GPT don't just retain the rules of grammar; they actually retain the data, style, and expression of millions of articles. And that's why they can reproduce some of it verbatim if it's rare enough.

        • Most people's understanding of what language is is wrong & LLMs & how they work are a demonstration of that. The thing is, it's pretty complex & difficult to understand, e.g. it's not constrained by grammar rules (contrary to Chomsky's conjecture about "Universal Grammar"). If you're really interested in how the structure of language develops in our minds (according to cognitive linguistics), here's a crash course but it requires some background knowledge of linguistics to get to grips with it:
    • And they have one very strong argument against them: They copied the copyrighted works, and that copying diminished the value of the copyrighted works.

      That they violated copyright is not disputed. That the use was "fair use" is what OpenAI will have to establish to defend themselves against the claim of copyright infringement. Since OpenAI can negatively affect the value of the original authors' works to such a large degree, effectively making it worthless, OpenAI has a rather steep hill to climb.

      • by dmay34 ( 6770232 )

        OpenAI's best argument is Authors Guild, Inc. v. Google, Inc.

        In that court case, Google, through the Google Books website, was copying the books in their entirety, saving them in their entirety into a database, letting users search the books for free without notice or license to the publishers or authors, showing the users image clips of the published books that matched their search criteria, and then offering to sell them the books.

        Courts decided that was totally cool.

        • Comment removed based on user account deletion
          • by dmay34 ( 6770232 )

            ...because Google made an agreement with the publishers and because of the implied subtext that Google wasn't actually damaging the market for the books.

            No, that's not what happened. The deals you mention that google tried to cut fell through (and probably would have been met with major anti-trust hurdles). The lawsuits went to trial and Google won everything.

        • Because Google only showed one image from a book. It was narrow enough not to be a problem, because a book is 300+ pages, so you're impacting less than 1% of the piece.

          ChatGPT, given enough time and prompts can reproduce pretty much any article, if the limits are removed, it could probably even reproduce the books it has stored to a very high degree of accuracy. The question is whether rewording and reproducing an article on any particular subject is copyright infringement (give me the opinion on cert

          • by dmay34 ( 6770232 )

            ChatGPT cannot reproduce any books because that's not how the data is stored in its database. There are no complete works stored. It stores and processes the data as patterns. When you type in a prompt, it runs an algorithm over its pattern-recognition model. This is why it has such a hard time reporting correct sources, and why asking for sources is the easiest way for teachers to catch students using ChatGPT for essays. No sources of any kind are stored to be referenced.

            • by guruevi ( 827432 )

              I don't believe that, it stores relations between words, which is what a book is. I don't write books, but I have seen it produce code and text verbatim from somewhere on the Internet that I myself published. Others claim it can produce similar texts to existing text.

              Now, it may not 'knowingly' have done that, it simply stores the relations between search terms and if your search term is niche or precise enough, that relation is close to 1-on-1, so it reproduces that 1 relationship it "knows" in its databas

    • Comment removed based on user account deletion
    • by Kisai ( 213879 )

      And that's what they want. They want Google to pay them for linking to their articles, and they want twitter to pay them for linking to their articles behind paywalls, which makes their links as good as spam.

      Burn down any media that tries to have it both ways, a subscription model and an ad model.

      I'm not saying you shouldn't pay for WaPo and NYT, but these companies kinda have no trust to them since they put the article behind a paywall, thus allowing misinformation to flow around it, since free news sites

  • by RKThoadan ( 89437 ) on Monday February 20, 2023 @10:18AM (#63308095)

    Does this AI expose whether these are factual results or just output derived from its language-learning algorithm? My understanding is you cannot take anything it says, even about itself, as being "real".

    • Are they really going to submit its own answer as evidence that it was trained on the WSJ? Do they have zero idea how large language models work?
    • by AmiMoJo ( 196126 )

      I'm surprised that Microsoft didn't make it include sources when they integrated it into Bing. I know nothing about the ChatGPT API so maybe it can't simply supply a list of sources.

      • by laughingskeptic ( 1004414 ) on Monday February 20, 2023 @06:35PM (#63309675)
        The text becomes a pile of vectors in multiple stages that are related to each other using various error functions. One of the big problems in AI/ML is "explainability". DNNs as we build them today do not lend themselves to explainability. It is generally not possible to work backwards from an output and understand how the output is driven by the input. This is because trying to track labeling metadata, such as a source document ID, through the entire training process, for every change to the internal state during training, would require more memory than the universe has atoms. Much simpler ML methods, like tree-based methods, are highly explainable because the "IF-THEN" operations that led to the result can be extracted for a given input. However, these methods do not produce as good results for complex problems as DNN methods.

        In some specific cases a human can identify what in the training set leads to certain behaviors. For instance, an image of a riot involving tear gas might be identified as "water buffalo" by ImageNet. Why? Because many pictures of water buffalo involve a dusty haze. So we learn that "dusty haze" was learned by the network to be a feature of "water buffalo". A human can figure this out after the fact; it is impossible for the network to understand the concept of "dusty haze", since that was never a training input, much less track this decision back through the 21 DNN layers to the 50 of 100 water buffalo training images that contained dusty haze.
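        The explainability contrast the parent describes can be sketched with a toy tree-based classifier. This is a hand-built tree with invented features (echoing the water-buffalo example above), not a real model: the point is only that a tree's decision path can be read back out as explicit IF-THEN rules, whereas a DNN's output is a composition of millions of weighted sums with no comparable trace.

```python
# A tiny hand-built decision tree: each internal node tests one
# feature against a threshold; leaves carry a label. The full
# decision path can be extracted as human-readable IF-THEN rules --
# the "explainability" that large neural networks lack.
tree = {
    "test": ("dusty_haze", 0.5),          # (feature, threshold)
    "left": {"label": "riot"},            # taken when feature <= threshold
    "right": {                            # taken when feature > threshold
        "test": ("large_animal", 0.5),
        "left": {"label": "landscape"},
        "right": {"label": "water_buffalo"},
    },
}

def classify(node, sample, path=None):
    """Return (label, list of the IF-THEN rules that fired)."""
    path = [] if path is None else path
    if "label" in node:
        return node["label"], path
    feature, threshold = node["test"]
    if sample[feature] <= threshold:
        path.append(f"IF {feature} <= {threshold}")
        return classify(node["left"], sample, path)
    path.append(f"IF {feature} > {threshold}")
    return classify(node["right"], sample, path)

label, rules = classify(tree, {"dusty_haze": 0.9, "large_animal": 0.8})
print(label)   # water_buffalo
print(rules)   # the exact tests that produced this answer
```

        Nothing analogous exists for a trained DNN: its "rules" are distributed across the weights, which is why attributing an output back to specific training documents is so hard.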
        • It is definitely possible: you can assign labels to your sources and make sure the labels come through; that's how we debug our neural nets. Obviously, as you say, the labels will end up being a long list for anything complex, but you could limit it to the first 5 highest-scoring hits and then give the option to scroll down. It will at least give some transparency.

    • by fermion ( 181285 )
      Not if it is trained using the WSJ. What it will learn is that the last US election was fraudulent and stolen. Lower taxes will spur the economy and eliminate the debt. Workers paid more will just waste the money buying meat. Natural disasters are never a crisis as you can just go to your second home or Cancun.
  • by Miles_O'Toole ( 5152533 ) on Monday February 20, 2023 @10:27AM (#63308107)

    They're using mainstream news media to train chatbots? No wonder they're turning into something like David 8 from Prometheus.

  • by wokka1 ( 913473 ) on Monday February 20, 2023 @10:31AM (#63308115)
    WSJ certainly doesn't mind the search engines using/scraping their data. I know this is over simplifying the concept, but the point stands.
  • First it was invasive ads, then paywalls. After that came getting governments to pass laws requiring social media and search engines to pay them when they are linked. Now they are suing OpenAI because its chatbot said that it was trained on their websites? How long until they just directly take $3 a month out of our bank accounts to "keep journalism alive"?
  • by Framboise ( 521772 ) on Monday February 20, 2023 @10:41AM (#63308149)

    Humans, like WSJ journalists, do the same as chatbots, sometimes even with better skills.

  • by Flytrap ( 939609 ) on Monday February 20, 2023 @11:06AM (#63308219)

    These companies don't own the news... they never have. They used to control the distribution of the news content in the days when owning a printing press, controlling an efficient physical delivery mechanism and having access to prime space on street poles, grocery stores and busy sidewalks was a prerequisite to being able to operate a successful newspaper or magazine business. Of course I am not forgetting the importance of the people who ran around and collected and bundled all this content into interesting news stories that we would want to read... but you know what I mean.

    I thought that this had been settled in the late nineties (or was it the early noughts) when these businesses realised that the internet was a way more efficient and faster distributor of the same time-sensitive content (ahem... news) that they were peddling a day later. I remember almost every news outlet (even CNN) experimented with trying to turn ordinary Joe (I should also say "and Jane" just to be politically correct) into a citizen journalist - they were usually the source of the story to begin with. But soon hundreds of thousands of people realised that they could tell their own story... skip the middleman. Ahhhh... I miss the internet chaos of the nineties, when blogs were cool and awesome.

    I think that the few news businesses that hire passionate journalists (who need to be more than just content gatherers) who do real journalism (that stuff that is taught in journalism schools) - as opposed to just reporting and distributing the news - are thriving. Why... because... well... the internet thingie got out of hand... who knows what is real and what is fake these days - it's just your interpretation of what you think are the facts vs my interpretation of what I think are the facts.

    Most of these companies are in the business of vacuuming up new interesting content that is already out there, curating it to fit whatever bias their audience holds, calling it news and shoveling it out. Just because the public is willing to pay for that curated delivery (because heaven forbid that we should inadvertently come across a viewpoint from a source that contradicts our own bias) does not give them ownership of the content... just its curation and delivery.

    Just my 2 cents worth.

    • They don't own the news, but they do own the stories written about the news. Under copyright, any written work, regardless of topic, is copyrighted unless the author specifically states that it is public domain.

  • There's no difference from a human reading news to gain intelligence - which is not additionally paid for.

    If OpenAI is using NYT to learn, they absolutely should pay the $14/mo for the net to have a subscription. That's obviously fair.

    They should also mark NYT as adversarial in their GAN and teach the AI how Fake News is lies and propaganda.

    If you tell it that NYT got a Pulitzer for covering up the Holodomor perhaps it can find patterns in other coverups over time that we've never discovered.

    That's the sys

  • This is like saying you're mad that Google didn't pay to index your article. You idiots, this is how you get eyeballs on your article.
  • "Major news outlets have begun criticizing OpenAI and its ChatGPT software, saying the lab is using their articles to train its artificial intelligence tool without paying them."

    I train my brain the very same way!

  • Section 107 calls for consideration of the following four factors in evaluating a question of fair use:

    Purpose and character of the use, including whether the use is of a commercial nature or is for nonprofit educational purposes: Courts look at how the party claiming fair use is using the copyrighted work, and are more likely to find that nonprofit educational and noncommercial uses are fair. This does not mean, however, that all nonprofit education and noncommercial uses are fair and all commercial uses are not fair…

  • by OldMugwump ( 4760237 ) on Monday February 20, 2023 @01:19PM (#63308661) Homepage

    Newspapers are used to train humans all the time. Schools use them; always have. Using newspaper contents to train AIs is no different, legally.

    Once I buy their paper (or get access to it online), I'm free to do anything I want with it, excepting only copying it and giving it to others (because there's copyright law about that). There are no other legal restrictions on use.

  • With all the hype about various media bodies using ChatGPT to write articles, are we now going to see ChatGPT being trained by ChatGPT output?

    What horrors will be produced by this incestuous vicious circle?

  • by drkshadow ( 6277460 ) on Monday February 20, 2023 @01:32PM (#63308717)

    It's pretty clear that ChatGPT is not outputting these news articles any more than you or I describing what we heard recently -- using _completely_ different rhetoric.

    Facts are not copyrightable. The rules of a game are not copyrightable. The presentation and representation are not copyrightable.

    It would be a hard sell to say that the transformation of the facts into another new output is copyrightable, otherwise everything that you or I say or think or put out is a derivation of something that someone, sometime had a copyright to.

    • No, but text written about facts certainly is copyrightable. If ChatGPT is regurgitating copyrighted text written by news or other publishers, they might be in violation of copyright.

  • Some of the companies complaining have paywalls, so why aren't they already covered? Seems like more of a problem for advertising-supported content, since the AIs probably don't have any spending money.

  • by account_deleted ( 4530225 ) on Monday February 20, 2023 @01:54PM (#63308803)
    Comment removed based on user account deletion
  • Expect everything from the guys who don't want to ponder how they could do better using a new tool.
  • The only reason people even kind of trust Wikipedia is the religious use of citations. AI powered chat bots should use something similar.
    • by cowdung ( 702933 )

      That is a challenge to be resolved.

      Right now it seems quite difficult. Maybe the backpropagation would need to keep track of what percentage each article added to each weight... that would be very hard and space-intensive.
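      The bookkeeping described above can be sketched for a trivial model. This assumes a hypothetical one-weight linear model y = w * x trained by gradient descent, with three made-up "articles" as training examples; tracking each source's share of the total weight movement is easy here and is exactly what becomes intractable with billions of weights:

```python
# Crude influence tracking: for a single-weight model y = w * x,
# accumulate how much gradient-driven weight movement each "article"
# (training example) contributed, then report percentages. Trivial
# for one weight; utterly impractical at LLM scale.
examples = {
    "article_A": (1.0, 2.0),   # (input x, target y); all satisfy y = 2x
    "article_B": (2.0, 4.0),
    "article_C": (0.5, 1.0),
}

w = 0.0
lr = 0.05
contrib = {name: 0.0 for name in examples}

for _ in range(200):  # epochs of plain per-example gradient descent
    for name, (x, y) in examples.items():
        grad = 2 * (w * x - y) * x        # d/dw of squared error (w*x - y)^2
        w -= lr * grad
        contrib[name] += abs(lr * grad)   # this example's share of movement

total = sum(contrib.values())
for name, c in contrib.items():
    print(f"{name}: {100 * c / total:.1f}% of total weight movement")
print(f"learned w = {w:.3f}")  # converges near 2.0
```

      Even this crude per-source tally triples the bookkeeping per weight update; scaling the idea to a model with billions of weights and a corpus of billions of documents is the space problem the comment above alludes to.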

  • If you put it on the internet it's no longer just yours.
  • Because I'm pretty sure that's all the publisher has a right to demand. If you pay for a subscription, you get to read the material. I don't see how a publisher can assert the right to charge extra for who or what reads the already-paid-for material.
