Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×

Comment Re:"Can't have it both ways" is the core argument (Score 1) 282

They're almost always the same. If there are any that aren't, I'd be shocked. He occupies the same sort of 'designated target of hate' that the Rothschilds did. In fact, that's really where it all starts -- a couple of political consultants working for Victor Orban, the Hungarian dictator, decided that a useful political tactic would be to have an enemy to demonize, so they rather arbitrarily decided it would be Soros. Read all about it.

And so we wound up with Hungary being thoroughly fucked up, Hungary impairing the functioning of the EU and NATO, increases in anti-semitism and fascism, probably daily death threats against a guy who did nothing wrong, and all to score some cheap political points.

It's disgusting.

Comment Re:"Can't have it both ways" is the core argument (Score 1) 282

Good thing that wasn't the argument, then. In fact, your summary of it doesn't even make sense -- middlemen don't get in trouble for taking things down, they get in trouble for not taking things down.

What actually happened was that just before the Internet got big, two cases came down concerning different online services. CompuServe got sued for user-posted content, but was found not to be liable because they had not moderated anything and were just a middleman. Prodigy got sued for user-posted content and was found to be liable because they moderated their boards (for things like bad language; they wanted to be family friendly) but had failed to moderate every post perfectly. By letting one bad thing through, they were liable for it -- and by extension, anything else they had failed to catch.

Since Congress wanted sites to moderate user content -- they were really concerned about porn -- they passed a law that encouraged sites to do moderation but did not hold them responsible for failing to moderate every single little thing perfectly in every respect. Further, sites got to choose what they were moderating for -- could be porn, but could just as easily be off-topic posts, like talking about carrots when everyone else is talking about money.

In practice, sites don't like to moderate much -- it takes effort, it may lose engaged users, it costs money, it can't please everyone -- but they certainly can, and there's nothing wrong with it. Get rid of the protection of the CDA and sites won't be able to do mandatory moderation sufficiently, so they'll fall back on none. This is apparently okay with scum who get kicked off of boards left and right, but should not be okay with people who have standards and don't want to put up with that crap.

Comment Re:If it's not fair use (Score 1) 64

It is?

I assume that you're replying to the part about fair use turning prima facie infringements into non-infringing uses, but it's difficult to tell.

If so, well, that's the statute at work:

Notwithstanding the provisions of sections 106 and 106A, the fair use of a copyrighted work ... is not an infringement of copyright.

You don't think it's at all strange that the Supreme Court insists on explicit analysis of all four fair use factors, yet if while doing so you refer or even rely on the work's transformative nature, hey, don't sweat thinking about the right to prepare derivative works, close enough I'm sure.

No factor is determinative in fair use, even though often enough the fourth factor is. Always do the full test, every time. And recognize that much comes down to which side of the bed the judge got up on. It's not as bad as the copyright utility doctrine and conceptual separability, but it's not great. Look at time shifting; people extoll it as a classic fair use but really it fails on three out of four factors and is middling at best on the fourth, and was originally going to go the other way. Prof. Litman has a nice write-up about it. And for every example of fair use, there's always a counter example where the same sort of thing came out differently due to a minor factual difference.

Anyway, got a cite where a court held that a prima facie infringing use was a lawful fair use and which was nevertheless still an unlawfully infringing use meriting damages and/or injunctive relief? I'd love to read it. Closest thing that comes to mind was some of the section 1201 stuff from ages ago, distinguishing fair use from circumvention.

Indeed, the very problem I was trying to point out.

Doesn't sound like a problem to me.

Comment Re:If it's not fair use (Score 1) 64

Unless you're a copyright lawyer I'd avoid the "I'm a lawyer" credential in these discussions.

Oh boy, guess what? Even have an LL.M in IP.

the initial step being obvious fair use

There's really no such thing as an obvious fair use. It's always fact-intensive, always case-by-case. And there's always the risk of times changing. Format shifting comes to us from the RIAA v. Diamond case, and it's terrifying to think of how differently that gone had it been litigated just a few years later when the iTunes Music Store was open. I'll agree though that the use of pirated books for training was a bad idea, in that it does not help how a court will perceive the AI developers. Never a good idea to piss off a judge, and it's clear these guys were not thinking.

Still, there's no market for AI analysis of books as far as I know, and the use was transformative in nature given that no one appeared to be reading them, they were just grist for the mill. This is actually more favorable to fair use than Google Books, which is intended for some human to ultimately use to get to read snippets of text in search results. Imagine how strong Google's case would've been if they'd said that searching for a string of text points you to a book with no snippet or anything else to tell you what's inside. Maybe a page or chapter number, for all the good that does.

Once you get over the training hurdle though, it looks like pretty smooth sailing. The model is demonstrably lossy. It's too abstract to mesh with the concept of an abridgement or condensation. You might be able to get snippets here and there out of it, but I would imagine that given a few examples to dive deep into analyizing, OpenAI can show that they appear because they're built out of common chains of text from multiple works, or are overrepresented due to multiple copies of the work. (Kind of like how the image-generating AIs are said to like to make fake Getty Images watermarks because it saw so many of them in training data, so it must be important)

Personally, I like their chances. Either way, it'll be interesting to see.

Comment Re:If it's not fair use (Score 1) 64

2/ the nature of the copyrighted work;

Users are allowed more freedom with regard to non-fiction, since the facts therein are uncopyrightable, and the general organization may be as well. Fiction, being more creative, enjoys a bit more protection from potential fair users.

There is no way a fair use defense will pass muster here. Of course, I'm not a lawyer. But I have talked to a few about the scope of fair use before reusing data in my projects.

I'm a lawyer. I would suggest you take a look at the court opinions concerning Google Book Search. You may be surprised. Try Authors Guild, Inc. v. Google Inc., 954 F. Supp. 2d 282 (S.D.N.Y. 2013) which was at the trial level, and then Authors Guild, Inc. v. Google Inc., 804 F.3d 202 (2d Cir. 2015), when it went up on appeal.

Comment Re:If it's not fair use (Score 2) 64

What constitutes a derivative work though? A quote? An analysis? Using the ideas of a book in abstract to answer a question the book touches on?

Gotcha covered:

17 USC 501(a): Anyone who violates any of the exclusive rights of the copyright owner as provided by sections 106 through 122 ... is an infringer of the copyright ....

17 USC 106: Subject to sections 107 through 122, the owner of copyright under this title has the exclusive rights to do and to authorize any of the following: ... to prepare derivative works based upon the copyrighted work ....

17 USC 101: A âoederivative workâ is a work based upon one or more preexisting works, such as a translation, musical arrangement, dramatization, fictionalization, motion picture version, sound recording, art reproduction, abridgment, condensation, or any other form in which a work may be recast, transformed, or adapted. A work consisting of editorial revisions, annotations, elaborations, or other modifications which, as a whole, represent an original work of authorship, is a âoederivative workâ.

So there's your answer.

A quote is not a derivative work because it's not based on a preexisting work. Instead, that's a reproduction of part of the work (a separate exclusive right under 106, however). A literary analysis is not a derivative work, but if you dug too deep and merely produced an annotated work or adaptation, then it would be. It's not too hard to stay on the correct side of that line.

Using the ideas of a book in abstract to answer a question the book touches on?

Ideas can always be used. Facts -- or things claimed to be a real-life fact -- can always be used. But be cautious again of digging in too deeply. Castle Rock Entertainment Inc. v. Carol Publishing Group, 150 F.3d 132 (2d Cir. 1998) was a case where someone wrote a book of Seinfeld trivia questions. The questions weren't about the show, per se, but about the contents of the show. (That is, they did not ask on what soundstage it was filmed, but things like what is the number of the apartment, or what tagline did character X say repeatedly in a particular episode, that sort of thing) This was held to be copying many little fragments of the show itself, and thus infringing. And it wasn't held to be fair use.

Comment Re:If it's not fair use (Score 1) 64

This would cause problems for text-to-speech used by the blind... a machine reading her work and then plagiarising it, reading it out loud to someone? THEIVES!

I remember this being a big area of interest about 15 years ago, but there doesn't seem to have been much about it lately. So long as it's done in real time and just for the reader, and not an audience, I would be inclined to say that it's not infringing the reproduction right, derivative right, or most significantly, the public performance right.

Comment Re:If it's not fair use (Score 2) 64

If it's not fair use, and the court rules that it's not, do we then have to buy a new license every time we want to read a book once more?

You've never had to buy a license to read a book. Perhaps if you're doing other things in association with your book reading, you might need a license for that but not for just reading.

How does a machine reading a book fundamentally differ from a human, and why would the act of reading constitute a copyright violation?

We've yet to invent a general-purpose AI. These things aren't reading books the way we do. My understanding is that they're basically compiling statistical models, and figuring out how each word or part of word relates to others. Kind of a more complicated version of playing the autocomplete game on your phone, where you input one word in a text chat and then just choose whatever the next suggested word is. The sentences you get make sense for a little while, but become run-ons and start to loop.

Analyzing a work isn't a copyright infringement. Predicate work needed to reach the point of conducting an analysis might, though. And the output of the software might as well.

Am I misreading this, or is Sarah Silverman's argument really that she doesn't want machines reading her work without a pay-per-read license?

That and also not wanting machines to be able to write things that sound like something she might write. Licensing isn't all that relevant, as in many cases a license would simply not be granted.

Transportation

Clean Jet Fuel Startup Fires Up New Carbon Converter (spokesman.com) 41

Thursday a climate technology startup called Twelve "took a major step toward producing sustainable aviation fuel..." reports Bloomberg, "by launching its commercial-scale carbon transformation unit." Twelve is among the emerging companies working on ways to transform captured CO2 into useful products. In the case of the Berkeley, California-based startup, its nascent technology will be critical to cleaning up one of the hardest-to-decarbonize sectors: aviation. Twelve uses a technique called electrolysis that uses electricity to repurpose carbon dioxide and water into various products. When the electricity is generated from renewables, the process is essentially no-carbon. The company's CO2 electrochemical reactor — called OPUS — will be at the center of its first commercial production plant for sustainable aviation fuel, under construction in Moses Lake and set to be completed this year. The plant will run on hydropower and use CO2 captured from a nearby ethanol plant. That CO2 and water will be fed through OPUS and turned into synthetic gas, the basis of sustainable aviation fuel.

Twelve's airline customers can blend it with traditional jet fuel. The resulting carbon credit can be bought by corporate customers like Microsoft to offset their business travel-related emissions...

Although Twelve's carbon transformation technology can be used to make products ranging from spandex pants to car parts, it pivoted to focus more fully on sustainable aviation fuel after the announcement of tax credits for SAF blending, carbon capture and utilization, and hydrogen production, said Twelve co-founder and Chief Science Officer Etosha Cave. Those tax credits helped the company launch this commercial unit. "Without that, we would not be competitive in terms of being able to get to market at the stage we're at," Cave said.

It's still not cost competitive with traditional jet fuel, the article points out, "but airlines are under increasing pressure from governments and their own net zero commitments to integrate SAF into their fuel mix.

"Twelve would not disclose its cost to make the fuel, though it said it expects prices to go down as its technology scales up and eventually reach parity with traditional jet fuel."

Comment Re:Simple answer (Score 1) 82

The Internet is less of a wild west than it used to be, but it's still pretty wild.

There's plenty big countries like Russia where the authorities won't care at all about any of your local rules. That may not be fun, but it's a fact. If somebody from Russia runs a scan on you, and then distributes information in Russian forums, there's pretty much nothing you can do about that.

Even less now with the Ukraine situation, where Russia is heavily sanctioned and even less inclined to be friendly to other countries.

Comment Simple answer (Score 2) 82

You can't.

Even if you could obtain some compliance within a given country, the Internet is international and people from various less liked jurisdictions couldn't care less about what rules you might have.

Once the data is out there, it's out there. You can't ever have any confidence that nobody will notice that a given port is open or a given service is buggy, and that this fact won't spread through various parties, including things like underground forums most people don't know even exist.

Slashdot Top Deals

A morsel of genuine history is a thing so rare as to be always valuable. -- Thomas Jefferson

Working...