Archive.org Sued By Colorado Woman 797
Posted
by
CowboyNeal
from the legally-binding-http-requests dept.
from the legally-binding-http-requests dept.
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
Posted notice? (Score:5, Insightful)
Similarly, there are ways to post that notice on your website as well. robots.txt comes to mind. If she didn't bother to post the notice correctly, the case should be just thrown out.
Re:Maybe I'm new here... (Score:2, Insightful)
Re:Posted notice? (Score:2, Insightful)
The way I see it, there are certain ways to make sure spiders don't index your pages, and we should agree that if one is smart enough to put a web site on the net, one should also be smart enough to learn how these work. Good spiders like any of the major search engines and archive.org will restrain themselves if you setup a robots.txt file (or meta tags) that tell them they aren't welcome.
If she really didn't want her data to be copied, she should have stuck it in a password protected directory, and she could have made her nifty copyright exclusion/contract agreement to apply after users entered the supplied passwords.
Re:Maybe I'm new here... (Score:4, Insightful)
No it's not an excuse. By placing her information on the internet, she must comply with the standards of the internet. The important thing is that there was a legitimate way to prevent this which she could have implemented, but she didn't. Just as the cartoon advertisement got in a shitload of trouble in Boston. It's not that they weren't allowed, but rather they simply needed to file the proper form.
If you were to use a felt tip marker to put your copytight notice on a braille flyer, but didn't put the notice in braille, and someone who read the braille and was incapable of reading the marker then sent that notice to their friends, I suspect they would not be in violation of the copyright notice.
Re:Posted notice? (Score:5, Insightful)
If you want something published on the public internet to be private, require viewers to enter a password or present a cryptographic certificate. Everything else is public.
World is bigger than the US... (Score:5, Insightful)
No that would only change the nature of the web for US citizens, just like the online internet gamble thingie, the rest of us will shrug and move about our businesses as usual.
Send her love. (Score:2, Insightful)
wget -r -l0 --delete-after http://www.profane-justice.org/ [profane-justice.org]
Big deal? (Score:3, Insightful)
My goodness, what an eyesore... (Score:2, Insightful)
By putting material on the web, it is understood that any publicly accessible material is copied multiple times every time the page is viewed. The ISPs get it, the browser gets it, the disk cache gets it, the user reads it. The internet is a *protocol* for copying data. To put a silly notice with insane demands at the bottom of the page is not simply futile, but laughably ridiculous. To sue over having content copied from a webserver (which to put it in terms that she might understand is basically like a public photocopier with her webpages permanently installed for every passerby to simply push *copy* and read the material) is like suing people for having a picture of that happens to have your billboard in the background. You put the data out to be seen, but as soon as it's not exactly where you want it, you get angry.
Above and beyond that, it is clear that Archive.org is not trying to steal her profit. You can opt out. You can tell it not to archive in the first place. You can even make a simple script that blocks access to the rest of the site until you click "I agree" to the terms. None of these solutions would take more than 5 minutes.
Re:Posted notice? (Score:5, Insightful)
Then don't post it on the internet. Just like if you don't want people to see you in your underwear you should stand in your front yard in just your underwear holding a sign saying "Don't look at me". The space might be private, but the view is public.
Re:Posted notice? (Score:5, Insightful)
As has been stated before, you save a copy of all websites you visit. It's called cache. Archive.org could be seen as a computer browser with a robotic viewer and a bigass cache folder.
Equally, the internet works on the premise that the information is -out there-. If the site existed only in a local intranet, then that would be a different story. This site was published to be seen and, by extension, copied, as so long as the rights of the content provider are upheld.
If you don't want your site copied, DON'T PUT IT ON THE INTERNET. Or, at the very least, put some level of protection on it. A human-only readable notice does not protection make. Otherwise, you are defaulting to a "copy me" state, as that is the default state of http. She has the right, however, as the copyright holder, to ask that all copies be purged.
To use a real world analogue. Say you go golfing. While golfing, you hear "FORE!" but keep playing. A golf ball smacks you in the head and you get a concussion, the extent of which prohibits you from going to work that week. Can you sue the golf course or the golfer(assuming he wasn't maliciously attempting to zing you across the skull)? No. You did not have a reasonable expectation not to be around flying white harbingers of pain. If you go golfing at a golf course, any reasonable person knows that there will be other golfers. So you accept the possibility and the consequences.
When you publish to the internet, your site will be copied. If someone you don't like does it too, you have no right to bitch about it. You published it out to be seen by the world. It's out there now. Too bad. However, as the copyright holder, you reserve the right to approach one of the copiers and ask, "I wish for you to delete that." The copier may have fair use defenses for it or may end up deleting it out of respect. Equally, as the copyright holder, you can definitely say, "That copy you have, you can't make money off it or take credit for it or deface it in any way or hide its source." Archive.org is not doing anything of the sort.
This, of course, is ignoring the fact that this woman more then likely has used a search engine at least once in her life. As a result, should she attest that these activities are illegal, it means she willingly participated in a query of a database doing that which she believes is wrong. That is not only hypocritical, but downright vile and questionably as illegal as that which she is alleging.
I'm not quite sure if you were playing devil's advocate or are actually an RIAA-worthy ignorant copyright Nazi, but either way, you definitely didn't put much thought into your post.
Re:Posted notice? (Score:5, Insightful)
Either she has the world's dumbest lawyer (RICO charges? please), or she's filing pro se (see previous about world's dumbest lawyer). In either case she's an ideal test case for the webcrawlers, because she will almost certainly get completely demolished in court.
She might have a copyright claim, but she couldn't even get that one up the steps. Did she even file for infringement?
Re:Posted notice? (Score:3, Insightful)
If an agent of the RIAA were to blanket a whole block with one of their songs, they do not have the right to collect royalties or payment from the denizens who resided within that block had they not agreed to the blanketing in the first place. That is broadcast law. But they do have the right, as copyright holders, to disallow anyone from making a tape of said broadcast.
Now, if the RIAA broadcasted their song with a sign saying, "RECORD ME", they lose that right to take the one guy with a tape recorder(and bad taste in music) to court.
The Internet is a defaulted "RECORD ME". Unless you don't want it seen by anyone. At which point you're in intranet territory. But even there, you would be stating RECORD ME to all persons within the intranet.
I won't lie and state that copyright law as it is has any clue in the modern world of information exchange. But certainly, this isn't blanket copyright infringement. And apparently, the legal system agrees:
A court ruling last month granted the Internet Archive's motion to dismiss the charges
However, she isn't even challenging on the basis of copyright infringement. She is suing pertaining to a shrink wrap EULA of, "By viewing this site, of which you must be viewing to read this, you agree to the following..." which have been thrown out in the past in the cases of Microsoft and Symantec software licenses anyways.
Re:Posted notice? (Score:5, Insightful)
Oops! Looks like somebody doesn't understand the internet.
Robots.txt is the way to block web spiders from your site. That's not somebody "dictating your rights", that just the way it fucking works. It dictates your rights the same way a steering wheel dictates the way you have to steer your car. You don't have to use it, but your solution probably won't work, and you'll look like a moron. When you have a blank or non-existant robots.txt, it's understood by billions of people on the internet that you don't mind if web spiders crawl your site and add it to their index, make cached copies, etc. That's the way it was designed, and that's the way its worked from the very beginning. It's not rocket science.
Also, every person who visits your site gets a complete copy of the pages they visit in their browser cache. Once your page is cached in my browser, I have that information forever. I can delete it, view it, save it to CD, make a PDF, etc. Just like the person who owns a book that's no longer published. There's not some magic "delete fairy" who goes around deleting everyone's browser cache when you decide to delete a page.
Maybe not everyone knows about their browser cache or robots.txt, but that doesn't mean they don't exist. You can't change the way the internet works because a bunch of morons failed to do even the most basic research before throwing their crap on the web.
Re:Maybe I'm new here... (Score:3, Insightful)
As an example, having a website is like having a store. If you leave the front door of your store open, with the lights on, and someone inside, entering is not a tresspass. If you put a small lock on that door, no matter how easy to pick, entering that store becomes tresspass. If you put a huge sign on the door that says "door lock is broken, private property, KEEP OUT," then entering that store is very probably tresspass, unless you're blind. Putting up a huge sign on the door that says "DO NOT ENTER", but doing so in braille, will probably not count as sufficient warning.
So, um... (Score:3, Insightful)
Re:Posted notice? (Score:2, Insightful)
Re:Why not change the crawlers (Score:5, Insightful)
Yes, it would be hard. However the technical problems would certainly be solvable for a company like google. The real problem is that this would serve to further entrench google's monopoly on search. I love google as much as the next guy, but I certainly don't want to start introducing measures that will make it more difficult for a google-killer to arise.
That's the worst analogy I've ever heard that didn't fulfill Godwin's Law. The web was designed to be a public place for the sharing of information. Trying to think of it like the inside of someone's house is just going to lead you in the wrong direction. A much better analogy would be that information posted on the web is like a campaign poster or some other notice posted on someone's lawn. Yes, it's hosted on private property and the information posted there belongs to the person posting it, but it is displayed publicly and acting as if it weren't is just silly.
Re:Why not change the crawlers (Score:5, Insightful)
I guess you could, you could just as easily put terms in the licenses of web servers that state that by using the web server software, you agree to let all your documents be crawled except where you deny that with
That wouldn't be too hard either would it?
all the best,
drew
http://www.youtube.com/watch?v=vcaf2ThG7q4 [youtube.com]
UFO seen in skies over Winton!
Re:Posted notice? (Score:4, Insightful)
Idiot for a lawyer and an idiot for a client. It's not going to go well for her, is it?
I mean, just how did she figure she'd get away with charges for "civil theft" (generally defined as "taking property with the intent of permanently depriving its rightful owner of its use") or "conversion" (which also relates to taking property) when no property was taken? And RICO without a racket?
The contract claim will clearly fail -- reading between the lines of the document you link, it was essentially only not struck out at this stage because the court wasn't allowed to consider the evidence presented by Internet Archive that they hadn't seen the notice of the contract.
That leaves only the copyright claim, and I think it's pretty clear that Internet Archive's use is fair use.
Scientology strikes again??? (Score:1, Insightful)
Re:SITE SLASHDOTTED (Score:1, Insightful)