Mining Neologisms from Wikipedia 93

holy_calamity writes "Natural Language Processing researchers have developed a tool called Zeitgeist that can discover the meaning of new words for itself using Wikipedia. It looks for entries for words not in the WordNet database and works out their meaning by looking for known words linked to them. Development of the tool is focusing on using it to understand what bloggers (using slang and neologisms) are saying about companies' products."
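The approach described in the summary could look roughly like the sketch below. It is only an illustration, not Zeitgeist's actual code: `KNOWN_WORDS` is a hypothetical stand-in for WordNet, `LINKS` is a hypothetical stand-in for Wikipedia's link structure, and the majority vote is an assumed way of "working out" a meaning from linked known words.

```python
from collections import Counter

# Hypothetical stand-in for WordNet: known word -> coarse category.
KNOWN_WORDS = {
    "website": "artifact",
    "diary": "artifact",
    "writer": "person",
    "journal": "artifact",
}

# Hypothetical stand-in for Wikipedia: entry title -> titles it links to.
LINKS = {
    "blog": ["website", "diary", "writer", "journal", "vlog"],
    "website": ["blog"],
}


def guess_categories(known, links):
    """Guess a category for each entry that is missing from the known-word list."""
    guesses = {}
    for title, targets in links.items():
        if title in known:
            continue  # already has a meaning, so not a neologism candidate
        # Each linked word that is already known votes for its own category.
        votes = Counter(known[t] for t in targets if t in known)
        if votes:
            guesses[title] = votes.most_common(1)[0][0]
    return guesses


print(guess_categories(KNOWN_WORDS, LINKS))  # {'blog': 'artifact'}
```

Here "blog" is absent from the known-word list, three of its linked words are artifacts and one is a person, so the sketch guesses "artifact".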

The Top 100 Best-Selling PC Games of the Century 97

Ground Glass writes "They already did this for consoles and handhelds, but now Next Generation has finished the cycle by releasing a rather more interesting list of the best-selling PC games released since 2000. It's more interesting as, since most everyone has a Windows PC in some form or another, the games that are purchased for it are...rather more esoteric than you'd see being bought on console. You may also notice the sales numbers are quite a bit lower than on the other lists. Is this the spectre of piracy given form? In any case, there's plenty of data to interpret here."

Google to Give Data To Brazilian Court 182

Edu writes to mention a Washington Post article about Google's olive branch to the Brazilian courts. Despite previously refusing to reveal search information to the U.S. government, the company has announced they'll be releasing information on hate groups to the Brazilian courts. The move is intended to allow the Brazilian government to identify users associated with homophobic and racist groups. From the article: "Orkut pulls objectionable words and pictures from user sites, but Google stores content it feels could be useful in a lawsuit. Orkut is especially popular in Brazil, which accounts for 75 percent of its 17 million users. Legal and privacy experts said that Google had no choice but to comply with the court order. 'From the law enforcement perspective, if the records are in the possession of the business, the business can be compelled to produce them,' said Marc Rotenberg, executive director of the Washington-based Electronic Privacy Information Center."

The Beautiful Chaos of 1,000 Trackmania Racers 74

Mark Wallace writes "This 3-minute video of 1,000 runs of the same Trackmania Sunrise road course, overlaid on each other, turns the game into a gorgeous picture of an ordered system tending toward chaos. The pack starts out in perfect shape and becomes a glorious mess by the end of the course. Plus, it's just beautiful stuff." I'm normally not one for linking videos, but this is a great way to spend a few minutes on a Tuesday morning.

Who (Really) Writes Wikipedia 175

Nico ? La ! writes "Aaron Swartz questions Jimbo Wales' (Wikimedia's founder) belief and evangelized truth that only around 500 people are the most important contributors to Wikipedia, whereas the truth is that they are probably just the people who do the most editing. From the post: 'For example, the largest portion of the Anaconda article was written by a user who only made 2 edits to it (and only 100 on the entire site). By contrast, the largest number of edits were made by a user who appears to have contributed no text to the final article (the edits were all deleting things and moving things around).'" Which ultimately means that Wikipedia in some ways much more closely mimics a real encyclopedia, with many contributors writing the bulk of the content, but a small group massaging that text to ensure standards compliance with the overall work. Interesting thing there and worth your time, although the super-computer thing doesn't make a lot of sense to me.
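The distinction between counting edits and counting contributed text can be made concrete with a small sketch. This is not Swartz's actual method. The `REVISIONS` log and its per-edit "surviving characters" figures are invented for illustration, and measuring contribution by surviving characters is an assumption.

```python
from collections import Counter

# Hypothetical revision log for one article:
# (user, characters from this edit that survive in the final text).
REVISIONS = [
    ("occasional_writer", 1200),
    ("occasional_writer", 300),
    ("frequent_editor", 0),
    ("frequent_editor", 0),
    ("frequent_editor", 0),
    ("frequent_editor", 0),
    ("frequent_editor", 0),
]


def top_contributors(revisions):
    """Return (user with the most edits, user with the most surviving text)."""
    edits = Counter()
    text = Counter()
    for user, surviving_chars in revisions:
        edits[user] += 1
        text[user] += surviving_chars
    return edits.most_common(1)[0][0], text.most_common(1)[0][0]


print(top_contributors(REVISIONS))  # ('frequent_editor', 'occasional_writer')
```

The two rankings disagree, which is the pattern the quoted Anaconda example describes: the most active editor by edit count contributed none of the final text.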

Microsoft Research Builds 'BrowserShield' 226

SteelyBen writes "Researchers at Microsoft have completed work on a prototype framework called BrowserShield that promises to intercept and remove, on the fly, malicious code hidden on Web pages, instead showing users safe equivalents of those pages. The BrowserShield project, an outgrowth of the company's 'Shield' initiative, could one day even become Microsoft's answer to zero-day browser exploits such as the WMF (Windows Metafile) attack that spread like wildfire in December 2005."

Canadian Copyright Group Seeks To License the Net 149

An anonymous reader writes "A new Toronto Star article from Michael Geist not only describes why Canadian Ministers of Education are pushing a copyright proposal that will harm Internet access, but also reveals how a copyright group is seeking to create a new license for Internet content. Access Copyright, a copyright collective, wants to use a new international text standard to license everything from books to blogs. Geist outlines in his blog how Canadians can fight back against these bonehead proposals."

SanDisk MP3 Players Seized in MP3 Licence Dispute 299

MrSteveSD writes "According to the BBC, German officials have seized SanDisk's MP3 players at the IFA show in Berlin. The Italian company Sisvel claims that SanDisk has refused to pay license fees for the MP3 codec. Sisvel President Roberto Dini has said that SanDisk could get an edge over competitors by not paying the fees. How much are proprietary format licensing fees pushing up the cost of consumer goods?"

Google Releases Tesseract as Open Source 251

An anonymous reader writes "Google recently released Tesseract as open source. Originally developed at HP Labs from 1985 to 1995, it has been touted as one of the most accurate Optical Character Recognition (OCR) programs available. After the code sat on the shelf gathering dust for so many years, Google cleaned up some of its more outdated portions and released it for general consumption. You can download Tesseract over at SourceForge."

Podcasts of University Lectures? 601

theslashdot asks: "I'm working at a major university in the US, and have been charged with posting pod-casts of class lectures on the internet. The problem is whether or not posting the videos would allow students to skip class and just download the lecture, instead. I guess the problem is trying to strike the right balance between allowing good students to take advantage of this resource, but discourage bad students from staying at home all the time and watching all the lectures right before the exam. So what methods can be used to provide these pod-casts for the students who actually attended class? In terms of when the lecture should be posted, what would be a good time-frame? Immediately after the class? 24 hours? One week? One class behind schedule?"

California Passes Wi-Fi Guidance Law 204

MrNonchalant writes, "California's legislature has passed a law requiring Wi-Fi device manufacturers to include warnings about security. From the article: 'From 1 October 2007, manufacturers must place warning labels on all equipment capable of receiving Wi-Fi signals, according to the new state law. These can take the form of box stickers, special notification in setup software, notification during the router setup, or through automatic securing of the connection. One warning sticker must be positioned so that it must be removed by a consumer before the product can be used.'"

FreeDOS 1.0 Released 365

Noksagt writes, "FreeDOS 1.0 has been released only a little bit later than planned. The 1.0 milestone is considered to be 'a stable and viable MS-DOS replacement' and features long filename support, HIMEM and EMM386 management, and CD-ROM support."

zCodec Video Codec Is a Trojan 188

Bride of Chucky writes "There's a new video codec out there that claims to offer 'up to 40 percent better video quality' but that resets your computer's DNS settings — opening the way for Trojans, rootkits, or whatever. Techworld warns that zCodec looks professional enough, is widely available, and comes in at 100KB. What's the bet the media companies are behind this somewhere?"

You Have Been 'Randomly' Selected? 1160

dpbsmith asks: "One thing I've noticed is that the people who are told by the TSA that they have been 'randomly' selected for baggage inspection have a tendency not to believe it. I know one couple in which the wife has been 'randomly' selected four times, while the husband never has been. The wife believes that it is because each of those times, she was traveling by herself, without checked baggage (whereas she has never been inspected when traveling with her husband with checked baggage). In 'Uncommon Carriers', John McPhee accompanied a truck driver to write about the experience, and bought a trucker's cap to blend in. He says 'I would pay for my freedom at the Seattle-Tacoma airport when, with a one-way ticket bought the previous day, I would arrive to check in my baggage.' His baggage was 'randomly' selected for inspection, and later he was 'once again "randomly selected" for a shoes-off, belt-rolled, head-to-toe frisk.' So, what about it? Is the TSA simply flat-out lying when they tell you that you have been 'randomly selected'?" The better question to ask is: "Are random searches effective in keeping everyone safe?"
