Plouf - Slashdot User

Optical Character Recognition Still Struggling With Handwriting 150

Posted by Soulskill on Sunday October 05, 2008 @12:37PM from the i-can't-read-my-handwriting-either dept.

Ian Lamont recently asked Google if they planned to extend their transcription of books and other printed media to include public records, many of which were handwritten before word processors became ubiquitous. Google wouldn't talk about any potential plans, but Lamont found out a bit more about the limits of optical character recognition in the process: "Even though some CAPTCHA schemes have been cracked in the past year, a far more difficult challenge lies in using software to recognize handwritten text. Optical character recognition has been used for years to convert printed documents into text data, but the enormous variation in handwriting styles has thwarted large-scale OCR imports of handwritten public documents and historical records. Ancestry.com took a surprising approach to digitizing and converting all publicly released US census records from 1790 to 1930: It contracted the job to Chinese firms whose staff manually transcribed the names and other information. The Chinese staff are specially trained to read the cursive and other handwriting styles from digitized paper records and microfilm. The task is ongoing with other handwritten records, at a cost of approximately $10 million per year, the company's CEO says."

Netbook Return Rates Much Higher For Linux Than Windows 663

Posted by Soulskill on Sunday October 05, 2008 @09:17AM from the old-dogs-new-tricks-etc. dept.

ivoras writes "An interview with MSI's director of US Sales, Andy Tung, contains this interesting snippet: 'We have done a lot of studies on the return rates and haven't really talked about it much until now. Our internal research has shown that the return of netbooks is higher than regular notebooks, but the main cause of that is Linux. People would love to pay $299 or $399 but they don't know what they get until they open the box. They start playing around with Linux and start realizing that it's not what they are used to. They don't want to spend time to learn it so they bring it back to the store. The return rate is at least four times higher for Linux netbooks than Windows XP netbooks.'"

Report Says China Will Demand Source Code 305

Posted by kdawson on Sunday October 05, 2008 @04:53AM from the said-the-spider-to-the-fly dept.

An anonymous reader alerts us to a two-week-old story that hasn't gotten much traction in the press to date. A Japanese newspaper and the AP report that China plans to demand source code from hardware manufacturers, and ban the sale of products from companies that don't comply. China is calling this an "obligatory accreditation system for IT security products." The plan is to go into effect next May, according to sources. "Products expected to be subject to the system are those equipped with secret coding, such as [a] contactless smart card system developed by Sony Corp., digital copiers, and computer servers. The Chinese government said it needs the source code to prevent computer viruses taking advantage of software vulnerabilities and to shut out hackers. However, this explanation is unlikely to satisfy concerns that disclosed information might be handed from the Chinese government to Chinese companies. There also are fears that Chinese intelligence services could exploit such confidential information by making it easier to break codes used in... digital devices."

IBM Wants Patent On Finding Areas Lacking Patents 151

Posted by CmdrTaco on Monday September 29, 2008 @10:50AM from the all-for-me-none-for-you dept.

theodp writes "It sounds like a goof — especially coming from a company that pledged to raise the bar on patent quality — but the USPTO last week disclosed that IBM is seeking a patent for Methodologies and Analytics Tools for Identifying White Space Opportunities in a Given Industry, which Big Blue explains allows one 'to maximize the value of its IP by investigating and identifying areas of relevant patent 'white space' in an industry, where white space is a term generally used to designate one or more technical fields in which little or no IP may exist,' and filling those voids with the creation of additional IP."

New Approach To Malware Modifies Linux Kernel 170

Posted by timothy on Sunday September 28, 2008 @02:05PM

Hugh Pickens writes "Professor Avishai Wool has unveiled a program to watch for malware on servers with a modification to the Linux kernel. 'We modified the kernel in the system's operating system so that it monitors and tracks the behavior of the programs installed on it,' says Wool. Essentially, Wool says, his software team has built a model that predicts how software running on a server should work (pdf). If the kernel senses abnormal activity, it stops the program from working before malicious actions occur. 'When we see a deviation, we know for sure there's something bad going on,' Wool explains. Wool cites problems with costly anti-virus protection. 'Our methods are much more efficient and don't chew up the computer's resources.'"

Game Distribution and the 'Idiocy' of DRM 271

Posted by Soulskill on Sunday September 28, 2008 @01:01PM from the completely-neutral-titles dept.

In light of the increased focus on the DRM controversy in recent days, Ars Technica did an interview with execs from CD Projekt's Good Old Games about where the problems are with current DRM implementation. "For me, the idiocy of those protection solutions shows how far from reality and from customers a lot of executives at big companies can be. You don't have to be a genius to check the internet and see all the pros and cons of those actions." Penny Arcade is also running a three-part series on DRM from game journalists Brian Crecente and Chris Remo. Crecente talks about how some companies are making progress in developing acceptable DRM, and some aren't. Remo recommends against a trend of overreaction to minor gripes.

Princeton Researchers Say Feds Need Data Standard 49

Posted by Soulskill on Sunday September 28, 2008 @08:20AM from the yes,-but-which? dept.

dcblogs writes "The federal government's data-sharing efforts are a mess, and if Barack Obama really wants a useful 'Google for government,' he would have to set the government's vast amount of data free by exposing it and ensuring it complies with standards. Once that happens, commercial sites, aggregators, bloggers and everyone else will be able to access it, use it and transform it, argue a group of Princeton researchers (follow Download link for full PDF)."

Australia Mulling a Nationwide Vehicle-Tracking System 176

Posted by timothy on Sunday September 28, 2008 @04:02AM from the coming-soon-to-$yourcountry dept.

An anonymous reader writes "It seems that as political support for Australia's version of the national ID card is waning, the powers that be have found a far more effective way to catalog the populace. CrimTrac, an Australian government agency responsible for designing technical solutions to aid policing, is due to hand in a $2.2 million scoping study for the introduction of a nationwide automatic number plate recognition system (ANPR). It seems that as well as ANPR, the system will also collect images of drivers and passengers with high enough resolution for identification purposes. All ANPR data collected would be made available to participating agencies in real time, and retained for five years for future investigations."

W3C.org Briefly Censored In Finland 115

Posted by timothy on Saturday September 27, 2008 @08:06PM from the well-they-do-inspire-impure-thoughts dept.

k33l0r writes "The web site of W3C, w3.org or w3c.org, was briefly censored (Google Translation) by at least some of the local ISPs. For an unknown reason the URL was mistakenly entered into the Federal Police's censor database. Some of the Finnish ISPs use the database to filter out questionable content such as child pornography." Finnish online activist Matti Nikki describes some of the problems with this database-based censorship.

Studies Say Ideology Trumps Facts 784

Posted by samzenpus on Thursday September 25, 2008 @02:54AM from the water-still-wet dept.

Anti-Globalism writes "We like to think that people will be well informed before making important decisions, such as who to vote for, but the truth is that's not always the case. Being uninformed is one thing, but having a population that's actively misinformed presents problems when it comes to participating in the national debate, or the democratic process. If the findings of some political scientists are right, attempting to correct misinformation might do nothing more than reinforce the false belief."
