Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!


Forgot your password?
Back for a limited time - Get 15% off sitewide on Slashdot Deals with coupon code "BLACKFRIDAY" (some exclusions apply)". ×

Submission + - Corpus of 147 million quasi-relational Web tables released for public download (webdatacommons.org)

Robert Meusel writes: The Web contains vast amounts of HTML tables. Most of these tables are used for layout purposes, but a fraction of the tables is also quasi-relational, meaning that they contain structured data describing a set of entities.
A corpus of Web tables can be useful for research and applications in areas such as data search, table augmentation, knowledge base construction, and for various NLP tasks.
The WDC Web Tables corpus has been extracted from the 2012 version of the Common Crawl (http://commoncrawl.org) , the largest Web crawl that is available to the public. The corpus contains the subset of the 11 billion HTML tables found in the Common Crawl that are likely quasi-relational.
There are similar corpora at Google and Microsoft, but our corpus is the only one of this size available to the public.
Beside of being a good test bed for your Search Join engine and a great resource for enriching DBpedia, the tables corpus might also be useful for some of the people in the group working on NLP tasks.

Submission + - GnuTLS Bug Breaks x509 Certificate Validation (arstechnica.com)

mysqlbytes writes: Ars are reporting that a bug introduced in GnuTLS, makes it trivial for attackers to bypass SSL and TLS. The bug relates to how the code handles the validation of TLS certificates. The coding error may have been present since 2005, meaning that the blast radius for this bug could be huge. "Attackers can exploit the error by presenting vulnerable systems with a fraudulent certificate that is never rejected, despite its failure to pass routine security checks. The failure may allow attackers using a self-signed certificate to pose as the cryptographically authenticated operator of a vulnerable website and to decrypt protected communications."

Submission + - Stanford Team Tries for Zippier Wi-Fi in Crowded Buildings (itworld.com)

itwbennett writes: Having lots of Wi-Fi networks packed into a condominium or apartment building can hurt everyone's wireless performance, but Stanford University researchers say they've found a way to turn crowding into an advantage. In a dorm on the Stanford campus, they're building a single, dense Wi-Fi infrastructure that each resident can use and manage like their own private network. That means the shared system, called BeHop, can be centrally managed for maximum performance and efficiency while users still assign their own SSIDs (service set identifiers), passwords and other settings, according to Yiannis Yiakoumis, a Stanford doctoral student who presented a paper at the Open Networking Summit this week.
Open Source

Submission + - Free Software NVIDIA driver now supports 3D acceleration with all GeForce GPUs (h-online.com) 2

aloniv writes: The reverse-engineered free/libre and open source driver for NVIDIA cards Nouveau has reached a new milestone. The Nouveau driver in the current Linux 3.8 development branch has recently acquired everything that's necessary to support the 3D acceleration features of any GeForce graphics hardware. Together with a current version of libdrm and the Nouveau 3D driver in Mesa 3D 9.0, this allows Linux applications to use 3D acceleration even with the most recent GeForce graphics cards.

How many Bavarian Illuminati does it take to screw in a lightbulb? Three: one to screw it in, and one to confuse the issue.