Forgot your password?

typodupeerror

Oracle bug takes down core JP Morgan Chase apps->

Submitted by CurtMonash
CurtMonash writes "An Oracle bug took down multiple JP Morgan Chase applications Monday night and Tuesday, if a very credible-seeming source is to be believed. The damage included $132 million in delayed ACH transactions, 500-1000 failed applications each for auto and student loans, and a whole lot of bad publicity around the unavailability of JP Morgan Chase's core online banking portal. To top things off, other factors brought Morgan's online portal to its knees again on Wednesday."
Link to Original Source

How should a non-techie learn programming?-> 2

Submitted by CurtMonash
CurtMonash writes "Nontechnical people — for example marketers or small business owners — increasingly get the feeling they should know more about technology. And they're right. If you can throw up a small website or do some real number-crunching, chances are those skills will help you feed your family. But how should they get started? I started a thread with the question on DBMS2, and some consistent themes emerged, including:
  • * Learn HTML + CSS early on.
  • * Learn a bit of SQL, but you needn't make that your focus.
  • * Have your first real programming language be one of the modern ones, such as PHP or Python.
  • * MySQL is a good vehicle to learn SQL.
  • * It's a great idea to start with a project you actually want to accomplish, and that can be done by modifying a starter set of sample code. (E.g., a WordPress blog.)
  • * Microsoft's technology stack is an interesting alternative to some of the other technology ideas.

A variety of books and websites were suggested, most notably MIT's Scratch. But, frankly, it would really help to get more suggestions for sites and books that help one get started with HTML/CSS, or with MySQL, or with PHP. And so, techie studs and studdettes, I ask you — how should a non-techie go about learning some basic technological skills?"
Link to Original Source

Databases

What's the best way to scale out Postgres?->

Submitted by
CurtMonash
CurtMonash writes "I have a client building a new app that will peak at several million updates per hour to a 1-2 terabyte database. They don't have much budget, and they don't want to use MySQL. What are their options? They tried EnterpriseDB's GridSQL scale-out technology, but quickly discarded it because it didn't support the latest PostgreSQL releases. I put up a blog post with this question, asking about software and design options such as Hibernate and hand scaling. But the early responses have focused on SSDs. Also on the table are ACID-compliant in-memory DBMS like the Groovy SQL Switch or VoltDB, but those are immature products, and obviously require a lot of silicon. PostgreSQL has the benefit of being pretty mature, with some SQL features the client really values (e.g., windowing, geospatial datatype). What are the most effective and proven ways to scale out PostgreSQL?"
Link to Original Source
Security

Two dangerous kinds of blog comment spam-> 1

Submitted by
CurtMonash
CurtMonash writes "As the owner of several WordPress blogs, I get over 100,000 spam comments per year, of which Akismet lets through a delightfully tiny fraction. Most of that spam seems to either be selling various (probably shady) products and services, or just attempt to transmit "link juice" to other sites for SEO purposes. At least two forms of comment spam, however, are more sinister than that.

First, some comments — especially ones focused on obscure "long-tail" keywords — try to direct visitors to actual malware delivery sites. I started seeing examples of those back in 2007; they're still coming in with high frequency. Second — and so far I've seen a single example of this one — I just got a comment containing a brazen offer to provide website-attacking services. Prices range from $25-70, depending on the duration of the attack, and the spammer promises "On average the data, ordered the site falls within 5 minutes after the start."

Aw shucks. I long for the good ol' days when spam just promised payday loans I could use to buy Viagra to enhance my enjoyment of free porn sites ..."

Link to Original Source
Databases

MMO games use very strange database designs->

Submitted by
CurtMonash
CurtMonash writes "The technology of MMO RPGs (Massively Multiplayer Online Roleplaying Games) is interesting and even groundbreaking on multiple levels — graphics, AI, networking/security, and more. But one areas where it's downright funky is in database management. Nominally, the major MMOs seem to use recognizable database technology — MySQL (World of Warcraft, Second Life), Postgres (Everquest), SQL Server (Guild Wars), StreamBase (also Second Life), and so on. But what they do with it is another matter. In Lord of the Rings Online (LOTRO), for example, minor database-value errors are only fixed in major quarterly releases, suggesting a rather cavalier attitude toward database integrity. Authentication servers go down all the time too, on what probably is a SQL- rather than LDAP-based system. Meanwhile, Guild Wars opts out of conventional database architecture altogether, and just saves character state in 10K-30K BLOBs.

Perhaps these are actually good technical decisions for the MMO developers to have made. But I can't help thinking that a little bit of enterprise IT savvy would save MMO developers a lot of embarrassment and aggravation, and make their gamers happier as well."

Link to Original Source
Medicine

Hospital turns away ambulances after EHRs go down->

Submitted by
CurtMonash
CurtMonash writes "The Indianapolis Star reports that Tuesday Morning, Methodist Hospital turned away patients in ambulances, for the first time in its 100-plus history. Why? Because the electronic health records (EHR) system had gone down the prior afternoon — due to a power surge — and the backlog of paperwork was no longer tolerable.

If you think about that story, it has a couple of disturbing aspects. Clearly the investment in or design of high availability, surge protection, etc. were sadly lacking. But even leaving that aside — why do problems with paperwork make it necessary to turn away patients?

Maybe the latter is OK, since there obviously were other, more smoothly running hospitals to send the patient to. Still, the whole story should be held up as a cautionary tale for hospitals and IT suppliers everywhere."

Link to Original Source
Software

Facebook's 2 1/2 petabyte Hadoop-based warehouse->

Submitted by
CurtMonash
CurtMonash writes "Facebook has a 2 1/2 petabyte data warehouse, running on Hadoop/Hive rather than a database management system. I already mentioned this in a recent story, "Web Analytics Databases Get Even Larger"*, but subsequently I've gotten a lot more detail from Ashish Thusoo and Joydeep Sen Sarma of Facebook. Highlights include:
  • Facebook operates a single cluster of 610 nodes, running multiple Hadoop MapReduce jobs at once.
  • Hardware only costs $2000-$4000/node, versus the $10,000+/node that might be required for a DBMS.
  • Hive implements some basic SQL functionality over MapReduce.
  • Reliability is so-so. Long-running queries fail more often than ETL — which actually is ELT — because getting the data in the first place is more important than having query results be current to the nearest 15 minutes.
  • The most rapidly-cycled queries are re-run approximately hourly.
  • Facebook is really making serious extensions to open source Hadoop code.

*Sorry about the misspelling of "ever"!"
Link to Original Source

Databases

Web analytics databases get every larger->

Submitted by
CurtMonash
CurtMonash writes "Web analytics databases are getting every larger. eBay now has a 6 1/2 petabyte warehouse running on Greenplum — user data — to go with its more established 2 1/2 petabyte Teradata system. Between the two databases, the metrics are enormous — 17 trillion rows, 150 billion new rows per day, millions of queries per day, and so on. Meanwhile, Facebook has 2/12 petabytes managed by Hadoop, not running on a conventional DBMS at all, Yahoo has over a petabyte (on a homegrown system), and Fox/MySpace has two different multi-hundred terabyte systems (Greenplum and Aster Data nCluster). eBay and Fox are the two Greenplum customers I wrote in about last August, when they both seemed to be headed to the petabyte range in a hurry. These are basically all web log/clickstream databases, except that network event data is even more voluminous than the pure clickstream stuff."
Link to Original Source
Databases

MapReduce can't keep up with MPP DBMS->

Submitted by
CurtMonash
CurtMonash writes "Monday was a bad night for the MapReduce advocates. First, famed MapReduce skeptics Michael Stonebraker and David DeWitt released a series of benchmarks that suggest MPP database management systems far outperform Mapreduce (specifically Hadoop). I piled on by posting some thoughts from even-more-skeptical eBay, which thinks MapReduce is 6-8X slower than MPP database managers for comparable tasks.

That doesn't mean MapReduce advocates need to jump off of a ledge. Much of what these benchmarks show is the should-have-been obvious point that MapReduce shouldn't be used to replace DBMS for tasks DBMS are good at. MapReduce applications tend to be concentrated in four areas:
  • Text tokenization, indexing, and search
  • Creation of other kinds of data structures (e.g., graphs)
  • Data mining and machine learning
  • Data transformation

and the benchmarks didn't really speak to any of those. But some of those areas may equally fall victim to the "Don't reinvent the wheel argument."

MapReduce is surely an appealing paradigm for lightweight, reliably-parallel programming. At least for research into parallel algorithms, it has much to recommend it. But whether MapReduce will play a major role going forward in production use seems at this point to still be an open question."

Link to Original Source

One meets his destiny often on the road he takes to avoid it.

Working...