Comment Re:The Fuck? (Score 1) 175

> Well, for queries on structured data, no, not often at all. Practically never if properly configured.

Why do the people who make those engines disagree with you and advocate hybrid strategies?

> Maybe MySQL does, I don't know. PostgreSQL does not.

EnterpriseDB, which develops Postgres, disagrees with you. That's one reason they have a practice built around supporting IBM's big data platform and themselves author the Postgres Plus Connector for Hadoop.

Comment Re:The Fuck? (Score 1) 175

> but it's not always a win, especially if you don't know what you're doing or why.

I agree. I always start big data conversations with "what query do you want to perform that doesn't work on an RDBMS?" If they can't name even one, then there is no reason to go big data. A dataset under 100 GB, that isn't being read and destroyed quickly, and that can be structured consistently never needs big data. At least one of those three things has to be false.

The problem on /. right now is that lots of the developers who are SQL advocates didn't come up in the pre-SQL years, when the ratio of computational power to storage was much lower than it was in 1990-2005. Techniques that made sense when you could only process, say, 3 GB/hr make sense again today, when storage is often measured in petabytes and it's hard to get 5 GB/s from even the best arrays. Reading 100 TB at 5 GB/s takes about 5.5 hours. If you need answers in 55 seconds, then even a factor of 10 above 5 GB/s (which I've never seen done) won't get you close to your goal. That's my point to most of the SQL advocates: there are use cases, because 100 TB of data in a table is no longer out of reach even for a midsized business.
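
To make the throughput arithmetic concrete, here is a minimal sketch (the figures are the ones above, not benchmarks; Python just to show the math):

    # Back-of-the-envelope full-scan times at the throughputs
    # discussed above. Decimal units: 1 TB = 1000 GB.

    def scan_seconds(table_tb, gb_per_sec):
        """Seconds to read table_tb terabytes at a sustained gb_per_sec."""
        return table_tb * 1000 / gb_per_sec

    print(scan_seconds(100, 5) / 3600)   # 100 TB at 5 GB/s  -> ~5.6 hours
    print(scan_seconds(100, 50))         # even at 50 GB/s   -> 2000 s, not 55 s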

Comment Re:Terrible arguments for Big Data (Score 1) 175

> Why bother lighting the fuse for a full cartesian product blow-up at all?

The main reason is when the blow-up is sparse in practice.

Say, for example, A has 1 million rows and 20 columns, and B has 1 million rows and 20 columns. Column A1 is a link to rows in B. On average a value in A1 will have 3 corresponding rows in B, and pulls from A*B tend to be 5 rows or fewer. It is cheap to just pull the appropriate blocks from A and then the appropriate blocks from B. That's going to be much faster than denormalizing.
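
A minimal sketch of why the sparse case stays cheap (hypothetical tables and column names, just to show the access pattern):

    # Index B on its join key once, then probe it per row of A.
    # Each probe touches only the ~3 matching rows of B, so the total
    # work is |A| + |matches| -- nowhere near a cartesian product.

    def sparse_join(a_rows, b_rows):
        b_index = {}
        for b in b_rows:
            b_index.setdefault(b["key"], []).append(b)
        for a in a_rows:
            for b in b_index.get(a["a1"], ()):  # ~3 rows on average
                yield (a, b)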

Comment Re:The Fuck? (Score 1) 175

And then what? How do you get them to work together? You could add a coordination system between them, so you have a bunch of slave nodes getting dispatches from a master node. Well, to make that work you need your data to be partitioned and your workload to be combinable at the coordination level, with little contact between the coordinator and the slave nodes. That's a big data engine.
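
A toy sketch of that coordination pattern (in-process stand-ins for the master and the slave nodes, not a real engine):

    # The "master" partitions the data, each "slave" computes a partial
    # result with no shared state, and the master combines the partials.
    # That shape -- partitioned data, combinable results, little
    # coordinator/worker chatter -- is the core of a big data engine.

    from concurrent.futures import ProcessPoolExecutor

    def slave(partition):
        return sum(len(record) for record in partition)  # local work only

    def master(data, n_workers=4):
        partitions = [data[i::n_workers] for i in range(n_workers)]
        with ProcessPoolExecutor(n_workers) as pool:
            partials = pool.map(slave, partitions)
        return sum(partials)  # the single combine step

    if __name__ == "__main__":
        print(master(["alpha", "beta", "gamma", "delta"] * 1000))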

Comment Re:The Fuck? (Score 0) 175

Your comment about set theory is nonsense. This is computer science, not math. In math (and especially set theory) a problem has a finite solution if, given any finite amount of computational power C, there exists a finite amount of time T such that the algorithm will arrive at an answer. Computer science is all about making C and T small.
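
Written out (my formalization, not the parent's wording):

    % Math's bar: some finite resources suffice at all.
    \text{finitely solvable: } \forall\, C < \infty \;\; \exists\, T < \infty \; : \; \text{the algorithm halts with the answer within } (C, T)
    % Computer science's bar: how C and T grow with input size n,
    % e.g. whether T(n) stays in O(n^k).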

If you want to have a long conversation get an account.

Comment Re:The Fuck? (Score 1) 175

Absolutely true. SQL technology is much more mature. But just as SQL made sense for client-server workloads in a world where non-SQL, COBOL-based systems were more mature, big data makes sense for non-client-server workloads of the kind that are, from a hardware standpoint, more similar to the old non-SQL workloads.

Comment Re:The Fuck? (Score 4, Informative) 175

> SQL engines are often slower than what?

Than engines designed for massive parallelism, when dealing with workloads which can be effectively processed in parallel.

> Operating on what hypothetical database schema with how many records spread across how many tables?

Generally NoSQL engines use schema-on-read techniques, not schema-on-write; the table structure is imposed during the read. To get some sort of fair comparison, take something like a typical star schema with a much-too-large fact table (think billions or trillions of rows) and a half dozen dimension tables.
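
For concreteness, here is a hypothetical shape for that comparison (invented table and column names):

    # A typical star-schema aggregate: one enormous fact table joined
    # to small dimension tables. The dimensions are trivial; the pain
    # is the scan over billions or trillions of fact rows.
    star_query = """
        SELECT d_date.year, d_store.region, SUM(f.amount)
        FROM fact_sales AS f                      -- billions+ of rows
        JOIN d_date  ON f.date_id  = d_date.id    -- half a dozen small
        JOIN d_store ON f.store_id = d_store.id   -- dimension tables
        GROUP BY d_date.year, d_store.region;
    """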

Or, if you really want to make it worse: the same query while the table is getting 1M writes per second and you want an accurate stream.

> SQL engines have problems with massive parallelism? Why? Which ones?

Because SQL by its nature operates on the table, not on the individual rows. Older database technologies that were row oriented, like what you see on a mainframe or in SAS, work better when the ratio of table size to computation speed is low. Today, because disk storage per dollar has grown so fast, with disk we face many of the same problems that systems in the 1980s faced with tape.

And the answer to the next question is pretty much all of them. The big data SQL engines have the fewest problems, though, and via their execution plans turning into map-reduces they might present a viable long-term solution.

> How well do you *really* know SQL in general and the capabilities of different database engines in particular?

Assume I don't know anything. Oracle, which has the best engine and the best SQL people on the planet, has a guide for hybridization to handle things their engine can't handle well. IBM, which probably comes in second and invented the relational database, produces its own Hadoop / R to handle queries that DB2 (which is, BTW, far better than Oracle at streaming) can't handle. Teradata's engine, which was originally written specifically for larger amounts of data, has for a decade had specific features of another subsystem to do enhanced big data, and they also have guides for hybridization for things even their enhanced engine can't handle. And Microsoft, which writes the 3rd most popular engine, has spent many millions on hybridization strategies. EnterpriseDB (Postgres) fully supports the IBM strategy.

I don't know anyone in the space who agrees with the /. "SQL can do everything" attitude.

> but that portion of the article was ridiculous, and thus far all of the comments in support of it have demonstrated a similar lack of familiarity with actual databases, their operation, or performance tuning.

The article was ridiculous; I said as much in another response. However, the comment I was responding to went much too far in the other direction. As for performance tuning: performance tuning is designed to avoid full table scans and expensive joins. The goal of many hybridization strategies is to take a raw data flow and, using a big data engine, convert it through a relational ETL into a form that can take advantage of indexing and a better execution plan. Tuning doesn't do much good when the initial goal is to do a full table scan.
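
A rough sketch of that division of labor (hypothetical names; the point is only where the full scan lives):

    # The big data stage eats the one unavoidable full pass over the raw
    # flow; only the reduced, structured result lands in the RDBMS,
    # where indexes and the execution plan can actually help.

    def big_data_stage(raw_records):
        totals = {}
        for rec in raw_records:  # the full scan happens here, once
            key = rec["customer_id"]
            totals[key] = totals.get(key, 0) + rec["amount"]
        return totals

    def etl_to_rdbms(cursor, totals):
        cursor.executemany(
            "INSERT INTO customer_totals (customer_id, total) VALUES (?, ?)",
            list(totals.items()),
        )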

Comment Re:The Fuck? (Score 4, Informative) 175

I know SQL pretty well. I agree with you it handles most stuff. That doesn't mean it handles everything.

SQL engines are often slower.
SQL engines have problems with massive parallelism (e.g. often at around 12 CPUs you stop gaining much at all by adding additional CPUs; see the sketch after this list).
SQL engines have problems with complex in-document (i.e. in-blob) searches.
etc...
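
A minimal illustration of that parallelism plateau via Amdahl's law (the 10% serial fraction is an assumption for illustration, not a measurement of any engine):

    # Amdahl's law: with serial fraction s, the speedup on n CPUs is
    # 1 / (s + (1 - s) / n). Planning, locking, and final aggregation
    # all feed the serial fraction in a SQL engine.

    def speedup(n_cpus, s=0.10):  # s = assumed serial fraction
        return 1 / (s + (1 - s) / n_cpus)

    for n in (1, 4, 12, 48, 96):
        print(n, round(speedup(n), 2))
    # 1 -> 1.0, 4 -> 3.08, 12 -> 5.71, 48 -> 8.42, 96 -> 9.14:
    # past ~12 CPUs the curve flattens toward the 1/s = 10x ceiling.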

Comment Terrible arguments for Big Data (Score 1) 175

I'm a big data advocate. I like the idea of engines designed for unstructured data. But the two examples in the article barely even register as difficulties for relational databases: "What if two people share the same address but not the same account? What if you want to have three lines to the address instead of two? Who hasn't tried to fix a relational database by shoehorning too much data into a single column? Or else you end up adding yet another column, and the table grows unbounded."

As for his comments on denormalizing, I'm wondering if he has ever heard of a data warehouse and a star / snowflake schema, both of which handle the "I want cheaper joins" problem without having to denormalize the dimension tables.
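
A minimal sketch of what those schemas buy (invented tables; the fact table carries compact keys and the dimensions stay normalized):

    # Star schema in miniature: joins stay cheap because the fact table
    # holds small integer foreign keys, and each dimension is a small,
    # separate table that never gets denormalized into the fact rows.
    star_schema_ddl = """
        CREATE TABLE d_customer (id INTEGER PRIMARY KEY, name TEXT, region TEXT);
        CREATE TABLE d_product  (id INTEGER PRIMARY KEY, name TEXT, category TEXT);
        CREATE TABLE fact_sales (
            customer_id INTEGER REFERENCES d_customer(id),
            product_id  INTEGER REFERENCES d_product(id),
            amount      NUMERIC
        );
    """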

Comment Re:Causes of hording. (Score 1) 107

The Department of Defense runs servers out of house. Lockheed Martin runs a cloud provider. Many of the country's banks handle it that way. There is no question you can buy better security than any company has internally.

As for running an internal cloud, that's pretty easy, and they could ask a vendor to run the financial IT while keeping all the servers physically on their premises.
