Comment Re:ETL (Score 1) 55
<quote><p>I remember Cloudera saying that most people use hadoop for ETL. Not sure if you've checked, but hadoop is like the ne plus ultra of ETL tools. It's worth a look if you have to transform lots and lots of data.</p></quote>
The problem is you still have to Extract data from other systems, Transform them to make them suitable for Hadoop and Load them in HDFS (or S3). Once that data is available to Hadoop, it becomes extremely powerful.
Practically all analytical systems have the same issue. The reason to use an analytics system, like Hadoop, is because the database is not fast enough to query. I say "fast enough" because even though many databases *could* be fast enough, it become contentious to perform queries that utilize resources required in production.
I'm not holding my breath for ETL companies to arrive that make this initial process easier as each client would have different network, databases, and software that would have to be supported. A better tactic is to work towards publishing streams of data from the start and building an ETL system that can help distribute the leg work across an organization.
The problem is you still have to Extract data from other systems, Transform them to make them suitable for Hadoop and Load them in HDFS (or S3). Once that data is available to Hadoop, it becomes extremely powerful.
Practically all analytical systems have the same issue. The reason to use an analytics system, like Hadoop, is because the database is not fast enough to query. I say "fast enough" because even though many databases *could* be fast enough, it become contentious to perform queries that utilize resources required in production.
I'm not holding my breath for ETL companies to arrive that make this initial process easier as each client would have different network, databases, and software that would have to be supported. A better tactic is to work towards publishing streams of data from the start and building an ETL system that can help distribute the leg work across an organization.