Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Azkaban serves as a distributed Workflow Manager developed by LinkedIn to address the complexities of Hadoop job dependencies. There were instances where jobs required a specific order of execution, ranging from ETL processes to data analysis applications. Following the release of version 3.0, Azkaban offers two distinct operational modes: the standalone “solo-server” mode and the distributed multiple-executor mode. The solo-server mode utilizes an embedded H2 database, allowing both the web server and executor server to operate within the same process, making it ideal for initial experimentation or small-scale applications. In contrast, the multiple-executor mode is designed for serious production environments, requiring a MySQL database configured with a master-slave arrangement. Ideally, the web server and executor servers are hosted on separate machines to ensure that system upgrades and maintenance do not disrupt user experience. This configuration not only enhances Azkaban’s robustness but also significantly improves its scalability, making it suitable for larger, more complex workflows. By offering these two modes, Azkaban caters to a wide range of user needs, from casual experimentation to enterprise-level deployments.
Description
Connect and centralize data. Transform your data landscape into a flexible powerhouse. Data Virtuality is a data integration platform that allows for instant data access, data centralization, and data governance. Logical Data Warehouse combines materialization and virtualization to provide the best performance. For high data quality, governance, and speed-to-market, create your single source data truth by adding a virtual layer to your existing data environment. Hosted on-premises or in the cloud. Data Virtuality offers three modules: Pipes Professional, Pipes Professional, or Logical Data Warehouse. You can cut down on development time up to 80% Access any data in seconds and automate data workflows with SQL. Rapid BI Prototyping allows for a significantly faster time to market. Data quality is essential for consistent, accurate, and complete data. Metadata repositories can be used to improve master data management.
API Access
Has API
API Access
Has API
Integrations
MySQL
Amazon EMR
Amazon Redshift
Apache Cassandra
Google Cloud BigQuery
InterSystems Caché
Magento
Marketo
Microsoft Dynamics 365
Oracle Cloud Infrastructure
Integrations
MySQL
Amazon EMR
Amazon Redshift
Apache Cassandra
Google Cloud BigQuery
InterSystems Caché
Magento
Marketo
Microsoft Dynamics 365
Oracle Cloud Infrastructure
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Azkaban
Website
azkaban.github.io
Vendor Details
Company Name
Data Virtuality
Founded
2012
Country
Germany
Website
datavirtuality.com
Product Features
Product Features
Data Extraction
Disparate Data Collection
Document Extraction
Email Address Extraction
IP Address Extraction
Image Extraction
Phone Number Extraction
Pricing Extraction
Web Data Extraction
Data Warehouse
Ad hoc Query
Analytics
Data Integration
Data Migration
Data Quality Control
ETL - Extract / Transfer / Load
In-Memory Processing
Match & Merge
ETL
Data Analysis
Data Filtering
Data Quality Control
Job Scheduling
Match & Merge
Metadata Management
Non-Relational Transformations
Version Control
Integration
Dashboard
ETL - Extract / Transform / Load
Metadata Management
Multiple Data Sources
Web Services