Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Apache Trafodion serves as a webscale SQL-on-Hadoop solution that facilitates transactional or operational processes within the Apache Hadoop ecosystem. By leveraging the inherent scalability, elasticity, and flexibility of Hadoop, Trafodion enhances its capabilities to ensure transactional integrity, which opens the door for a new wave of big data applications to operate seamlessly on Hadoop. The platform supports the full ANSI SQL language, allowing for JDBC/ODBC connectivity suitable for both Linux and Windows clients. It provides distributed ACID transaction protection that spans multiple statements, tables, and rows, all while delivering performance enhancements specifically designed for OLTP workloads through both compile-time and run-time optimizations. Trafodion is also equipped with a parallel-aware query optimizer that efficiently handles large datasets, enabling developers to utilize their existing SQL knowledge and boost productivity. Furthermore, its distributed ACID transactions maintain data consistency across various rows and tables, making it interoperable with a wide range of existing tools and applications. This solution is neutral to both Hadoop and Linux distributions, providing a straightforward integration path into any existing Hadoop infrastructure. Thus, Apache Trafodion not only enhances the power of Hadoop but also simplifies the development process for users.
Description
MLlib, the machine learning library of Apache Spark, is designed to be highly scalable and integrates effortlessly with Spark's various APIs, accommodating programming languages such as Java, Scala, Python, and R. It provides an extensive range of algorithms and utilities, which encompass classification, regression, clustering, collaborative filtering, and the capabilities to build machine learning pipelines. By harnessing Spark's iterative computation features, MLlib achieves performance improvements that can be as much as 100 times faster than conventional MapReduce methods. Furthermore, it is built to function in a variety of environments, whether on Hadoop, Apache Mesos, Kubernetes, standalone clusters, or within cloud infrastructures, while also being able to access multiple data sources, including HDFS, HBase, and local files. This versatility not only enhances its usability but also establishes MLlib as a powerful tool for executing scalable and efficient machine learning operations in the Apache Spark framework. The combination of speed, flexibility, and a rich set of features renders MLlib an essential resource for data scientists and engineers alike.
API Access
Has API
API Access
Has API
Integrations
Hadoop
Amazon EC2
Apache Cassandra
Apache HBase
Apache Hive
Apache Mesos
Apache Spark
Java
Kubernetes
MapReduce
Integrations
Hadoop
Amazon EC2
Apache Cassandra
Apache HBase
Apache Hive
Apache Mesos
Apache Spark
Java
Kubernetes
MapReduce
Pricing Details
Free
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Apache Software Foundation
Country
United States
Website
trafodion.apache.org
Vendor Details
Company Name
Apache Software Foundation
Founded
1995
Country
United States
Website
spark.apache.org/mllib/
Product Features
Product Features
Machine Learning
Deep Learning
ML Algorithm Library
Model Training
Natural Language Processing (NLP)
Predictive Modeling
Statistical / Mathematical Tools
Templates
Visualization