What Integrates with Protegrity?
Find out what Protegrity integrations exist in 2025. Learn what software and services currently integrate with Protegrity, and sort them by reviews, cost, features, and more. Below is a list of products that Protegrity currently integrates with:
-
1
Google Cloud BigQuery
Google
$0.04 per slot hour 1,710 Ratings
ANSI SQL lets you analyze petabytes of data at lightning-fast speeds with no operational overhead. Analytics at scale comes with 26%-34% lower three-year TCO than cloud-based data warehouse alternatives. You can unleash your insights on a trusted platform that is more secure and scales with you. Multi-cloud analytics lets you gain insights from all types of data. You can query streaming data in real time and get the most current information about all your business processes. Built-in machine learning lets you predict business outcomes quickly without having to move data. With just a few clicks, you can securely access and share analytical insights within your organization. Popular business intelligence tools work out of the box, so you can easily create dashboards and reports. BigQuery's strong security, governance, and reliability controls ensure high availability and a 99.9% uptime SLA. Data is encrypted by default, and customer-managed encryption keys are supported.
-
2
Kubernetes
Kubernetes
Free 1 Rating
Kubernetes (K8s) is an open-source system that automates the deployment, scaling, and management of containerized apps. It groups the containers that make up an app into logical units, which makes them easy to manage and discover. Kubernetes builds on 15 years of Google's experience in running production workloads, combined with best-of-breed practices and ideas from the community. It is designed on the same principles that allow Google to run billions of containers per week, so it can scale without increasing your operations team. Its flexibility allows you to deliver applications consistently and efficiently, no matter how complex they are, whether you're testing locally or running a global enterprise. Because Kubernetes is open source, you can use on-premises, hybrid, and public cloud infrastructure and move workloads to wherever it matters to you.
-
3
Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. Customers of all sizes and industries can use Amazon S3 to store and protect any amount of data for a variety of purposes, including data lakes, websites and mobile applications, backup, restore, archive, enterprise apps, big data analytics, and IoT devices. Amazon S3 offers easy-to-use management tools that allow you to organize your data and set up access controls that are tailored to your business, organizational, or compliance needs. Amazon S3 is designed for 99.999999999% (11 9's) of durability and stores data for millions of applications for companies around the globe. You can scale your storage resources to meet changing demands without having to invest upfront or go through resource procurement cycles.
-
4
Google Cloud Storage
Google
4 Ratings
Object storage for companies of all sizes. You can store any amount of data and retrieve it as often as you like. With Object Lifecycle Management, you can configure your data to transition automatically to lower-cost storage classes when it meets certain criteria, such as when it reaches a certain age or when you have stored a newer version. Cloud Storage offers a growing number of storage locations in which you can store your data, with multiple redundancy options. You can choose where and how to store your data, whether you want to optimize for split-second response or create a robust disaster recovery strategy. Storage Transfer Service and Transfer Service for on-premises data are two highly efficient online routes to Cloud Storage. Both offer the speed and scalability you need to make data transfers faster. Transfer Appliance is a shippable storage device that can be used for offline data transfer.
-
5
Amazon EC2
Amazon
2 Ratings
Amazon Elastic Compute Cloud (Amazon EC2) provides secure, resizable cloud computing capacity. It was designed to make cloud computing at web scale easier for developers. Amazon EC2's web service interface makes it easy to obtain and configure capacity with minimal effort. It gives you complete control over your computing resources and allows you to run on Amazon's proven computing environment.
-
6
Secure and manage the data lifecycle, from Edge to AI, in any cloud or data centre. It operates on all major public clouds as well as the private cloud, with a public cloud experience everywhere. It integrates data management and analytics experiences across the entire data lifecycle. Security, compliance, migration, and metadata management cover all environments. It is open source, extensible, and open to multiple data stores. Self-service analytics is faster, safer, and easier to use. Self-service access to integrated, multi-function analytics on centrally managed business data provides a consistent experience anywhere, whether in the cloud or in a hybrid environment. You get consistent data security, governance, and lineage while deploying the cloud analytics services that business users need, which eliminates the need for shadow IT solutions.
-
7
Apache Hive
Apache Software Foundation
1 Rating
Apache Hive™ is data warehouse software that facilitates reading, writing, and managing large datasets stored in distributed storage using SQL. Structure can be projected onto data that is already in storage. Hive provides a command-line tool and a JDBC driver to allow users to connect to it. Apache Hive is an open-source project of the Apache Software Foundation. It was previously a subproject of Apache® Hadoop®, but it has now become a top-level project. We encourage you to read about the project and share your knowledge. Without Hive, traditional SQL queries must be implemented in the MapReduce Java API. Hive provides the SQL abstraction needed to integrate SQL-like queries (HiveQL) into the underlying Java, so queries do not have to be implemented in the low-level Java API.
-
8
What if your data had a recommendation engine? What if the data inventory were automated, a searchable catalog reflected user behavior, and the system made smart recommendations inline as you typed queries? Alation, the first enterprise-wide collaborative data catalog, makes all this possible. It's a powerful tool that dramatically increases the productivity of analysts and the accuracy of analytics. It also empowers business decision-making for everyone. Alation provides proactive recommendations to data users inside their applications. Google inspired us to create a simple interface that connects the language of your business with the technical schema of your data. Finding the data you need is no longer hard because of complicated semantic translations. Are you unfamiliar with the data environment and unsure which data to use in your query? Alation helps you build your query and provides inline recommendations that indicate whether data is trustworthy.
-
9
MySQL is the world's most widely used open-source database and the most popular open-source database for web-based applications. It has proven to be reliable, performant, and easy to use. It is used by many high-profile web properties, including Facebook, Twitter, and YouTube. It is also a popular choice as an embedded database, distributed by thousands of ISVs and OEMs.
-
10
AWS offers a wide range of services, including database storage, compute power, content delivery, and other functionality. This allows you to build complex applications with greater flexibility, scalability, and reliability. Amazon Web Services (AWS), the world's largest and most widely used cloud platform, offers over 175 fully featured services from more than 150 data centers worldwide. AWS is used by millions of customers, including the fastest-growing startups, large enterprises, and top government agencies, to reduce costs, be more agile, and innovate faster. AWS offers more services and features than any other cloud provider, including infrastructure technologies such as storage and databases, and emerging technologies such as machine learning, artificial intelligence, data lakes, analytics, and the Internet of Things. It is now easier, cheaper, and faster to move your existing apps to the cloud.
-
11
Your cloud data platform. Access any data you need with unlimited scalability. All your data is available to you, with the near-infinite performance and concurrency required by your organization. You can seamlessly share and consume shared data across your organization to collaborate and solve your most difficult business problems. You can increase productivity and reduce time to value by collaborating with data professionals to quickly deliver integrated data solutions from any location in your organization. Our technology partners and system integrators can help you deploy Snowflake successfully, for example when you are moving data into Snowflake.
-
12
Microsoft Entra ID
Microsoft
4 Ratings
Microsoft Entra ID, formerly known as Azure Active Directory, is a comprehensive cloud-based identity and access management solution that combines core directory services, application access management, and advanced identity protection. Cloud identity and access management solutions connect employees, customers, and partners with their apps, devices, and data. Protect data and resources with adaptive access policies and strong authentication without compromising the user experience. Provide quick, easy sign-in across your multicloud environment to keep your users productive and reduce time spent managing passwords. Manage all your identities, and access to your applications, in one central location, whether in the cloud or on-premises, to improve visibility and control.
-
13
One platform, infinite ways for you to connect with your customers and employees. Authentication can be added to any app. Okta can help you create secure and delightful experiences quickly. Okta's Customer Identity products can be combined to create the stack you need, providing security, scalability, and reliability. Protect and empower your employees, contractors, and partners. Okta's workforce identity solutions protect your employees no matter where they are. You will have the tools you need to automate cloud journeys and support hybrid environments. Okta is trusted by companies around the globe to protect their workforce identities.
-
14
SQL Server
Microsoft
Free 2 Ratings
Microsoft SQL Server 2019 includes built-in intelligence and security. You get more without paying extra, as well as best-in-class performance for your on-premises requirements. You can easily migrate to the cloud without having to change any code. Azure makes it easier to gain insights and make better predictions. You can develop with the technology you choose, including open source, backed by Microsoft's innovations. Integrate data into your apps easily and access a rich set of cognitive services to build human-like intelligence at any data scale. AI is built into the data platform, so you can get insights faster from all of your data, both on-premises and in the cloud. To build an intelligence-driven company, combine your enterprise data with the world's data. You can build your apps anywhere with a flexible platform that offers a consistent experience across platforms.
-
15
Amazon Athena
Amazon
2 Ratings
Amazon Athena allows you to easily analyze data in Amazon S3 with standard SQL. Athena is serverless, so there is no infrastructure to maintain and you pay only for the queries you run. Athena is simple to use. Simply point to your data in Amazon S3, define the schema, and start querying with standard SQL. Most results are delivered in a matter of seconds. With Athena, there is no need for complicated ETL jobs to prepare your data for analysis. Anyone with SQL skills can quickly analyze large-scale data sets. Athena integrates with the AWS Glue Data Catalog out of the box. This allows you to create a unified metadata repository across multiple services, crawl data sources to discover schemas, populate your Catalog with new and modified table and partition definitions, and maintain schema versioning.
-
16
Tableau is a comprehensive business intelligence (BI) and analytics solution that allows you to generate, analyze, and interpret business data. Tableau allows users to gather data from many sources, including spreadsheets, SQL databases, and Salesforce. Tableau offers real-time visual analytics as well as an interactive dashboard that allows users to slice and dice data to draw relevant insights and find new opportunities. Users can customize the platform for different industry verticals such as communication, banking, and more.
-
17
Microsoft Power BI
Microsoft
$10 per user per month 8 Ratings
Power BI provides advanced data analysis, leveraging AI features to transform complex datasets into visual insights. It integrates data into a single source, OneLake, reducing duplication and streamlining analysis. The platform enhances decision-making by integrating insights into everyday tools like Microsoft 365 and is bolstered by Microsoft Fabric for data team empowerment. Power BI is scalable, handling extensive data without performance loss, and integrates well with Microsoft's ecosystem for coherent data management. Its AI tools are user-friendly and contribute to efficient and accurate insights, supported by strong data governance measures. The Copilot function in Power BI enables quick and efficient report creation. Power BI Pro licenses individuals for self-service analytics, while the free account offers data connection and visualization capabilities. The platform ensures ease of use and accessibility, backed by comprehensive training. It has shown a notable return on investment and economic benefits, as reported in a Forrester study. Gartner's Magic Quadrant recognizes Power BI for its ability to execute and completeness of vision.
-
18
Amazon RDS
Amazon
$0.01 per month 3 Ratings
Amazon Relational Database Service (Amazon RDS) makes it easy to create, manage, and scale a relational database in the cloud. It offers cost-efficient, resizable capacity and automates time-consuming admin tasks like hardware provisioning, database setup, patching, and backups. This frees you to concentrate on your applications so that you can give them the high performance, high availability, security, and compatibility they require. Amazon RDS is available on several database instance types, optimized for memory, performance, or I/O. It offers six familiar database engines to choose from, including Amazon Aurora, PostgreSQL, MySQL, MariaDB, Oracle Database, and SQL Server. To easily replicate or migrate your existing databases to Amazon RDS, you can use the AWS Database Migration Service.
-
19
Amazon CloudWatch
Amazon
3 Ratings
Amazon CloudWatch is a monitoring and observability service for developers, DevOps engineers, site reliability engineers (SREs), IT managers, and other users. CloudWatch gives you data and actionable insights that help you monitor your applications, respond quickly to system-wide performance changes, and optimize resource utilization. It also provides a unified view of operational health. CloudWatch gathers monitoring and operational data in the form of logs, metrics, and events. This gives you a single view of AWS resources, applications, and services that run on AWS and on-premises. CloudWatch can be used to detect anomalous behavior, set alarms, visualize logs and metrics side by side, take automated actions, troubleshoot problems, and uncover insights to help you keep your applications running smoothly.
-
20
Microsoft Azure
Microsoft
21 Ratings
Microsoft Azure is a cloud computing platform that allows you to quickly develop, test, and manage applications. Invent with purpose on Azure. With more than 100 services, you can turn ideas into solutions. Microsoft continues to innovate to support your development today and your product visions tomorrow. Open source and support for all languages and frameworks allow you to build what you want and deploy wherever you want. We can meet you at the edge, on-premises, or in the cloud. Services for hybrid cloud enable you to integrate and manage your environments. Secure your environment from the ground up with proactive compliance and support from experts. Azure is trusted by startups, governments, and enterprises: the cloud you can trust, with the numbers to prove it.
-
21
Amazon Aurora
Amazon
$0.02 per month 1 Rating
Amazon Aurora is a MySQL- and PostgreSQL-compatible relational database built for the cloud that combines the performance and availability of traditional enterprise databases with the simplicity and cost-effectiveness of open-source databases. Amazon Aurora is up to five times faster than standard MySQL databases and three times faster than standard PostgreSQL databases. It offers the same security, availability, and reliability as commercial databases, but at a fraction of the cost. Amazon Aurora is fully managed by Amazon Relational Database Service (RDS), which automates tedious administration tasks such as hardware provisioning, database setup, patching, and backups. Amazon Aurora features a distributed, fault-tolerant, self-healing storage system that auto-scales up to 64TB per database instance. It offers high availability and performance with up to 15 low-latency read replicas, point-in-time recovery, continuous backup to Amazon S3, and replication across three Availability Zones.
-
22
Azure Synapse Analytics
Microsoft
1 Rating
Azure Synapse is the evolution of Azure SQL Data Warehouse. It is a limitless analytics platform that combines enterprise data warehousing and Big Data analytics. It allows you to query data on your own terms, with either serverless or provisioned resources, at scale. Azure Synapse combines these two worlds in a single experience to ingest, prepare, manage, and serve data for machine learning and BI needs.
-
23
Amazon Redshift
Amazon
$0.25 per hour
Amazon Redshift is preferred by more customers than any other cloud data warehouse. Redshift powers analytic workloads for Fortune 500 companies, startups, and everything in between. Companies such as Lyft have grown with Redshift from startups into multi-billion-dollar enterprises. Redshift makes it easier than any other data warehouse to gain new insights from all of your data. Redshift allows you to query petabytes (or more) of structured and semi-structured data across your operational database, data warehouse, and data lake using standard SQL. Redshift allows you to save query results to your S3 data lake using open formats such as Apache Parquet, so you can analyze them further with other analytics services like Amazon EMR and Amazon Athena. Redshift is the fastest cloud data warehouse in the world, and it gets faster each year. The new RA3 instances can be used for performance-intensive workloads to achieve up to 3x the performance of any other cloud data warehouse.
-
24
Amazon SageMaker
Amazon
Amazon SageMaker is a fully managed service that gives data scientists and developers the ability to quickly build, train, and deploy machine learning (ML) models. SageMaker takes the hard work out of each step in the machine learning process, making it easier to create high-quality models. Traditional ML development can be complex, costly, and iterative. This is made worse by the lack of integrated tools to support the entire machine learning workflow. Stitching tools and workflows together is tedious and error-prone. SageMaker solves the problem by combining all components needed for machine learning into a single toolset. This allows models to be produced faster and with less effort. Amazon SageMaker Studio is a web-based visual interface that allows you to perform all ML development tasks. SageMaker Studio gives you complete control over, and visibility into, each step.
-
25
Azure Blob Storage
Microsoft
$0.00099
Secure, highly scalable object storage for cloud-native workloads. Azure Blob Storage allows you to create data lakes for your analytics needs and provides storage to build powerful cloud and mobile apps. Tiered storage reduces costs, and you can scale up for machine learning and high-performance computing workloads. Blob storage was designed from the ground up for developers of mobile, web, and cloud-native applications, and it supports their scale, security, and availability requirements. It can be used as a foundation for serverless architectures like Azure Functions. Blob storage supports the most popular development frameworks, such as Java, .NET, and Python. It is also the only cloud storage service that offers a premium SSD-based object storage tier to support interactive and low-latency scenarios.
-
26
Azure Key Vault
Microsoft
Key Vault helps you improve data protection and compliance. To protect cloud data, secure key management is crucial. Azure Key Vault can encrypt keys and small secrets, such as passwords, using keys that are stored in hardware security modules (HSMs). You can import or generate keys in HSMs for additional security. Microsoft processes your keys in FIPS-validated HSMs (hardware and firmware): FIPS 140-2 Level 2 for vaults, and FIPS 140-2 Level 3 for HSM pools. With Key Vault, Microsoft doesn't see or extract your keys. You can monitor and audit key usage with Azure logging by piping logs to Azure HDInsight or your security information and event management (SIEM) solution for more analysis and threat detection.
-
27
Azure SQL Database
Microsoft
$0.5218 per vCore-hour
Azure SQL Database is part of the Azure SQL family. It's an intelligent, scalable relational database service that's built for the cloud. It's always available and up to date, and it has AI-powered and automated features that maximize performance and durability. Serverless compute and Hyperscale storage options automatically scale resources as needed, so you can concentrate on building new apps without worrying about resource management or storage size. A fully managed SQL database eliminates the complexity of managing high availability, tuning, and other database tasks. You can accelerate your application development with the only cloud that supports evergreen SQL Server capabilities. Never worry about upgrades or end of support. You can build modern apps with serverless and provisioned compute options.
-
28
Azure Container Registry
Microsoft
$0.167 per day
With a fully managed, geo-replicated instance of OCI distribution, you can build, store, secure, and scan container images and artifacts. Connect across Azure services such as Azure Kubernetes Service, Azure Red Hat OpenShift, and Batch. Geo-replication allows you to efficiently manage a single registry across multiple regions. The OCI artifact repository supports Helm charts, Singularity, and new OCI-supported formats. Container builds, patching, and base image updates are automated, with task scheduling. Integrate security with Azure Active Directory (Azure AD) authentication, role-based access control, Docker Content Trust, and virtual network integration. Azure Container Registry Tasks streamlines the process of building, testing, and pushing images to Azure.
-
29
Teradata Vantage
Teradata
Businesses struggle to find answers as data volumes grow faster than ever. Teradata Vantage™ solves this problem. Vantage uses 100 percent of the available data to uncover real-time intelligence at scale. This is the new era of Pervasive Data Intelligence. All data across the organization is available in one place. You can access it whenever you need it using your preferred languages and tools. Start small and scale compute or storage elastically in the areas that matter for your architecture. Vantage unifies analytics and data lakes in the cloud to enable business intelligence. Data is growing, and business intelligence is becoming more important. Key issues that lead to frustration with existing data analysis platforms include: the lack of the right tools and supportive environment required to achieve quality results; organizations that don't allow or give proper access to the tools users need; and data that is difficult to prepare.
-
30
Cloud Functions
Google
Cloud Functions offers a simple and intuitive user experience. Simply write your code, and Google Cloud will handle the operational infrastructure. You can develop faster by writing small code snippets that react to events. To simplify complex orchestration problems, connect to Google Cloud and third-party cloud services using triggers.
-
31
AWS Glue
Amazon
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analysis. With just a few clicks, you can create and run ETL jobs. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once your data has been cataloged, it is immediately searchable, queryable, and available for ETL.
-
32
HashiCorp Vault
HashiCorp
Secure, store, and tightly control access to tokens, passwords, and certificates to protect secrets and other sensitive data using a UI, CLI, or HTTP API.
-
33
Starburst Enterprise
Starburst Data
Starburst allows you to make better decisions by giving you quick access to all of your data. Your company has more data than ever, but your data teams are still waiting to analyze it. Starburst gives your data teams quick and accurate access to more data. Starburst Enterprise is a fully supported, production-tested, enterprise-grade distribution of open-source Trino (formerly Presto® SQL). It increases performance and security while making it easy for you to deploy, connect, and manage your Trino environment. Starburst allows your team to connect to any source of data, whether it's on-premises, in a cloud, or across a hybrid cloud environment. This allows them to use the analytics tools they already love and access data that lives anywhere.
-
34
Amazon MSK
Amazon
$0.0543 per hour
Amazon MSK is a fully managed service that makes it easy to build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. Amazon MSK allows you to use native Apache Kafka APIs to populate data lakes, stream changes to and from databases, and power machine learning and analytics applications. Apache Kafka clusters are difficult to set up, scale, and manage in production on your own.
-
35
Cloudera Data Platform
Cloudera
The only hybrid data platform that supports modern data architectures and data anywhere. Cloudera is an open-source hybrid data platform that allows you to choose any cloud, any analytics, and any data. Cloudera provides faster and easier data analytics and management for data anywhere with optimal performance, scalability, and security. Cloudera gives you all the benefits of both private and public clouds for a faster time to value and greater IT control. Cloudera allows you to move data, applications, and users in both directions between your data center and multiple clouds, no matter where the data resides.
-
36
Red Hat OpenShift is now available on IBM Cloud. This provides OpenShift developers with a fast and secure way to containerize and deploy enterprise workloads in Kubernetes clusters. OpenShift Container Platform (OCP) is managed by IBM, so you can focus on your core tasks. The service includes automated provisioning and configuration of infrastructure (compute, network, and storage) as well as installation and configuration of OpenShift. It provides automatic scaling, backups, and failure recovery for OpenShift configurations, along with automated upgrades of all components (operating systems, OpenShift components, and cluster services), performance tuning, and security hardening. Security features include image signing, image deployment enforcement, hardware trust, security patch management, and compliance (HIPAA, PCI, SOC2, ISO).
-
37
AWS Marketplace
Amazon
AWS Marketplace is an online catalog that allows customers to discover, buy, deploy, and manage third-party products, data, and services within the AWS ecosystem. It offers thousands of listings in categories such as security, machine learning, business applications, DevOps, and more. AWS Marketplace offers flexible pricing models, such as pay-as-you-go, annual subscriptions, and free trials. It simplifies billing and procurement by consolidating costs in a single AWS bill. It also supports rapid implementation with pre-configured applications that can be launched on AWS infrastructure. This streamlined approach allows companies to accelerate innovation, reduce time to market, and maintain better control over software usage and cost.
-
38
Collibra
Collibra
The Collibra Data Intelligence Cloud offers a best-in-class catalog, flexible governance, continuous quality, and built-in privacy. Support your users with a best-in-class data catalog that includes embedded governance, privacy, and quality. You can raise the bar by ensuring that teams can quickly find, understand, and access data from all sources, including business applications and data science tools, in one central location. Your data deserves privacy. Automate, centralize, and guide workflows to encourage collaboration and operationalize privacy. Collibra Data Lineage gives you the complete story of your data. Automatically map relationships between applications, systems, and reports to provide a context-rich view of the enterprise. Focus on the data that matters most to you and make sure it is accurate, complete, and trustworthy.
-
39
Oracle Database
Oracle
Oracle database products offer customers cost-optimized, high-performance versions of Oracle Database, the world's most popular converged, multi-model database management system. They also include in-memory, NoSQL, and MySQL databases. Oracle Autonomous Database is available on-premises via Oracle Cloud@Customer and in Oracle Cloud Infrastructure. It allows customers to simplify relational database environments and reduce management burdens. Oracle Autonomous Database reduces the complexity of operating and protecting Oracle Database while delivering the highest levels of performance, scalability, and availability to customers. Oracle Database can also be deployed on-premises if customers have network latency and data residency concerns. Customers who depend on specific Oracle Database versions for their applications have full control over which versions they use and when they change.
-
40
PostgreSQL
PostgreSQL Global Development Group
PostgreSQL is a powerful open-source object-relational database system with over 30 years of active development. It has earned a strong reputation for reliability and feature robustness.
-
41
Presto
Presto Foundation
Presto is an open-source distributed SQL query engine that allows interactive analytic queries against any data source, from gigabytes up to petabytes.
-
42
AWS CloudTrail
Amazon
AWS CloudTrail enables compliance, operational auditing, and risk auditing of your AWS account. CloudTrail allows you to log, monitor, and retain account activity related to actions across your AWS infrastructure. CloudTrail gives you an event history of your AWS account activity, including actions taken through the AWS Management Console, AWS SDKs, and command-line tools. This event history makes it easier to perform security analysis, track resource changes, and troubleshoot. CloudTrail can also be used to detect unusual activity in your AWS accounts. These capabilities simplify troubleshooting and operational analysis.
-
43
Apache Spark
Apache Software Foundation
Apache Spark™ is a unified analytics engine for large-scale data processing. It delivers high performance for both streaming and batch data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps, and you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a number of libraries, including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. These libraries can be combined seamlessly in one application. Spark runs on Hadoop YARN, Apache Mesos, and Kubernetes, in standalone cluster mode, on EC2, or in the cloud. It can access a variety of data sources, including HDFS and Alluxio. -
44
Amazon Kinesis
Amazon
Amazon Kinesis makes it easy to collect, process, and analyze video and data streams in real time. It provides key capabilities to process streaming data cost-effectively at any scale, along with the flexibility to select the tools that best fit your application's requirements. Amazon Kinesis allows you to ingest real-time data, including video, audio, website clickstreams, application logs, and IoT data, for machine learning, analytics, or other purposes. It lets you ingest, buffer, and process data as it arrives rather than waiting for all the data to be collected before processing can begin, so you can get insights in seconds or minutes instead of hours or days. -
45
Azure Functions
Microsoft
Functions is an event-driven, serverless computing platform that allows you to develop more efficiently. It can also solve complex orchestration issues. You can build and debug locally, deploy and operate at scale in a cloud environment, and integrate services with triggers and bindings. -
46
Amazon EMR
Amazon
Amazon EMR is a cloud big data platform that processes large amounts of data with open-source tools such as Apache Spark, Apache Hive, and Apache HBase. EMR allows you to run petabyte-scale analysis at a fraction of the cost of traditional on-premises solutions. According to Amazon, it is also 3x faster than standard Apache Spark. For short-running jobs, you can spin clusters up and down and pay per second for the instances you use. For long-running workloads, you can create highly available clusters that scale automatically to meet demand. If you already run open-source tools such as Apache Spark or Apache Hive on-premises, you can also run EMR clusters on AWS Outposts. -
47
Oracle Autonomous Data Warehouse
Oracle
Oracle Autonomous Data Warehouse is a cloud-based data warehouse service that eliminates the complexity of operating a data warehouse. It also makes it easy to secure data and develop data-driven apps. It automates provisioning, tuning, scaling, security, and backups of the data warehouse. It provides tools for self-service data loading, data transformations, business models, and automatic insights. Built-in converged database capabilities allow for simpler queries across multiple types of data as well as machine learning analysis. It is available both in the Oracle public cloud and in customers' data centers using Oracle Cloud@Customer. DSC, an industry expert, has provided a detailed analysis of why it considers Oracle Autonomous Data Warehouse a better choice for most global organizations. Find out about applications and tools compatible with Autonomous Data Warehouse. -
48
Amazon ECR
Amazon
Amazon ECR is a fully managed container registry that allows you to easily store, share, and reliably deploy container images and artifacts anywhere. You can push container images to Amazon ECR without having to install or scale infrastructure, and you can pull images using any management tool. Images are shared and downloaded securely over Hypertext Transfer Protocol Secure (HTTPS), with automatic encryption and access controls. A scalable and durable architecture lets you access and distribute your images faster, reduce download times, and improve availability. You can meet your organization's image compliance and security needs using insights from Common Vulnerabilities and Exposures (CVEs) and the Common Vulnerability Scoring System (CVSS). You can publish containerized applications with a single command and integrate easily with your self-managed environments. -
49
Azure HDInsight
Microsoft
Run popular open-source frameworks--including Apache Hadoop, Spark, Hive, Kafka, and more--using Azure HDInsight, a customizable, enterprise-grade service for open-source analytics. You can process huge amounts of data quickly and enjoy the benefits of the broad open-source project community with the global scale of Azure. You can easily migrate your big data workloads to the cloud. Open-source projects and clusters are quick to set up and easy to manage. Big data clusters can reduce costs through autoscaling and pricing tiers that let you pay only for what you use. Data protection is backed by enterprise-grade security and industry-leading compliance, with over 30 certifications. Optimized components for open-source technologies such as Hadoop and Spark keep you up to date. -
50
Azure Databricks
Microsoft
Azure Databricks allows you to unlock insights from all your data, build artificial intelligence (AI) solutions, and autoscale your Apache Spark™ environment. You can also collaborate on shared projects with other people in an interactive workspace. Azure Databricks supports Python, Scala, R, and Java, as well as data science frameworks and libraries such as TensorFlow, PyTorch, and scikit-learn. Azure Databricks offers the latest version of Apache Spark and allows seamless integration with open-source libraries. You can quickly spin up clusters and build in a fully managed Apache Spark environment that is available worldwide. Clusters are set up, configured, fine-tuned, and monitored to ensure performance and reliability. To reduce total cost of ownership (TCO), take advantage of autoscaling and auto-termination.