AnalyticsCreator
Accelerate your data journey with AnalyticsCreator—a metadata-driven data warehouse automation solution purpose-built for the Microsoft data ecosystem. AnalyticsCreator simplifies the design, development, and deployment of modern data architectures, including dimensional models, data marts, data vaults, or blended modeling approaches tailored to your business needs.
Seamlessly integrate with Microsoft SQL Server, Azure Synapse Analytics, Microsoft Fabric (including OneLake and SQL Endpoint Lakehouse environments), and Power BI. AnalyticsCreator automates ELT pipeline creation, data modeling, historization, and semantic layer generation—helping reduce tool sprawl and minimizing manual SQL coding.
Designed to support CI/CD pipelines, AnalyticsCreator connects easily with Azure DevOps and GitHub for version-controlled deployments across development, test, and production environments. This ensures faster, error-free releases while maintaining governance and control across your entire data engineering workflow.
Key features include automated documentation, end-to-end data lineage tracking, and adaptive schema evolution—enabling teams to manage change, reduce risk, and maintain auditability at scale. AnalyticsCreator empowers agile data engineering by enabling rapid prototyping and production-grade deployments for Microsoft-centric data initiatives.
By eliminating repetitive manual tasks and deployment risks, AnalyticsCreator allows your team to focus on delivering actionable business insights—accelerating time-to-value for your data products and analytics initiatives.
Learn more
MongoDB Atlas
MongoDB Atlas stands out as the leading cloud database service available, offering unparalleled data distribution and seamless mobility across all major platforms, including AWS, Azure, and Google Cloud. Its built-in automation tools enhance resource management and workload optimization, making it the go-to choice for modern application deployment. As a fully managed service, it ensures best-in-class automation and adheres to established practices that support high availability, scalability, and compliance with stringent data security and privacy regulations. Furthermore, MongoDB Atlas provides robust security controls tailored for your data needs, allowing for the integration of enterprise-grade features that align with existing security protocols and compliance measures. With preconfigured elements for authentication, authorization, and encryption, you can rest assured that your data remains secure and protected at all times. Ultimately, MongoDB Atlas not only simplifies deployment and scaling in the cloud but also fortifies your data with comprehensive security features that adapt to evolving requirements.
Learn more
Evo 2
Evo 2 represents a cutting-edge genomic foundation model that excels in making predictions and designing tasks related to DNA, RNA, and proteins. It employs an advanced deep learning architecture that allows for the modeling of biological sequences with single-nucleotide accuracy, achieving impressive scaling of both compute and memory resources as the context length increases. With a robust training of 40 billion parameters and a context length of 1 megabase, Evo 2 has analyzed over 9 trillion nucleotides sourced from a variety of eukaryotic and prokaryotic genomes. This extensive dataset facilitates Evo 2's ability to conduct zero-shot function predictions across various biological types, including DNA, RNA, and proteins, while also being capable of generating innovative sequences that maintain a plausible genomic structure. The model's versatility has been showcased through its effectiveness in designing operational CRISPR systems and in the identification of mutations that could lead to diseases in human genes. Furthermore, Evo 2 is available to the public on Arc's GitHub repository, and it is also incorporated into the NVIDIA BioNeMo framework, enhancing its accessibility for researchers and developers alike. Its integration into existing platforms signifies a major step forward for genomic modeling and analysis.
Learn more
Genome Analysis Toolkit (GATK)
Created within the Data Sciences Platform at the Broad Institute, this comprehensive toolkit provides an extensive array of features primarily aimed at variant discovery and genotyping. With its robust processing engine and high-performance computing capabilities, it is equipped to manage projects of any magnitude. The GATK has established itself as the industry benchmark for detecting SNPs and indels in both germline DNA and RNA sequencing data. Its functionalities are now broadening to encompass somatic short variant detection as well as addressing copy number variations (CNV) and structural variations (SV). Besides the core variant callers, the GATK incorporates numerous utilities for executing associated tasks, including the processing and quality assurance of high-throughput sequencing data, and it comes bundled with the well-known Picard toolkit. Originally designed for exome and whole genome data generated via Illumina sequencing technology, these tools are versatile enough to be modified for use with various other technologies and study designs. As research evolves, the adaptability of the GATK ensures it remains relevant in diverse genomic investigations.
Learn more