Resources

Updates, Blogs and Articles

  1. Accelerating genomics workflows and data analysis on Azure
  2. Microsoft for Healthcare: new people, products, and partnerships
  3. Harnessing big data in pediatric research to reimagine healthcare
  4. St. Jude Cloud to accelerate scientific discoveries through access to real-time clinical genome sequencing data
  5. Cancer researchers embrace AI to accelerate development of precision medicine
  6. Accelerate precision medicine with Microsoft Genomics
  7. De(con)struction of the lazy-F loop: improving performance of Smith Waterman alignment
  8. Parallel approach to sliding window sums
  9. Exploring the Consistency of the Quality Scores with Machine Learning for Next-Generation Sequencing Experiments
  10. Bioconductor on Microsoft Azure
  11. Nextflow On Microsoft Azure with a Blazor Frontend
  12. Nextflow Development with GitHub Codespaces
  13. Convert Synthetic FHIR and PacBio VCF Data to parquet and Explore with Azure Synapse Analytics
  14. Data Science for Merged FHIR and PacBio VCF Data on Azure Machine Learning Notebooks
  15. RNA sequencing analysis on Azure using Nextflow: configuration files and benchmarking
  16. Genomics workflows on secure lockdown environment using Cromwell on AKS
  17. RNA sequencing analysis on Azure using Nextflow: low-priority vs. dedicated machines comparison
  18. Introducing Scalable and Enterprise-Grade Genomics Workflows in Azure ML

Workflows Management resources on Azure

Design and orchestrate scalable workflows and efficiently manage genomics analysis pipelines and data manipulation tasks using the power of Azure cloud

  1. Cromwell on Azure
  2. Nextflow on Azure

Data Analytics resources on Azure

  1. Genomic Notebooks: Genomics Notebooks brings the power of Jupyter Notebooks on Azure for genomics data analysis using GATK, Picard, Bioconductor, and Python libraries
  2. Bioconductor on Azure:Bioconductor provides hundreds of R based bioinformatics tools for the analysis and comprehension of high-throughput genomic data
  3. Genomics Data Science VM: Azure Virtual Machine templates provide preinstalled and preconfigured tools, libraries and SDKs for data exploration, analysis, and modeling.
  4. FHIR Analytics Pipeline FHIR Analytics Pipelines is an open source project with the goal to help build components and pipelines for rectangularizing and moving FHIR data from Azure FHIR servers namely Azure Healthcare APIs FHIR Server, Azure API for FHIR, and FHIR server for Azure to Azure Data Lake and thereby make it available for analytics with Azure Synapse, Power BI, and Azure Machine Learning.

Open data on Azure

Power your genomics analysis and machine learning models using curated public datasets easily accessible from the Genomics data lake on Azure Open Dataset platform.