Supported Tools

MultiQC currently has modules to support 146 different bioinformatics tools, listed below.

Click the tool name to go to the MultiQC documentation for that tool.

Missing something? If you would like another tool to to be support, please open an issue .

Search for a tool:

Tool

Tool Name

Description

Removes adapter sequences and trims low quality bases from the 3' end of reads. Overlapping paired-ended reads can be merged into consensus sequences and adapter sequence can be found for paired-ended data if not known.

AfterQC

Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.

Anglerfish

Anglerfish assesses contamination and composition of Illumina sequencing libraries based on a Nanopore trial sequencing with high concordance.

Bakta

Rapid & standardized annotation of bacterial genomes, MAGs & plasmids.

Bamdst

Bamdst is a lightweight tool to stat the depth coverage of target regions of bam file(s).

Bamtools

BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files.

BBDuk

Tool for common data-quality-related trimming, filtering, and masking operations

BBMap

BBMap is a suite of pre-processing, assembly, alignment, and statistics tools for DNA/RNA sequencing reads.

Bcftools

BCFtools is a set of utilities that manipulate variant calls in the Variant Call Format (VCF) and its binary counterpart BCF.

bcl2fastq

bcl2fastq can be used to both demultiplex data and convert BCL files to FASTQ file formats for downstream analysis.

BCL Convert

bclconvert can be used to both demultiplex data and convert BCL files to FASTQ file formats for downstream analysis.

biobambam2

biobambam2 contains tools for processing BAM files for early stage alignment file processing

BioBloom Tools

BioBloom Tools assigns reads to different references using bloom filters. This is faster than alignment and can be used for contamination detection.

BISCUIT

BISCUIT is a software tool suite for analyzing bisulfite-converted DNA sequencing.

Bismark

Bismark is a tool to map bisulfite converted sequence reads and determine cytosine methylation states.

Bowtie 1

Bowtie 1 is an ultrafast, memory-efficient short read aligner.

Bowtie 2

Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.

Bracken

is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample

BUSCO

BUSCO assesses genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs.

bustools

BUS format is a file format for single-cell RNA-seq data designed to facilitate the development of modular workflows for data processing.

CCS

CCS is a PacBio tool that generates highly accurate single-molecule consensus reads (HiFi Reads).

Cell Ranger

Summarise quality metrics from Cell Ranger count and vdj.

CheckQC

CheckQC is a program designed to check a set of quality criteria against an Illumina runfolder.

ClipAndMerge

adapter clipping and read merging in ancient DNA analysis

Cluster Flow

Cluster Flow is a simple and flexible bioinformatics pipeline tool.

Conpair

Conpair estimates concordance and contamination for tumour–normal pairs

Cutadapt

Cutadapt is a tool to find and remove adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads.

DamageProfiler

DNA damage investigation tool for ancient DNA analysis

DeDup

Improved Duplicate Removal for merged/collapsed reads in ancient DNA analysis

deepTools

Tools to process and analyze deep sequencing data.

DIAMOND

DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data.

Disambiguate

Disambiguation algorithm for reads aligned to two species (e.g. human and mouse genomes) from Tophat, Hisat2, STAR or BWA mem.

DRAGEN

Illumina Bio-IT Platform that uses FPGA for secondary NGS analysis.

DRAGEN-FastQC

Illumina Bio-IT Platform that uses FPGA for accelerated primary and secondary analysis

EigenStratDatabaseTools

A set of tools to compare and manipulate the contents of EingenStrat databases, and to calculate SNP coverage statistics in such databases.

Fastp

An ultra-fast all-in-one FASTQ preprocessor

FastQ Screen

FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.

FastQC

FastQC is a quality control tool for high throughput sequence data, written by Simon Andrews at the Babraham Institute in Cambridge.

featureCounts

featureCounts is a highly efficient general-purpose read summarization program that counts mapped reads for genomic features such as genes, exons, promoter, gene bodies, genomic bins and chromosomal locations.

Fgbio

Fgbio can be used for processing and evaluating data containing UMIs

Filtlong

Filtlong is a tool for filtering long reads by quality.

FLASh

FLASH (Fast Length Adjustment of SHort reads) is a very fast and accurate software tool to merge paired-end reads from NGS data.

Flexbar

Flexible barcode and adapter removal

Freyja

Freyja: Recover relative lineage abundances from mixed SARS-CoV-2 samples.

GATK

Variant Discovery in High-Throughput Sequencing Data

GffCompare

A tool to compare, merge and annotate one or more GFF files with a reference annotation in GFF format.

goleft indexcov

Quickly estimate coverage from a whole-genome bam index, providing 16KB resolution. This is useful as a quick QC to get coverage values across the genome.

GoPeaks

GoPeaks is used to call peaks in CUT&TAG/CUT&RUN datasets.

Hap.py

Hap.py is a set of programs based on htslib to benchmark variant calls against gold standard truth datasets. Som.py output not currently supported.

HiCExplorer

HiCexplorer addresses the common tasks of Hi-C analysis from processing to visualization.

HiC-Pro

HiC-Pro is an optimized and flexible pipeline for Hi-C data processing.

HiCUP

HiCUP (Hi-C User Pipeline) is a tool for mapping and performing quality control on Hi-C data.

HiFiasm

A haplotype-resolved assembler for accurate Hifi reads

HISAT2

HISAT2 is a fast and sensitive alignment program for mapping NGS reads (both DNA and RNA) to reference genomes.

HOMER

HOMER is a suite of tools for Motif Discovery and next-gen sequencing analysis.

HOPS

This tool performs screening of output from the ancient DNA optimised BLAST-replacement tool MALT, to identify taxa that have expected ancient DNA characteristics.

hostile

Hostile removes host sequences from short and long read (meta)genomes, from paired or unpaired fastq[.gz] input.

HTSeq

HTSeq is a Python package that provides infrastructure to process data from high-throughput sequencing assays. HTSeq-count takes a file with aligned sequencing reads, plus a list of genomic features and counts how many reads map to each feature.

HUMID

High-performance UMI deduplicator

InterOp

The Illumina InterOp libraries are a set of common routines used for reading and writing InterOp metric files. These metric files are binary files produced during a run providing detailed statistics about a run. In a few cases, the metric files are produced after a run during secondary analysis (index metrics) or for faster display of a subset of the original data (collapsed quality scores).

Iso-Seq

Iso-Seq contains the newest tools to identify transcripts in PacBio single-molecule sequencing data (HiFi reads).

iVar

Functions for viral amplicon-based sequencing.

JCVI Genome Annotation

A tool to compute statistics on genome annotation.

Jellyfish

JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA.

Kaiju

Fast and sensitive taxonomic classification for metagenomics

Kallisto

kallisto is a program for quantifying abundances of transcripts from RNA-Seq data.

KAT

The K-mer Analysis Toolkit (KAT) contains a number of tools that analyse and compare K-mer spectra.

Kraken

is a taxonomic classification tool that uses exact k-mer matches to find the lowest common ancestor (LCA) of a given sequence.

leeHom

leeHom is a program for the Bayesian reconstruction of ancient DNA

Librarian

A tool to predict the sequencing library type from the base composition of a supplied FastQ file.

Lima

Lima, the PacBio barcode demultiplexer, is the standard tool to identify barcode sequences in PacBio single-molecule sequencing data. Starting in SMRT Link v5.1.0, it is the tool that powers the Demultiplex Barcodes GUI-based analysis application.

Longranger

A set of analysis pipelines that perform sample demultiplexing, barcode processing, alignment, quality control, variant calling, phasing, and structural variant calling.

MACS2

MACS2 identifies transcription factor binding sites in ChIP-seq data.

MALT

MEGAN alignment tool

mapDamage

mapDamage: tracking and quantifying damage patterns in ancient DNA sequences

MEGAHIT

ultra-fast and memory-efficient NGS assembler

MetaPhlAn

MetaPhlAn is a computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data.

methylQA

methylQA is a methylation sequencing data quality assessment tool.

minionqc

Quality control for long reads from ONT (Oxford Nanopore Technologies) sequencing.

mirtop

Command line tool to annotate miRNAs with a standard mirna/isomir naming

miRTrace

miRTrace, developed by the team of Marc Friedländer (KTH, Sweden), is a quality control software for small RNA sequencing data.

mosdepth

Mosdepth performs fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing.

mOTUs

Microbial profiling through marker gene (MG)-based operational taxonomic units (mOTUs)

MTNucRatio

A simple tool to compute mitochondrial to nuclear genome ratios.

MultiVCFAnalyzer

MultiVCFAnalyzer collects multiple VCF files and outputs combined genotype calls in a number of file formats.

NanoStat

Calculate various statistics from a long read sequencing dataset in FastQ, BAM or albacore sequencing summary format (supports NanoPack; NanoPlot, NanoComp).

Nextclade

Viral genome alignment, clade assignment, mutation calling, and quality checks

ngsderive

finds information about sequencing libraries by backwards computing sequencing data.

Nonpareil

Estimate metagenomic coverage and sequence diversity.

odgi

is an optimized dynamic graph/genome implementation.

OptiType

Precision HLA typing from next-generation sequencing data

Pangolin

Pangolin uses variant calls to assign SARS-CoV-2 genome sequences to global lineages.

pbmarkdup

pbmarkdup takes one or multiple sequencing chips of an amplified libray as HiFi reads and marks or removes duplicates.

Peddy

Peddy calculates genotype :: pedigree correspondence checks, ancestry checks and sex checks using VCF files.

phantompeakqualtools

Computes enrichment and quality measures for ChIP-seq/DNase-seq/FAIRE-seq/MNase-seq data.

Picard

Picard is a set of Java command line tools for manipulating high-throughput sequencing data.

Porechop

Porechop is a tool for finding and removing adapters from Oxford Nanopore reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity.

Preseq

Preseq estimates the complexity of a library, showing how many additional unique reads are sequenced for increasing total read count.

PRINSEQ++

PRINSEQ++ is a C++ implementation of the prinseq-lite.pl program.

Prokka

Prokka is a software tool for the rapid annotation of prokaryotic genomes.

PURPLE

A purity, ploidy and copy number estimator for whole genome tumor data

Pychopper

is a tool to identify, orient and trim full-length Nanopore cDNA reads. The tool is also able to rescue fused reads.

pycoQC

PycoQC computes metrics and generates interactive QC plots for Oxford Nanopore technologies sequencing data

qc3C

Reference-free quality control for Hi-C DNA sequencing libraries

QoRTs

QoRTs is a fast, efficient, and portable toolkit designed to assist in the analysis, QC and data management of RNA-Seq datasets.

Qualimap

Qualimap is a platform-independent application to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

QUAST

A Quality Assessment Tool for Genome Assemblies by the Center for Algorithmic Biotechnology.

RNA-SeQC

Fast, efficient RNA-Seq metrics for quality control and process optimization

Rockhopper

Rockhopper is a comprehensive and user-friendly system for computational analysis of bacterial RNA-seq data.

RSEM

RSEM (RNA-Seq by Expectation-Maximization) is a software package for estimating gene and isoform expression levels from RNA-Seq data.

RSeQC

RSeQC is a package that provides a number of useful modules that can comprehensively evaluate high throughput RNA-seq data.

Salmon

Salmon is a tool for quantifying the expression of transcripts using RNA-seq data.

Sambamba

Sambamba is a suite of programs written in the D Language for users to process high-throughput sequencing data.

Samblaster

Samblaster is a tool to mark duplicates and extract discordant and split reads from sam files.

Samtools

Samtools is a suite of programs for interacting with high-throughput sequencing data.

Sargasso

Sargasso is a tool to separate mixed-species RNA-seq reads according to their species of origin.

Seqera Platform CLI

Reports statistics generated by the Seqera Platform CLI.

Sequali

Sequali is a sequencing data quality control tool suitable for both long-read and short-read data. It features adapter search, overrepresented sequence analysis and duplication analysis and supports FASTQ and uBAM inputs.

SeqWho

SeqWho is a reliable and extremely rapid program designed to determine a FASTQ(A) sequencing file identity, both source protocol and species of origin.

SeqyClean

SeqyClean is a comprehensive preprocessing software application for NGS reads.

Sex.DetErrMine

A python script to calculate the relative coverage of X and Y chromosomes, and their associated error bars, from the depth of coverage at specified SNPs.

Sickle

Windowed Adaptive Trimming for FastQ files using quality

Skewer

Skewer is an adapter trimming tool specially designed for processing next-generation sequencing (NGS) paired-end sequences.

Slamdunk

Slamdunk is a tool to analyze SLAM-Seq data.

Snippy

Rapid haploid variant calling and core genome alignment.

SnpEff

SnpEff is a genetic variant annotation and effect prediction toolbox. It annotates and predicts the effects of variants on genes (such as amino acid changes).

SNPsplit

SNPsplit is an allele-specific alignment sorter, which is designed to read in alignment files in SAM/BAM format and determine the allelic origin of reads that cover known SNP positions.

Somalier

Somalier does fast genotype :: pedigree correspondence checks from BAM/CRAM/VCF

SortMeRNA

SortMeRNA is a program tool for filtering, mapping and OTU-picking NGS reads in metatranscriptomic and metagenomic data.

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Space Ranger

Summarise quality metrics from 10x Genomics Space Ranger count.

Stacks

Stacks is a software for analyzing restriction enzyme-based data (e.g. RAD-seq)

STAR

STAR is an ultrafast universal RNA-seq aligner.

Supernova

Supernova is a de novo genome assembler for 10X Genomics linked-reads.

THeTA2

THeTA2 estimates tumour purity and clonal / subclonal copy number.

TopHat

TopHat is a fast splice junction mapper for RNA-Seq reads. It aligns RNA-Seq reads to mammalian-sized genomes.

Trimmomatic

Trimmomatic is a flexible read trimming tool for Illumina NGS data

Truvari

Truvari is a toolkit for benchmarking, merging, and annotating structural variants

UMI-tools

UMI-tools contains tools for dealing with Unique Molecular Identifiers (UMIs) / Random Molecular Tags (RMTs) and single cell RNA-Seq cell barcodes.

VarScan2

Variant detection in massively parallel sequencing data

VCFTools

VCFTools is a program for working with and reporting on VCF files.

VEP

Ensembl VEP determines the effect of your variants on genes, transcripts and protein sequences, as well as regulatory regions.

VerifyBAMID

VerifyBamID checks whether reads match known genotypes or are contaminated as a mixture of two samples.

WhatsHap

WhatsHap is a software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. It is especially suitable for long reads, but works also well with short reads.

xengsort

Fast xenograft read sorter based on space-efficient k-mer hashing

Xenome

Xenome is a tool for classifying reads from xenograft sources.