Streamlined computational pipeline for genetic background characterization of genetically engineered mice based on next generation sequencing data

Date

2019

Type:

Article

item.page.extent

20 p.

item.page.accessRights

item.contributor.advisor

ORCID:

Journal Title

Journal ISSN

Volume Title

Publisher

item.page.isbn

item.page.issn

item.page.issne

item.page.doiurl

item.page.other

item.page.references

Abstract

Background: Genetically engineered mice (GEM) are essential tools for understanding gene function and disease modeling. Historically, gene targeting was first done in embryonic stem cells (ESCs) derived from the 129 family of inbred strains, leading to a mixed background or congenic mice when crossed with C57BL/6 mice. Depending on the number of backcrosses and breeding strategies, genomic segments from 129-derived ESCs can be introgressed into the C57BL/6 genome, establishing a unique genetic makeup that needs characterization in order to obtain valid conclusions from experiments using GEM lines. Currently, SNP genotyping is used to detect the extent of 129- derived ESC genome introgression into C57BL/6 recipients; however, it fails to detect novel/rare variants. Results: Here, we present a computational pipeline implemented in the Galaxy platform and in BASH/R script to determine genetic introgression of GEM using next generation sequencing data (NGS), such as whole genome sequencing (WGS), whole exome sequencing (WES) and RNA-Seq. The pipeline includes strategies to uncover variants linked to a targeted locus, genome-wide variant visualization, and the identification of potential modifier genes. Although these methods apply to congenic mice, they can also be used to describe variants fixed by genetic drift. As a proof of principle, we analyzed publicly available RNA-Seq data from five congenic knockout (KO) lines and our own RNA-Seq data from the Sall2 KO line. Additionally, we performed target validation using several genetics approaches. Conclusions: We revealed the impact of the 129-derived ESC genome introgression on gene expression, predicted potential modifier genes, and identified potential phenotypic interference in KO lines. Our results demonstrate that our new approach is an effective method to determine genetic introgression of GEM.

Description

item.page.coverage.spatial

item.page.sponsorship

Citation

BMC Genomics, 2019, 20:131

Keywords

Sequencing, Congenic mouse, Knockout mouse, Genomic variation, Genetic interactions, Modifier genes, Genetic background, RNA-Seq variant calling, qPCR validation, Ang, Cdkn1a, Sall2

item.page.dc.rights

item.page.dc.rights.url