Transcription factors (TFs) are key regulators of intrinsic cellular processes, such as differentiation and development, and of the cellular response to external perturbation through signaling pathways. Most TFs recognize and bind to specific DNA sequences named as transcription factor biding sites, leading to specific spatiotemporal expression patterns of target genes.
Breaking: Five TF target specific resources are collected.
Introduction: JASPAR is a regularly maintained open-access database storing manually curated transcription factors (TF) binding profiles as position frequency matrices (PFMs). PFMs summarize occurrences of each nucleotide at each position in a set of observed TF-DNA interactions. PFMs can be transformed to probabilistic or energistic models to construct position weight matrices (PWMs) or position-specific scoring matrices (PSSMs), which can be used to scan any DNA sequence to predict TF binding sites (TFBSs). The JASPAR database provides TFBSs predicted using the profiles in the CORE collection.
Installation: Different releases of JASPAR can also be accessed through Bioconductor data packages. Currently four JASPAR releases are available, and to install those packages, start R (version “4.2”) and enter: + JASPAR2020:
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("JASPAR2020")
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("JASPAR2018")
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("JASPAR2016")
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install("JASPAR2014")
Application: The JASPAR tutorial can be found athttps://jaspar.genereg.net/docs/. The JASPAR2020 vignette can be found at http://bioconductor.org/packages/release/data/annotation/vignettes/JASPAR2020/inst/doc/JASPAR2020.html. The JASPAR2018 vignette can be found at http://bioconductor.org/packages/release/data/annotation/manuals/JASPAR2018/man/JASPAR2018.pdf. The JASPAR2016 vignette can be found at http://bioconductor.org/packages/release/data/experiment/manuals/JASPAR2016/man/JASPAR2016.pdf. The JASPAR2016 vignette can be found at <http://bioconductor.org/packages/release/data/experiment/manuals/JASPAR2014/man/JASPAR2014.pdf.
Web server: JASPAR.
Introduction: AnimalTFDB is a comprehensive database including classification and annotation of genome-wide transcription factors (TFs) and transcription cofactors in 183 animal genomes. In AnimalTFDB v4.0, there are total 270k TFs which are further classified into 73 families and six categories based on their DNA-binding domain (DBD). Meanwhile, there are also 150k cofactors classified into 82 families and six categories.
Type | Number |
---|---|
Species | 183 |
TFs | 2700000 |
TF Cofactors | 1500000 |
TF Families | 73 |
TF Cofactor Families | 83 |
Application: The AnimalTFDB tutorial can be found athttp://bioinfo.life.hust.edu.cn/AnimalTFDB4/static/site/.
Web server: AnimalTFDB.
Introduction: ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF–gene co-expression from RNAseq studies, TF–target associations from ChIP-seq experiments, and TF–gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries.
Web server: ChEA3.
Introduction: RegNetwork is a database of transcriptional and posttranscriptional regulatory networks in human and mouse. TF and miRNA are two major regulators controlling gene expression. RegNetwork collects the knowledge-based regulatory relationships, as well as some potentially regulatory relationships between the two regulators and targets. It provides a platform of depositing the known and predicted gene regulations in the transcriptional and posttranscriptional levels simultaneously. The knowledge-derived regulatory networks is expected to be greatly beneficial for identifying critical regulatory programs in various context-specific conditions.
Application: The AnimalTFDB tutorial can be found at https://regnetworkweb.org/about.jsp.
Web server: RegNetwork.
Introduction: PlantCARE (Plant Cis-Acting Regulatory Elements Database) is a specialized resource that catalogs cis-regulatory elements (CAREs) in various plant species, providing an intuitive web interface for users to search and navigate its comprehensive dataset. It features detailed information on cis-elements, associated genes, and their functional annotations, supporting gene regulatory network studies and experimental validation efforts. The database allows for comparative analyses across different plant species and includes downloadable datasets for further research. Regular updates and links to relevant scientific literature enhance its utility, making PlantCARE a vital tool for researchers in plant molecular biology and agricultural biotechnology, facilitating insights into gene regulation mechanisms essential for improving crop traits. Especially, it adopts a quality-based clustering method and a motif search algorithm called Motif Sampler, and a probabilistic approach based on Gibbs Sampling, which looks for over-represented motifs in upstream regions and is beneficial for predicting binding motifs of TFs in the gene promoters of plants.
Web server: PlantCARE.