Sources of data

Note

This document is a work in progress.

These methods, databases, and software provide RNA structure data that can be used with RNAvigate.

To add a database, method, or software package for obtaining RNA structure data, submit an issue on [Github issues](https://github.com/Weeks-UNC/RNAvigate/issues). Include the name of the database, method or software, a citation, and a brief explanation.

Categories of RNA structure data:

  • post-transcriptional modifications

  • base-pairing

  • through-space interactions

  • solvent accessibility

  • ligand binding

  • ribosome binding

  • protein binding

  • 3D structure

  • conservation and covariation

  • folding landscapes

  • multi-state ensembles

small molecule binding

chem-CLIP-Map-Seq

Frag-MaP

ribosomal profiling

Ribosome profiling https://cshperspectives.cshlp.org/content/early/2018/07/23/cshperspect.a032698

Intra-RNA and RNA-RNA interactions

Note: RNAvigate is currently not well-suited for display or analysis of inter-molecular data

Proximity detection through chimeric reads

transcriptome-wide RNA-RNA inter- and intra-molecular base pairing

RPL, PARIS, SPLASH, LIGR-seq, MARIO, COMRADES (uses psoralen) SHARC, SHAPE-JuMP (uses bifunctional acylation reagents)

single-molecule correlated chemical probing for intra-molecular structural communication

RING-MaP, PAIR-MaP (correlated adduct-formation)

secondary structure

chemical probing of secondary structure, widely used to infer secondary structure

DMS, TMO, glyoxal, EDC (base-pairing status) SHAPE reagents (backbone flexibility) hydroxyl radicals, LASER (NAz) (solvent accessibility)

computational methods for secondary structure determination

RNAStructure, RNAfold (thermodynamics based) Dynalign II, R-scape (covariation based) RNAalifold, TurboFold II (thermodynamics and covariation based) Pfold, ContraFold, ContextFold (machine learning based) SPOT-RNA, MXfold2, CDPfold, DMfold, E2Efold, Ufold (deep learning based)

Protein binding

chemical probing for protein binding

footprinting SHAPE (fSHAPE), LASER/SHAPE (differential probing signal from protein binding)

RNP-MaP (per-nucleotide protein proximity)

enrichment quantification for protein binding

https://www.sciencedirect.com/science/article/pii/S0021925822003647

CLIP-seq/HITS-CLIP, PAR-CLIP, iCLIP, eCLIP, irCLIP, CLASH, vPAR-CL (CLIP-derivative methods) RIP-seq (IP without crosslinking) APEX-seq/Proximity-CLIP (proximity biotinylation) PIP-seq (Formaldehyde crosslink and RNase footprinting)

structure ensembles

experimental ensemble deconvolution

SHAPE-seq, SPET-seq (co-transcriptional folding) Mutate and map (M^2)

thermodynamic ensemble deconvolution

RSample, SLEQ, R2D2, REEFFIT, IRIS (based on fitting thermodynamic models to probing data)

smCCP-based ensemble deconvolution

DANCE-MaP, DRACO, DREEM (based on clustering of DMS-MaP sequencing reads)

Tertiary structure

https://www.nature.com/articles/s41592-022-01623-y

in-vitro experimental methods for determining 3D structure

X-ray crystalography, NMR, cryo-EM, SAXS

computational methods for tertiary structure determination

iFold, SimRNA (ab initio folding methods) FARNA, MC-Sym, FARFAR2 (fragment assembly methods) ARES (deep-learning method)

RNA post-transcriptional modifications

Ψ: Ψ-seq, Pseudo-seq, PSI-seq, CeU-seq, RBS-seq

m5C: Bisulfite sequencing, m5C-RIP, AZA-IP, m5C-miCLIP

m1A: m1A-meRIP, m1A-ID-seq, m1A-seq, m1A-MAP, m1a-Quant-seq

ac4C: acRIP, ac4C-seq

Nm: Nm-seq, RibOxi-seq, RiboMeth-seq, MeTH-seq

m7G: m7G-MAP-seq, m7G-(me)RIP seq, TRAC-seq, AlkAniline-seq, BoRed-seq, m7G-seq

Online databases

codon usage database https://www.kazusa.or.jp/codon/

The RNA families database https://rfam.org/

RNAcentral https://rnacentral.org/ (comprehensive database of non-coding RNA)

MODOMICS (RNA modifications, including their known locations in RNA sequences)

GENCODE high quality reference gene annotation and validation for human and mouse genomes

Ensembl is a genome browser for vertebrate genomes and model organisms

Online computational Tools

RBPmap http://rbpmap.technion.ac.il/ find RNA binding protein sequence motif matches in an RNA of interest ViennaRNA Web Services https://rna.tbi.univie.ac.at/ (thermodynamic folding tools)

FORNA http://rna.tbi.univie.ac.at/forna/ (quickly produce an RNA structure diagram)

R2DT https://rnacentral.org/r2dt (create a secondary structure diagram in standardized orientations)

Clustal Omega http://www.ebi.ac.uk/Tools/msa/clustalo/ (multiple sequence alignments)