基因/蛋白质命名和综述:
HGNC: HUGO Gene Nomenclature Committee
Gene family
GENECARD: Human Gene Database
GIFtS: GeneCards inferred functionality score
Alias, Genomics, Proteins, Variants, Transcripts,
Orthologs, Paralogs, Domains & Families, Function,
Localization, Pathways & Interactions,
Drugs & Compounds, Disorders
基因组注释和基因:
RefSeq/GENCODE: Annotation database
NCBI(refGene)/UCSC(knownGene)/Ensembl(ensGene): Ensembl已融入GENCODE; UCSC也有意反映GENCODE;所以最终还是RefSeq和GENCODE两大派。
CCDS: Consensus Coding Sequence database
DNA变异:
HGVS: Human Genome Variant Society, 仅是突变命名数据库
dbSNP:
HapMap:
1KGP: 1000 Genome Project
ExAC: Exon Aggregation Consortium
gnomAD: Genome Aggregation Database
HGMD: Human Gene Mutation Database
ClinVar:
OMIM: Online Mendelian Inheritance in Man
Regulation element:
ENCODE:
Roadmap:
MethDB:
miRbase:
lncRNAdb:
蛋白质序列和结构
UniProtKB: (P12345) Universal Protein Knowledgebase, 蛋白质的综合信息
RCSB PDB: (XXXX) Protein Data Bank
蛋白质家族和结构域:
Pfam: family,domain
Prosite: family, domain, functional site
SMART: include pfam domain, transmembrane segments, coiled coid regions, signal peptide
CDD(Conserved Domain Database, id: cd1234567),以及CD search工具(基于PSI-BLAST)
(另外有两个结构数据库,CSD和NDB,分别是蛋白质和核酸)
功能注释:
GO: Gene Oncology
KEGG/Rectome: pathway database
STRING: Protein-protein interaction
肿瘤:
TCGA: The Cancer Genome Atlas
Visit TCGA data:
GDC Data Portal
TCGAbiolink/gdc-client
UCSC XenaBrowser
Broad Firebrowser
Tumorportal
cBioportal
COSMIC: