GenBank
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
Repository Scope
Research Areas
Life Sciences; Basic Biological and Medical Research; General Genetics; Bioinformatics and Theoretical Biology; Microbiology
Data Types Encouraged/Permitted
The following data types can be deposited: mRNA or genomic sequences; Complete Microbial Genomes; Whole Genome Shotgun (WGS) Sequences; Transcriptome Shotgun Assembly (TSA) Sequences; High-Throughput Genomic (HTGs) Sequences; Third Party Annotation (TPA); and Targeted Locus Study (TLS).
Data Types Explicitly Prohibited
Noncontiguous sequences; Primer sequences; Protein sequences with no underlying nucleotide submission; Sequences containing a mix of genomic and mRNA sequence; Sequences without a physical counterpart (consensus sequences); Sequences shorter than 200 nucleotides. Raw sequence reads from next-generation sequencing platforms should be submitted to the Sequence Read Archive (SRA).
Fee for JHU Researchers to Deposit
No
Data Limit
None listed
Data Access
Option for Data Access
Open Access
Details on Data Access
Anyone can download files from GenBank. No registration is required.
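As an illustration of unauthenticated access, the sketch below builds a request for a single record through NCBI's E-utilities `efetch` endpoint, the programmatic interface to Entrez. This record does not mention E-utilities, so treat the endpoint choice as an assumption about how a reader might script retrieval. The function names are hypothetical, and the accession `U49845` is used only as an example identifier.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# NCBI E-utilities efetch endpoint (assumed access route; not named in this record).
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession, rettype="gb"):
    """Return an efetch URL for one nucleotide record.

    rettype="gb" asks for the GenBank flat-file format; "fasta" asks for FASTA.
    """
    params = {
        "db": "nuccore",      # Entrez nucleotide database
        "id": accession,      # accession number, the persistent identifier
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EFETCH_URL}?{urlencode(params)}"


def fetch_record(accession, rettype="gb", timeout=30):
    """Download one record as text. Requires network access; no login is sent."""
    with urlopen(build_efetch_url(accession, rettype), timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Example accession chosen for illustration only.
    print(build_efetch_url("U49845"))
```

Calling `fetch_record("U49845")` would return the record text when the network is available. For bulk downloads, the FTP releases described above are likely the better route.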
Sensitive Data
Human Data Accepted
Yes
Level of Deidentification Required
Do not include any data that could reveal the personal identity of the source (https://www.ncbi.nlm.nih.gov/genbank/)
Human Participant Data Sharing Policy
Administration
Required Funder
None listed
Persistent Identifier
Accession number
Data Retention Period
No stated retention period
AI LLM Policy
None listed
re3data
re3data Keywords
COVID-19; DNA; EST; GSS; STS; bioSample; clone; epigenomics; genomes; metagenomes; nucleotide; sequence data; transcriptome
re3data Repository Contact
info@ncbi.nlm.nih.gov