GenBank

GenBankĀ® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.

Repository Website


Repository Scope

Data Collection Policy

Research Areas

Life Sciences; Basic Biological and Medical Research; General Genetics; Bioinformatics and Theoretical Biology; Microbiology;

Data Types Encouraged/Permitted

The following data types can be deposited: mRNA or genomic sequence; Complete Microbial Genomes; Whole Genome Shotgun (WGS) Sequences; Transcriptome Shotgun Assembly (TSA) Sequences; High-Throughput Genomic (HTGs) Sequences; Third Party Annotation (TPA); and Targeted Locus Study (TLS)

Data Types Explicitly Prohibited

Noncontiguous sequences; Primer sequences; Protein sequences with no underlying nucleotide submission; Sequence containing a mix of genomic and mRNA sequence; Sequences without a physical counterpart (consensus sequences); Sequences with length less than 200 nucleotides. Raw sequence reads from next generation sequencing platforms should be submitted to the Sequence Read Archive (SRA).

Fee for JHU Researchers to Deposit

No

Data Limit

None listed


Data Access

Option for Data Access

Open Access;

Details on Data Access

Anyone can download files in GenBank. No registration required.


Sensitive Data

Human Data Accepted

Yes

Level of Deidentification Required

Do not include any data that could reveal the personal identity of the source (https://www.ncbi.nlm.nih.gov/genbank/)

Human Participant Data Sharing Policy


Administration

Submission Policy

Required Funder

None listed

Persistent Identifier

Accession number

Data Retention Period

No stated retention period

AI LLM Policy

None listed


re3data

re3data Keywords:

COVID-19; DNA; EST; GSS; STS; bioSample; clone; epigenomics; genomes; metagenomes; nucleotide; sequence data; transcriptome

re3data Repository Contact

info@ncbi.nlm.nih.gov

re3data Record