GenBank

Repository Name: GenBank
Repository Homepage URL: https://www.ncbi.nlm.nih.gov/genbank/
Repository Description:

Source: https://www.re3data.org/

GenBank® is a comprehensive database that contains publicly available nucleotide sequences for almost 260 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
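
As an illustration of the Entrez access route described above, the following is a minimal sketch that builds a request URL for the NCBI E-utilities `efetch` endpoint and retrieves one record in GenBank flat-file format. It is not part of the repository record itself. The function names `genbank_flatfile_url` and `fetch_genbank_record` are hypothetical helpers introduced here, and the accession `MN908947.3` is only an example value. The sketch assumes anonymous, low-volume access; NCBI's current usage guidelines should be checked before any heavier use.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL of the NCBI E-utilities efetch service.
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def genbank_flatfile_url(accession: str) -> str:
    """Build an efetch URL for one nucleotide record (hypothetical helper).

    db=nuccore selects the nucleotide database; rettype=gb with
    retmode=text requests the GenBank flat-file format.
    """
    params = {
        "db": "nuccore",
        "id": accession,
        "rettype": "gb",
        "retmode": "text",
    }
    return f"{EFETCH_BASE}?{urlencode(params)}"


def fetch_genbank_record(accession: str) -> str:
    """Download the record as text (hypothetical helper; needs network access)."""
    with urlopen(genbank_flatfile_url(accession)) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_genbank_record("MN908947.3")` would return the flat-file text for that accession, assuming the service is reachable. Keeping URL construction separate from the download makes the request easy to inspect before anything is sent.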


Data Collection Policy URL: https://www.ncbi.nlm.nih.gov/genbank/submit_types/
Research Areas: Life Sciences; Basic Biological and Medical Research; General Genetics; Bioinformatics and Theoretical Biology; Microbiology;
Data Types Explicitly Prohibited: The following data types are not accepted by GenBank: Noncontiguous sequences; Primer sequences; Protein sequences with no underlying nucleotide submission; Sequences containing a mix of genomic and mRNA sequence; Sequences without a physical counterpart (consensus sequences); Sequences shorter than 200 nucleotides. Raw sequence reads from next-generation sequencing platforms should be submitted to the Sequence Read Archive (SRA).
Fee for JHU Researchers to Deposit: Yes
Data Limit: None listed
Option for Data Access: Open Access;
Details on Data Access:

Anyone can download files


Human Data Accepted: Yes
Level of Deidentification Required: "do not include any data that could reveal the personal identity of the source" https://www.ncbi.nlm.nih.gov/genbank/
Human Participant Data Sharing Policy URL: https://www.ncbi.nlm.nih.gov/genbank/
Submission Policy URL: https://www.ncbi.nlm.nih.gov/genbank/submit/
Required Funder: None listed
Persistent Identifier: Accession number
Data Retention Period: No stated retention period
AI LLM Policy: None listed
re3data Keywords: COVID-19; DNA; EST; GSS; STS; bioSample; clone; epigenomics; genomes; metagenomes; nucleotide; sequence data; transcriptome
re3data Repository Contact: info@ncbi.nlm.nih.gov
re3data Record URL: https://www.re3data.org/repository/r3d100010528