BLAST finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of the matches. Technical documentation is available at http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs
BLAST includes several specialized search interfaces: SmartBLAST, Primer-BLAST, Global Align, CD-Search, IgBLAST, VecScreen, CDART, Multiple Alignment, MOLE-BLAST, Searches at a Cloud Provider, BLAST+ Docker Image
The BioProject database provides an organizational framework to access information about research projects, with links to data that have been or will be deposited into archival databases maintained by members of the International Nucleotide Sequence Database Consortium (INSDC). The INSDC comprises the DNA DataBank of Japan (DDBJ), the European Nucleotide Archive (ENA) at the European Molecular Biology Laboratory, and GenBank at the National Center for Biotechnology Information (NCBI).
The NCBI BioSystems Database provides integrated access to biological systems and their component genes, proteins, and small molecules, as well as literature describing those biosystems and other related data throughout Entrez.
CDD is a protein annotation resource that consists of a collection of well-annotated multiple sequence alignment models for ancient domains and full-length proteins.
Identifies the conserved domains present in a protein sequence. CD-Search uses RPS-BLAST (Reverse Position-Specific BLAST) to compare a query sequence against position-specific score matrices that have been prepared from conserved domain alignments present in the Conserved Domain Database (CDD).
The Entrez Programming Utilities (E-utilities) are a set of eight server-side programs that provide a stable interface into the Entrez query and database system at the National Center for Biotechnology Information (NCBI). The E-utilities use a fixed URL syntax that translates a standard set of input parameters into the values necessary for various NCBI software components to search for and retrieve the requested data. The E-utilities are therefore the structured interface to the Entrez system, which currently includes 38 databases covering a variety of biomedical data, including nucleotide and protein sequences, gene records, three-dimensional molecular structures, and the biomedical literature. Technical Documentation at http://www.ncbi.nlm.nih.gov/books/NBK25501/
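The fixed URL syntax described above can be illustrated with a minimal Python sketch that only builds a request URL and sends nothing over the network. The helper name `build_eutils_url` is hypothetical, introduced here for illustration; the base address, the `esearch` utility, and the `db`, `term`, and `retmax` parameters are assumed to match the E-utilities technical documentation linked above, which should be consulted before relying on them.

```python
from urllib.parse import urlencode

# Assumed base address of the E-utilities; verify against the documentation.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/"


def build_eutils_url(utility, **params):
    """Build the URL for one E-utility call (hypothetical helper).

    `utility` is the program name, for example "esearch"; `params` are
    the input parameters, which are percent-encoded into the query string.
    """
    # Sort the parameters so the resulting URL is deterministic.
    query = urlencode(sorted(params.items()))
    return f"{EUTILS_BASE}{utility}.fcgi?{query}"


# Example: a search of the PubMed database for records with "BLAST" in the title.
url = build_eutils_url("esearch", db="pubmed", term="BLAST[Title]", retmax=5)
print(url)
```

Keeping URL construction separate from the network call makes the parameter encoding easy to inspect before any request is sent.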
Sequence databases in FASTA format for use with the stand-alone BLAST programs. These databases must be formatted using formatdb before they can be used with BLAST.
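For readers unfamiliar with the FASTA format mentioned above, the following is a minimal sketch of a reader, assuming the common convention that each record starts with a `>` header line followed by one or more lines of sequence. The function name `parse_fasta` is hypothetical and the sample sequences are made up for illustration; this is not part of BLAST or of any NCBI tool.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of header -> sequence.

    Assumes each record begins with a line starting with ">" and that
    the sequence may be wrapped across several following lines.
    """
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {h: "".join(parts) for h, parts in records.items()}


# Illustrative input only; these are not real database entries.
sample = """>seq1 example nucleotide record
ACGTACGT
ACGT
>seq2 example protein record
MKTAYIAK
"""
sequences = parse_fasta(sample)
print(sequences)
```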
Genetic Relationship and Fingerprinting (GRAF) is a rapid statistical method to detect duplicates and closely related samples in large genomic datasets to be used as a quality assurance tool in dbGaP data processing. For more information, see the abstract in PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28609482