A utility for computing alignment of proteins to genomic nucleotide sequence based on a variation of the Needleman Wunsch global alignment algorithm and specifically accounts for introns and splice signals.
The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.
A collection of Reference Sequence (RefSeq) proteins, from the complete genomes of prokaryotes, plasmids, and organelles, that have been grouped and annotated based on sequence similarity and protein function.
PubChem contains the chemical structures of small organic molecules and information on their biological activities. PubChem includes substance information, compound structures, and bioactivity data in three primary databases, PCSubstance, PCCompound, and PCBioAssay, respectively. PubChem is integrated with Entrez, NCBI's primary search engine, and also provides compound neighboring, sub/superstructure, similarity structure, bioactivity data, and other searching features. Technical documentation at http://pubchem.ncbi.nlm.nih.gov/help.html#faq
PUG provides access to PubChem services via a programmatic interface. Users may download data, initiate chemical structure searches, standardize chemical structures and interact with E-utilities. Access PUG with standard URLs or via SOAP. Technical documentation at http://pubchem.ncbi.nlm.nih.gov/pug/pughelp.html
PubMed comprises more than 26 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full-text content from PubMed Central and publisher web sites.
Not all articles in PMC are available for text mining and other reuse, many have copyright protection, however articles in the PMC Open Access Subset are made available for download under a Creative Commons or similar license that generally allows more liberal redistribution and reuse than a traditional copyrighted work.
Yearly citation totals from each year of the MEDLINE/PubMed Baseline referencing citations back to year 1781. These totals may increase over time for a particular year as new citations are added. For example, 25 citations were listed for the year 1800 in the 2018 MEDLINE/PubMed Baseline, while the 2019 Baseline includes 387 citations for that year.