The consensus human HCD libraries were created using more than 10,000 raw files
The libraries in the msp and NIST binary formats were updated on May 19, 2020. The new libraries of consensus spectra contain 86% more peptides than the previous version with 4-15% increase in the number of positive IDs returned for typical samples at FDR=0.01. Previous versions are available on request.
Methods used for building the libraries are presented at the ASMS 2020 conference:
Filtering and optimization of peptide tandem mass spectral libraries.
Authors: Sergey L. Sheetlin, Guanghui Wang, Dmitrii V. Tchekhovskoi , Zheng Zhang, Stephen E. Stein
The poster is available for download .
The meaning of the fields present in the comments of the library spectra can be found here .
File - Description | Size | Link |
---|---|---|
Libraries of selected spectra in plain ASCII text (msp format). This format can be used as an exchange format. It is also directly readable by some software applications, like Skyline, after unzipping: | ||
Library 1 (best): human_hcd_tryp_best.msp.tar.gz (high-quality spectra, mostly tryptic peptides without missed cleavages) Library 2 (good): human_hcd_tryp_good.msp.tar.gz (medium-quality spectra, mostly tryptic peptides with missed cleavages) Library 3 (semi-tryptic): human_hcd_semitryp.msp.tar.gz (high- and medium-quality spectra, mostly semi-tryptic peptides) | 646MB 396MB 599MB | Download 1 Download 2 Download 3 |
Libraries of selected spectra in NIST binary format. The folders generated by extracting these packages can be used directly by MS Search or MSPepSearch and by the MSPepSearch node within Proteome Discoverer: | ||
Library 1 (best): human_hcd_tryp_best.tar.gz Library 2 (good): human_hcd_tryp_good.tar.gz Library 3 (semi-tryptic): human_hcd_semitryp.tar.gz | 1.5GB 937MB 1.4GB | Download 1 Download 2 Download 3 |
uniprot-all.fasta.tar.gz - FASTA file used to create the library. | 7.0MB | Download |
md5_checklist.chk - MD5 hash file. | 1KB | Download |
Library 1 | Libraries 1,2 | Libraries 1,2,3 | |
---|---|---|---|
Total number of Spectra | 398,373 | 598,850 | 911,783 |
Unique Peptide Sequences | 257,122 | 365,188 | 605,677 |
Proteome Coverage | 27.9% | 32.2% | 35.3% |
The decoy library (sequence reversed) were created based on the methods described in the paper:
Zhang, Zheng, et al. “Reverse and random decoy methods for false discovery rate estimation in high mass accuracy peptide spectral library searches.” Journal of Proteome Research 17.2 (2018): 846-857.
Decoy Libraries - Description | Size | Link |
---|---|---|
Libraries of decoy spectra in plain ASCII text (msp format). This format can be used as an exchange format. It is also directly readable by some software applications, like Skyline, after unzipping: | ||
Decoy for Library 1: human_hcd_tryp_best_decoy.msp.tar.gz Decoy for Library 2: human_hcd_tryp_good_decoy.msp.tar.gz Decoy for Library 3: human_hcd_semitryp_decoy.msp.tar.gz | 556MB 352MB 529MB | Download 1 Download 2 Download 3 |
Library of decoy spectra in NIST binary format (two parts). The folders generated by extracting these packages can be used directly by MS Search or MSPepSearch and by the MSPepSearch node within Proteome Discoverer: | ||
Decoy for Library 1: human_hcd_tryp_best_decoy.tar.gz Decoy for Library 2: human_hcd_tryp_good_decoy.tar.gz Decoy for Library 3: human_hcd_semitryp_decoy.tar.gz | 1.5GB 904MB 1.4GB | Download 1 Download 2 Download 3 |
md5_checklist.chk - MD5 hash file. | 1KB | Download |
© Copyright information
These mass spectral libraries are protected by copyright law and may not be re-distributed without a valid Distribution Agreement. To receive such an agreement, contact the Standard Reference Data Program at the National Institute of Standards and Technology by emailing data@nist.gov or calling 301-975-2008.