Proteomics Tools

From BioAssist
Revision as of 15:09, 28 May 2013 by Brian Risk (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

This is a list of Proteomics tools created by the NBIC BioAssist task force

Please feel free to add your tools and your experiences! You can make a new page about a tool if you want to describe it in some detail.


Category Package Description Performance/experience
Peptide identification (DB) Peppy Peppy performs all necessary tasks of six-frame proteogenomic searches of LC-MS/MS data on a genome. This includes genome segmentation, translation and FDR analysis -- all done automatically with no need for intervention. Six-frame searches allow the discovery of novel proteins or protein forms that are specific to disease. None
Infrastructure Data Analysis Framework DAF or Data Analysis Framework is a software package developed in the NBIC BioAssist Proteomics task force to enable transparent distributed computations on the grid or a cluster. This Analysis Framework was developed to fulfill the computational and data analysis requirements of life science data in general and proteomics in particular. None
Framework msCompare MS-compare is a framework that can run multiple tools from different sources in a multiplicative comparison so that you can find the combination that gives the best results for your data. The framework contains a system of file conversion programs. Written by Berend Hoekman (NBIC BioRange)
Conversion Expecter A graphical user interface for the conversion of MS/MS spectra and peptide identifications into PrideXML None
Identification Phenyx MS/MS protein identification software, available for Linux and Windows, Phenyx is scallable in terms of computing nodes and supports many input data formats. Search results can be exported from MS Excel to XML formats
Commercial
Framework TPP The Trans-Proteomic Pipeline (TPP) includes all of the steps of the ISB MS/MS analysis pipeline, after the database search. Most users will only need to download the TPP distrubtion. None
Framework CPAS Computational Proteomics Analysis System. CPAS is a suite of database and analysis tools, originally developed in the FHCRC CPL, that manages proteomics experimental workflows and integrates database search algorithms (X!Tandem, Mascot, and Sequest) and Institute for Systems Biology (ISB) data management tools. None
Framework Labkey Server Derived from CPAS. LabKey Server incorporates a robust data pipeline for importing and processing mass spectrometry data from raw and mzXML data files. The pipeline manages the chain of processing steps needed to infer protein identifications and expression levels from the output of a mass spec machine. The pipeline integrates with leading search engines including X!Tandem™, Mascot™, and Sequest™. The pipeline also integrates PeptideProphet™, ProteinProphet™ , and XPRESS™, all components of the Trans Proteomic Pipeline™ from the Institute for Systems Biology. None
Framework Prequips a Java-based, modular software platform for integration, visualization and analysis of complex proteomics mass spectrometry data sets. The software has a graphical user interface and provides access to data produced by data analysis pipelines such as the Trans-Proteomic Pipeline. It bridges the gap between data processing pipelines and high-level analysis tools used in Systems Biology. None
Framework OpenMS/TOPP OpenMS is an open-source software C++ library for LC/MS data management and analyses. It offers an infrastructure for the development of mass spectrometry related software. OpenMS is free software available under the LGPL. OpenMS will be compatible with the upcoming Proteomics Standard Initiative (PSI) formats for MS data. We also try hard to keep up to date with code documentation and making it easy to use OpenMS. None
Framework ProteoWizard ProteoWizard is a modular and extensible set of open-source, cross-platform tools and software libraries for proteomics data analysis. The libraries enable rapid tool creation by providing a robust, pluggable development framework that simplifies and unifies data file access, and performs standard chemistry and LCMS dataset computations. None
Framework XCMS XCMS is an LC/MS-based data analysis approach which incorporates novel nonlinear retention time alignment, feature detection, and feature matching. Without using internal standards, the method dynamically identifies hundreds of endogenous metabolites for use as standards, calculating a nonlinear retention time correction profile for each sample. Following retention time correction, the relative metabolite ion intensities are directly compared to identify changes in specific endogenous metabolites, such as potential biomarkers. Written in R. None
Framework Mass++ Mass++ is software for mass spectrometer. The software can read and process various MS data format. Mass++ is a plug-in style software. So users can customize Mass++ depending on their purposes such as addition of new functions and deleting unnecessary functions to increase pefomance without editing Mass++ source code. mail list None
Framework mspire mspire - 'Mass Spectrometry Proteomics in Ruby' is a collection of tools for working with MS proteomics data in ruby. It seeks to provide support for open standards (e.g., parsers for mzData, mzXML, Peptide/Protein Prophet and the TPP) and contribute other useful functionality for working with mass spectrometry data in ruby. None
Framework proteios A multi-user platform for analysis and management of proteomics data. Licensed under GPLv2 None
Framework msInspect sInspect allows you to process and view mass spectrometry (MS1) data in an mzXML file. You can use msInspect's built-in tools to inspect data, identify peptide features, generate peptide arrays using data from multiple runs, and export data to external applications for further analysis and collaboration. msInspect is an open source application written in Java. None
Framework Insilicos Proteomics Pipeline The Insilicos platform is a flexible, high-throughput system for analysis of biological samples and identification of biomarkers. Uniquely, the Insilicos platform can achieve nearly complete coverage of biomarker proteins in complex samples. None
Framework PatternLab a “simple to use” yet efficient and panoptic software to addresses shotgun proteomic data analysis that was initialy published in BMC Bioinformatics in 2008. None
Framework Corra s a single, user-friendly, informatic framework, that is simple to use and fully customizable, for the enabling of LC-MS-based quantitative proteomic workflows of any size, able to guide the user seamlessly from MS data generation, through data processing, visualization, and statistical analysis steps, to the identification of differentially abundant or expressed candidate features for prioritized targeted identification by subsequent MS/MS (also in SourceForge) Conclusion
Framework TIQAM TIQAM (Targeted Identification for Quantitative Analysis by MRM) is developed as a suite of software tools to support targeted indentification and quantification using multiple reaction monitoring (MRM) mass spectum technology. More specifically, TIQAM software provided user friendly interfaces to assist process of peptide selection, transition generation and validation. (See also poster) None
Framework Scaffold Visualize and Validate Complex MS/MS Proteomics Experiments
Commercial
Framework PEAKS PEAKS is a suite of tools, each specifically designed to solve a unique problem. One tool, or a selection of tools from the PEAKS family will suit your environment.
Commercial
Framework Rosetta Biosoftware The Rosetta Elucidator® system is a data management and analysis solution for mass spectrometry data, and supports multiple workflows (label-free, SILAC, and shotgun proteomics and metabolomics), data from different instruments, and integration with different search engines for accurate peptide and protein identifications.
Commercial
Framework ProteinPilot For ABI
Commercial
Framework ProteinCenter From Proxeon (formerly Protana). ProteinCenter™ is a web-based data interpretation tool that enables scientists to compare and interpret data sets in minutes instead of months.
Commercial
Framework ViPDAC Virtual Proteomics Data Analysis Cluster. It runs on Amazon Elastic Cloud, have to run an instance in Amazon but the source code is available
Commercial
Mass Accuracy MaxQuant MaxQuant is a quantitative proteomics software package designed for analyzing large mass spectrometric data sets. It is specifically aimed at high resolution MS data. None
Mass Accuracy VIPER VIPER (Visual Inspection of Peak/Elution Relationships) can be used to visualize and characterize the features detected during LC-MS analyses. It is primarily intended for processing deisotoped data from high mass measurement accuracy instruments (e.g. FT, TOF, or Orbitrap) and comparing the features found to a list of expected peptides (aka mass and time (MT) tags), as practiced by the AMT tag approach. None
Mass Accuracy DTASuperCharge A component of MSQuant. It creates MGF peak lists from raw files, retaining the retention time and scan number information required by MSQuant. It requires Xcalibur (including the XDK) and Microsoft .NET 2.0 or later to be installed. It is not clear whether it can be made to run stand-alone, on a system without a full installation of Xcalibur. None
Peptide identification (DB) Mascot Mascot is a mass spectral search algorithm that uses mass spectrometry data to identify proteins from primary sequence databases.
Commercial
The most popular search engine.
Peptide identification (DB) Sequest SEQUEST 3G is the latest, next-generation SEQUEST standard specifically developed and optimized for translational proteomic applications involving phosphorylation and other post-translational modifications (PTMs). SEQUEST 3G defines a single common standard for similarity scores, search parameters and statistics, and input/output file formats and is ideal for noisy and poor quality spectra.
Commercial
Peptide identification (DB) Phenyx henyx enables proteomics scientists to 1) submit MS/MS data and identify peptides and proteins 2) visualize and evaluate results using various dynamic views, 3) manually validate results and compare Phenyx runs and those of other database search engines (like Mascot, SEQUEST and X!Tandem), 4) perform quantitative analysis with the Phenyx quantitation module, and 5) export results and generate reports into various formats and integrate Phenyx into your laboratory workflow.
Commercial
Peptide identification (DB) Paragon
Commercial
Peptide identification (DB) Byonics combines traditional MS/MS matching with de novo sequencing.
Commercial
Peptide identification (DB) X!Tandem This software has a very simple, sophisticated application programming interface (API): it simply takes an XML file of instructions on its command line, and outputs the results into an XML file, which has been specified in the input XML file. Unlike some earlier generation search engines, all of the X! Series search engines calculate statistical confidence (expectation values) for all of the individual spectrum-to-sequence assignments. None
Peptide identification (DB) OMSSA The Open Mass Spectrometry Search Algorithm [OMSSA] is an efficient search engine for identifying MS/MS peptide spectra by searching libraries of known protein sequences. OMSSA scores significant hits with a probability score developed using classical hypothesis testing, the same statistical method used in BLAST. OMSSA is free and in the public domain. None
Peptide identification (DB) InsPecT None
Peptide identification (DB) SpectrumMill None
Peptide identification (DB) Crux None
Peptide identification (DB) MyriMatch None
Peptide identification (DB) PepSlice None
Peptide identification (De novo) PEAKs
Commercial
Peptide identification (De novo) Lutefisk Lutefisk is software for the de novo interpretation of peptide CID spectra None
Peptide identification (De novo) MSNovo None
Peptide identification (De novo) EigenMS None
Peptide identification (De novo) NovoHMM None
Peptide identification (De novo) Audens None
Peptide identification (De novo) RAId None
Peptide identification (lib match) SpectraST (part of TPP) SpectraST (short for "Spectra Search Tool" and rhymes with "contrast") is a spectral library building and searching tool designed primarily for shotgun proteomics applications. None
Peptide identification (lib match) BiblioSpec BiblioSpec is a suite of software tools for creating and searching MS/MS peptide spectrum libraries. BiblioSpec is available free of charge for noncommercial use through an interactive web-site None
Raw Data Xcalibur Thermo. Windows application to control the instrument and basic data processing. Plays well with Sequest and TPP None
Raw Data Bioworks Thermo. Based on Xcalibur. More advanced data processing. New version is called ProteomeDiscoverer. None
Raw Data Analyst. ABI None
Raw Data MassLynx. Waters None
Raw Data MassHunter. Agilent None
Raw Data DataAnalysis. Bruker None
Raw Data ProteomeDiscoverer Thermo. New version of Bioworks None
Raw Data / Conversion X2XML converts different RAW files to mzXML None
Results processing Pride Converter Converts different peak list files and results different search engine results files to PRIDE XML. None
Results processing Mascot parser From matrix science None
Results processing OMSSA parser MSSA Parser is a Java based parser for OMSSA (Open Mass Spectrometry Search Algorithm) omx files. Initially created by Steffen Huber for Prof. Dr. Albert Sickmann under the guidance of Dr. Lennart Martens, and further developed by Harald Barsnes, including the addition of a simple, lightweight and platform independent OMSSA Viewer. None
Results processing Mascot datfile Parses Mascot output None
Results processing MassSieve MassSieve is a free tool for parsing MS/MS search engine results. It allows you to combine the results of several searches into a single experiment and parsimoniously separate the results. Many different experiments can be loaded and the results can be compared and contrasted. None
Quantitation MsQuant MSQuant is a tool for quantitative proteomics/mass spectrometry and processes spectra and LC runs to find quantitative information about proteins and peptides. Though automated it also allows manual inspection and change. MSQuant’s main purpose is to make quantitation of proteins and peptides possible in the area of mass spectrometry/proteomics (science/analytical chemistry/molecular biology). E.g. an experiment where one heavy isotope-labelled sample from treated cells is compared to unlabelled, untreated cells to give quantitative answers about cellular processees. None
Quantitation MaxQuant MaxQuant is a quantitative proteomics software package designed for analyzing large mass spectrometric data sets. It is specifically aimed at high resolution MS data None
Quantitation Census A quantitative analysis software tool for mass spectrometry-based proteomics. Sung Kyu Park, John D Venable, Tao Xu, John R Yates III, Nature Methods, 2008, 5, 319-322. None
Protein Inference ProteinProphet roteinProphet(TM) automatically validates protein identifications made on the basis of peptides assigned to MS/MS spectra by database search programs such as SEQUEST. Importantly, it addresses two critical issues: (1) Peptides corresponding to 'single-hit' proteins (those to which no other peptides correspond) are less likely to be correct that those corresponding to 'multi-hit' proteins. (2) Many peptides are present in more than a single database protein entry. None
Protein Inference IDPicker Accurate, transparent, and parsimonious protein assembly for MS/MS search results None
Protein Inference Combyne Byne integrates ByOnic's peptide identifications into a list of proteins, ranked by confidence (p-value). The combination of ByOnic and ComByne greatly outperforms current standards, such as Mascot and ProteinProphet. ByOnic typically identifies 50 to 100 percent more spectra. On complex samples with many low-abundance proteins, ComByne then identifies 30 to 50 percent more proteins. None
Protein Inference EBP Empirical Bayes Protein identification algorithm. EBP is the first generally applicable protein identification method that can combine the results of multiple search algorithms and replicated LC MS/MS experiments. EBP is compatible with the Trans-Proteomic Pipeline (TPP) from http://systemsbiology.org and can be used as an alternative to the ProteinProphet algorithm. None

Other lists on the net