Package: net4pg 0.1.0
Laura Fancello
net4pg: Handle Ambiguity of Protein Identifications from Shotgun Proteomics
In shotgun proteomics, shared peptides (i.e., peptides that might originate from different proteins sharing homology, from different proteoforms due to alternative mRNA splicing, post-translational modifications, proteolytic cleavages, and/or allelic variants) represent a major source of ambiguity in protein identifications. The 'net4pg' package allows to assess and handle ambiguity of protein identifications. It implements methods for two main applications. First, it allows to represent and quantify ambiguity of protein identifications by means of graph connected components (CCs). In graph theory, CCs are defined as the largest subgraphs in which any two vertices are connected to each other by a path and not connected to any other of the vertices in the supergraph. Here, proteins sharing one or more peptides are thus gathered in the same CC (multi-protein CC), while unambiguous protein identifications constitute CCs with a single protein vertex (single-protein CCs). Therefore, the proportion of single-protein CCs and the size of multi-protein CCs can be used to measure the level of ambiguity of protein identifications. The package implements a strategy to efficiently calculate graph connected components on large datasets and allows to visually inspect them. Secondly, the 'net4pg' package allows to exploit the increasing availability of matched transcriptomic and proteomic datasets to reduce ambiguity of protein identifications. More precisely, it implement a transcriptome-based filtering strategy fundamentally consisting in the removal of those proteins whose corresponding transcript is not expressed in the sample-matched transcriptome. The underlying assumption is that, according to the central dogma of biology, there can be no proteins without the corresponding transcript. Most importantly, the package allows to visually inspect the effect of the filtering on protein identifications and quantify ambiguity before and after filtering by means of graph connected components. As such, it constitutes a reproducible and transparent method to exploit transcriptome information to enhance protein identifications. All methods implemented in the 'net4pg' package are fully described in Fancello and Burger (2022) <doi:10.1186/s13059-022-02701-2>.
Authors:
net4pg_0.1.0.tar.gz
net4pg_0.1.0.zip(r-4.5)net4pg_0.1.0.zip(r-4.4)net4pg_0.1.0.zip(r-4.3)
net4pg_0.1.0.tgz(r-4.4-any)net4pg_0.1.0.tgz(r-4.3-any)
net4pg_0.1.0.tar.gz(r-4.5-noble)net4pg_0.1.0.tar.gz(r-4.4-noble)
net4pg_0.1.0.tgz(r-4.4-emscripten)net4pg_0.1.0.tgz(r-4.3-emscripten)
net4pg.pdf |net4pg.html✨
net4pg/json (API)
# Install 'net4pg' in R: |
install.packages('net4pg', repos = c('https://laurafancello.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/laurafancello/net4pg/issues
Last updated 2 years agofrom:6de9142efc. Checks:OK: 1 NOTE: 6. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 30 2024 |
R-4.5-win | NOTE | Oct 30 2024 |
R-4.5-linux | NOTE | Oct 30 2024 |
R-4.4-win | NOTE | Oct 30 2024 |
R-4.4-mac | NOTE | Oct 30 2024 |
R-4.3-win | NOTE | Oct 30 2024 |
R-4.3-mac | NOTE | Oct 30 2024 |
Exports:cc_compositioncc_statsget_adj_matrixget_ccpeptide_statsplot_ccread_inc_matrixreduce_inc_matrixtranscriptome_filter
Dependencies:BiocGenericsdata.tablegraphlatticemagrittrMatrix
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Get peptides and peptide-to-protein mappings for each connected component | cc_composition |
Provide statistics on the CCs size | cc_stats |
Generate adjacency matrix | get_adj_matrix |
Generate graph and calculate its connected components | get_cc |
Calculate percentage of shared vs specific peptides | peptide_stats |
Plot peptide-to-protein mapping graph | plot_cc |
Read incidence matrix of proteomic identifications | read_inc_matrix |
Reduce size of incidence matrix for downstream analyses | reduce_inc_matrix |
Perform transcriptome-informed post-hoc filtering | transcriptome_filter |