Package: net4pg 0.1.0

Laura Fancello

net4pg: Handle Ambiguity of Protein Identifications from Shotgun Proteomics

In shotgun proteomics, shared peptides (i.e., peptides that might originate from different proteins sharing homology, from different proteoforms due to alternative mRNA splicing, post-translational modifications, proteolytic cleavages, and/or allelic variants) represent a major source of ambiguity in protein identifications. The 'net4pg' package allows to assess and handle ambiguity of protein identifications. It implements methods for two main applications. First, it allows to represent and quantify ambiguity of protein identifications by means of graph connected components (CCs). In graph theory, CCs are defined as the largest subgraphs in which any two vertices are connected to each other by a path and not connected to any other of the vertices in the supergraph. Here, proteins sharing one or more peptides are thus gathered in the same CC (multi-protein CC), while unambiguous protein identifications constitute CCs with a single protein vertex (single-protein CCs). Therefore, the proportion of single-protein CCs and the size of multi-protein CCs can be used to measure the level of ambiguity of protein identifications. The package implements a strategy to efficiently calculate graph connected components on large datasets and allows to visually inspect them. Secondly, the 'net4pg' package allows to exploit the increasing availability of matched transcriptomic and proteomic datasets to reduce ambiguity of protein identifications. More precisely, it implement a transcriptome-based filtering strategy fundamentally consisting in the removal of those proteins whose corresponding transcript is not expressed in the sample-matched transcriptome. The underlying assumption is that, according to the central dogma of biology, there can be no proteins without the corresponding transcript. Most importantly, the package allows to visually inspect the effect of the filtering on protein identifications and quantify ambiguity before and after filtering by means of graph connected components. As such, it constitutes a reproducible and transparent method to exploit transcriptome information to enhance protein identifications. All methods implemented in the 'net4pg' package are fully described in Fancello and Burger (2022) <doi:10.1186/s13059-022-02701-2>.

Authors:Laura Fancello [aut, cre], Thomas Burger [aut, ctb]

net4pg_0.1.0.tar.gz
net4pg_0.1.0.zip(r-4.5)net4pg_0.1.0.zip(r-4.4)net4pg_0.1.0.zip(r-4.3)
net4pg_0.1.0.tgz(r-4.4-any)net4pg_0.1.0.tgz(r-4.3-any)
net4pg_0.1.0.tar.gz(r-4.5-noble)net4pg_0.1.0.tar.gz(r-4.4-noble)
net4pg_0.1.0.tgz(r-4.4-emscripten)net4pg_0.1.0.tgz(r-4.3-emscripten)
net4pg.pdf |net4pg.html
net4pg/json (API)

# Install 'net4pg' in R:
install.packages('net4pg', repos = c('https://laurafancello.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/laurafancello/net4pg/issues

On CRAN:

4.00 score 2 stars 3 scripts 229 downloads 9 exports 6 dependencies

Last updated 2 years agofrom:6de9142efc. Checks:OK: 1 NOTE: 6. Indexed: yes.

TargetResultDate
Doc / VignettesOKOct 30 2024
R-4.5-winNOTEOct 30 2024
R-4.5-linuxNOTEOct 30 2024
R-4.4-winNOTEOct 30 2024
R-4.4-macNOTEOct 30 2024
R-4.3-winNOTEOct 30 2024
R-4.3-macNOTEOct 30 2024

Exports:cc_compositioncc_statsget_adj_matrixget_ccpeptide_statsplot_ccread_inc_matrixreduce_inc_matrixtranscriptome_filter

Dependencies:BiocGenericsdata.tablegraphlatticemagrittrMatrix

An introduction to net4pg

Rendered fromIntro_To_net4pg.Rmdusingknitr::rmarkdownon Oct 30 2024.

Last update: 2021-09-20
Started: 2021-09-16