You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+28-25Lines changed: 28 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,13 +16,15 @@ Ligand-binding site prediction based on machine learning.
16
16
17
17
### Description
18
18
19
-
P2Rank is a stand-alone command line program that predicts ligand-binding pockets from a protein structure. It achieves high prediction success rates without relying on an external software for computation of complex features or on a database of known protein-ligand templates.
19
+
P2Rank is a stand-alone command line program that predicts ligand-binding pockets from a protein structure.
20
+
It achieves high prediction success rates without relying on an external software for computation of complex features
21
+
or on a database of known protein-ligand templates.
20
22
21
23
### What's new?
22
24
23
-
* Version 2.5 brings speed optimizations (~2x faster prediction), ChimeraX visualizations and improvements to rescoring (`fpocket-rescore` command).
24
-
* Version 2.4.2 adds support for BinaryCIF (`.bcif`) input and rescoring of fpocket predictions in `.cif` format.
25
-
* Version 2.4 adds support for mmCIF (`.cif`) input and contains a special profile for predictions on AlphaFold models and NMR/cryo-EM structures.
25
+
* Version **2.5** brings speed optimizations (~2x faster prediction), ChimeraX visualizations, and improvements to rescoring (`fpocket-rescore` command).
26
+
* Version **2.4.2** adds support for BinaryCIF (`.bcif`) input and rescoring of fpocket predictions in `.cif` format.
27
+
* Version **2.4** adds support for mmCIF (`.cif`) input and contains a special profile for predictions on AlphaFold models and NMR/cryo-EM structures.
26
28
27
29
### Requirements
28
30
@@ -62,7 +64,7 @@ If you use P2Rank, please cite relevant papers:
62
64
*[Software article](https://doi.org/10.1186/s13321-018-0285-8) about P2Rank pocket prediction tool
63
65
Krivak R, Hoksza D. ***P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure.*** Journal of Cheminformatics. 2018 Aug.
64
66
*[A new web-server article](https://doi.org/10.1093/nar/gkac389) about updates in the web interface [prankweb.cz](https://prankweb.cz)
65
-
Jakubec D, Skoda P, Krivak R, Novotny M, Hoksza D ***PrankWeb 3: accelerated ligand-binding site predictions for experimental and modelled protein structures.*** Nucleic Acids Research, Volume 50, Issue W1, 5 July 2022, Pages W593–W597
67
+
Jakubec D, Skoda P, Krivak R, Novotny M, Hoksza D.***PrankWeb 3: accelerated ligand-binding site predictions for experimental and modelled protein structures.*** Nucleic Acids Research, Volume 50, Issue W1, 5 July 2022, Pages W593–W597
66
68
*[Web-server article](https://doi.org/10.1093/nar/gkz424) introducing the web interface at [prankweb.cz](https://prankweb.cz)
67
69
Jendele L, Krivak R, Skoda P, Novotny M, Hoksza D. ***PrankWeb: a web server for ligand binding site prediction and visualization.*** Nucleic Acids Research, Volume 47, Issue W1, 02 July 2019, Pages W345-W349
@@ -104,17 +106,17 @@ prank predict -c alphafold test.ds # use alphafold config and model (confi
104
106
105
107
### Prediction output
106
108
107
-
For each structure file `<struct_file>` in the dataset P2Rank produces several output files:
108
-
*`<struct_file>_predictions.csv`: contains an ordered list of predicted pockets, their scores, coordinates
109
-
of their centers together with a list of adjacent residues, list of adjacent protein surface atoms, and a calibrated probability of being a ligand-binding site
110
-
*`<struct_file>_residues.csv`: contains list of all residues from the input protein with their scores,
111
-
mapping to predicted pockets, and a calibrated probability of being a ligand-binding residue
109
+
For each structure file `{struct_file}` in the dataset P2Rank produces several output files:
110
+
*`{struct_file}_predictions.csv`: contains an ordered list of predicted pockets, their scores, coordinates
111
+
of their centers together with a list of adjacent residues, list of adjacent protein surface atoms, and a calibrated probability of being a ligand-binding site.
112
+
*`{struct_file}_residues.csv`: contains a list of all residues from the input protein with their scores,
113
+
mapping to predicted pockets, and a calibrated probability of being a ligand-binding residue.
112
114
* PyMol and ChimeraX visualizations in `visualizations/` directory (`.pml` and `.cxc` scripts with data files in `data/`)
113
-
*generating visualizations can be turned off by`-visualizations 0` parameter
114
-
*`-vis_renderers 'pymol,chimerax'` parameter can be used to turn individual visualization renderers on/off
115
-
*coordinates of SAS points can be found in `visualizations/data/<struct_file>_points.pdb.gz`. There the "Residue sequence number" (23-26 of HETATM record)
116
-
corresponds to the rank of the corresponding pocket (points with value 0 don't belong to any pocket)
117
-
*`-vis_copy_proteins 0` parameter can be used to turn off copying of protein structures to the visualizations directory (faster but visualizations won't be portable)
115
+
*Generating visualizations can be turned off with the`-visualizations 0` parameter
116
+
*`-vis_renderers 'pymol,chimerax'` parameter can be used to turn individual visualization renderers on/off.
117
+
*`-vis_copy_proteins 0` parameter can be used to turn off copying of protein structures to the visualizations directory (faster, but visualizations won't be portable).
118
+
* Coordinates and ligandability scores of SAS points can be found in `visualizations/data/{struct_file}_points.pdb.gz`. Here, the "Residue sequence number" (23-26 of HETATM record)
119
+
is the rank of the corresponding pocket (0 means the point doesn't belong to any pocket) and the b-factor column corresponds to the ligandability score.
118
120
119
121
120
122
### Configuration
@@ -162,8 +164,8 @@ and PUResNetV2.0
162
164
are supported at the moment).
163
165
164
166
Rescoring output:
165
-
*`<struct_file>_rescored.csv`: list of pockets sorted by the new score
166
-
*`<struct_file>_predictions.csv`: same as with `prank predict` (since 2.5)
167
+
*`{struct_file}_rescored.csv`: list of pockets sorted by the new score
168
+
*`{struct_file}_predictions.csv`: same as with `prank predict` (since 2.5)
167
169
* Note: probability column is calibrated for rescoring fpocket predictions
168
170
* visualizations
169
171
@@ -175,25 +177,26 @@ prank rescore fpocket.ds -c rescore_2024 # use new experimental rescoring model
175
177
prank eval-rescore fpocket.ds # evaluate rescoring model on a dataset with known ligands
176
178
~~~
177
179
178
-
For rescoring the dataset file needs to have a specific 2-column format. See examples in `test_data/`: `fpocket.ds`, `concavity.ds`, `puresnet.ds`.
180
+
For rescoring, the dataset file needs to have a specific 2-column format. See examples in `test_data/`: `fpocket.ds`, `concavity.ds`, `puresnet.ds`.
179
181
180
182
New experimental rescoring model `-c rescore_2024` shows promising result but hasn't been fully evaluated yet. It is recommended for AlphaFold models, NMR and cryo-EM structures since it doesn't depend on b-factor as a feature.
181
183
182
184
183
-
184
185
#### Run fpocket and rescore in one command
185
186
186
-
You can use `fpocket-rescore` command to run fpocket and then rescore its predictions automatically.
187
+
You can use `fpocket-rescore` command to run [Fpocket](https://github.com/Discngine/fpocket) and then rescore its predictions automatically.
187
188
188
189
~~~bash
189
-
prank fpocket-rescore test.ds
190
+
prank fpocket-rescore test.ds# expects 'fpocket' command in PATH
0 commit comments