Skip to content

Commit 2b199cf

Browse files
committed
update readme
1 parent cf26aa9 commit 2b199cf

File tree

1 file changed

+28
-25
lines changed

1 file changed

+28
-25
lines changed

README.md

Lines changed: 28 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -16,13 +16,15 @@ Ligand-binding site prediction based on machine learning.
1616

1717
### Description
1818

19-
P2Rank is a stand-alone command line program that predicts ligand-binding pockets from a protein structure. It achieves high prediction success rates without relying on an external software for computation of complex features or on a database of known protein-ligand templates.
19+
P2Rank is a stand-alone command line program that predicts ligand-binding pockets from a protein structure.
20+
It achieves high prediction success rates without relying on an external software for computation of complex features
21+
or on a database of known protein-ligand templates.
2022

2123
### What's new?
2224

23-
* Version 2.5 brings speed optimizations (~2x faster prediction), ChimeraX visualizations and improvements to rescoring (`fpocket-rescore` command).
24-
* Version 2.4.2 adds support for BinaryCIF (`.bcif`) input and rescoring of fpocket predictions in `.cif` format.
25-
* Version 2.4 adds support for mmCIF (`.cif`) input and contains a special profile for predictions on AlphaFold models and NMR/cryo-EM structures.
25+
* Version **2.5** brings speed optimizations (~2x faster prediction), ChimeraX visualizations, and improvements to rescoring (`fpocket-rescore` command).
26+
* Version **2.4.2** adds support for BinaryCIF (`.bcif`) input and rescoring of fpocket predictions in `.cif` format.
27+
* Version **2.4** adds support for mmCIF (`.cif`) input and contains a special profile for predictions on AlphaFold models and NMR/cryo-EM structures.
2628

2729
### Requirements
2830

@@ -62,7 +64,7 @@ If you use P2Rank, please cite relevant papers:
6264
* [Software article](https://doi.org/10.1186/s13321-018-0285-8) about P2Rank pocket prediction tool
6365
Krivak R, Hoksza D. ***P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure.*** Journal of Cheminformatics. 2018 Aug.
6466
* [A new web-server article](https://doi.org/10.1093/nar/gkac389) about updates in the web interface [prankweb.cz](https://prankweb.cz)
65-
Jakubec D, Skoda P, Krivak R, Novotny M, Hoksza D ***PrankWeb 3: accelerated ligand-binding site predictions for experimental and modelled protein structures.*** Nucleic Acids Research, Volume 50, Issue W1, 5 July 2022, Pages W593–W597
67+
Jakubec D, Skoda P, Krivak R, Novotny M, Hoksza D. ***PrankWeb 3: accelerated ligand-binding site predictions for experimental and modelled protein structures.*** Nucleic Acids Research, Volume 50, Issue W1, 5 July 2022, Pages W593–W597
6668
* [Web-server article](https://doi.org/10.1093/nar/gkz424) introducing the web interface at [prankweb.cz](https://prankweb.cz)
6769
Jendele L, Krivak R, Skoda P, Novotny M, Hoksza D. ***PrankWeb: a web server for ligand binding site prediction and visualization.*** Nucleic Acids Research, Volume 47, Issue W1, 02 July 2019, Pages W345-W349
6870
* [Conference paper](https://doi.org/10.1007/978-3-319-21233-3_4) introducing P2Rank prediction algorithm
@@ -104,17 +106,17 @@ prank predict -c alphafold test.ds # use alphafold config and model (confi
104106

105107
### Prediction output
106108

107-
For each structure file `<struct_file>` in the dataset P2Rank produces several output files:
108-
* `<struct_file>_predictions.csv`: contains an ordered list of predicted pockets, their scores, coordinates
109-
of their centers together with a list of adjacent residues, list of adjacent protein surface atoms, and a calibrated probability of being a ligand-binding site
110-
* `<struct_file>_residues.csv`: contains list of all residues from the input protein with their scores,
111-
mapping to predicted pockets, and a calibrated probability of being a ligand-binding residue
109+
For each structure file `{struct_file}` in the dataset P2Rank produces several output files:
110+
* `{struct_file}_predictions.csv`: contains an ordered list of predicted pockets, their scores, coordinates
111+
of their centers together with a list of adjacent residues, list of adjacent protein surface atoms, and a calibrated probability of being a ligand-binding site.
112+
* `{struct_file}_residues.csv`: contains a list of all residues from the input protein with their scores,
113+
mapping to predicted pockets, and a calibrated probability of being a ligand-binding residue.
112114
* PyMol and ChimeraX visualizations in `visualizations/` directory (`.pml` and `.cxc` scripts with data files in `data/`)
113-
* generating visualizations can be turned off by `-visualizations 0` parameter
114-
* `-vis_renderers 'pymol,chimerax'` parameter can be used to turn individual visualization renderers on/off
115-
* coordinates of SAS points can be found in `visualizations/data/<struct_file>_points.pdb.gz`. There the "Residue sequence number" (23-26 of HETATM record)
116-
corresponds to the rank of the corresponding pocket (points with value 0 don't belong to any pocket)
117-
* `-vis_copy_proteins 0` parameter can be used to turn off copying of protein structures to the visualizations directory (faster but visualizations won't be portable)
115+
* Generating visualizations can be turned off with the `-visualizations 0` parameter
116+
* `-vis_renderers 'pymol,chimerax'` parameter can be used to turn individual visualization renderers on/off.
117+
* `-vis_copy_proteins 0` parameter can be used to turn off copying of protein structures to the visualizations directory (faster, but visualizations won't be portable).
118+
* Coordinates and ligandability scores of SAS points can be found in `visualizations/data/{struct_file}_points.pdb.gz`. Here, the "Residue sequence number" (23-26 of HETATM record)
119+
is the rank of the corresponding pocket (0 means the point doesn't belong to any pocket) and the b-factor column corresponds to the ligandability score.
118120

119121

120122
### Configuration
@@ -162,8 +164,8 @@ and PUResNetV2.0
162164
are supported at the moment).
163165

164166
Rescoring output:
165-
* `<struct_file>_rescored.csv`: list of pockets sorted by the new score
166-
* `<struct_file>_predictions.csv`: same as with `prank predict` (since 2.5)
167+
* `{struct_file}_rescored.csv`: list of pockets sorted by the new score
168+
* `{struct_file}_predictions.csv`: same as with `prank predict` (since 2.5)
167169
* Note: probability column is calibrated for rescoring fpocket predictions
168170
* visualizations
169171

@@ -175,25 +177,26 @@ prank rescore fpocket.ds -c rescore_2024 # use new experimental rescoring model
175177
prank eval-rescore fpocket.ds # evaluate rescoring model on a dataset with known ligands
176178
~~~
177179

178-
For rescoring the dataset file needs to have a specific 2-column format. See examples in `test_data/`: `fpocket.ds`, `concavity.ds`, `puresnet.ds`.
180+
For rescoring, the dataset file needs to have a specific 2-column format. See examples in `test_data/`: `fpocket.ds`, `concavity.ds`, `puresnet.ds`.
179181

180182
New experimental rescoring model `-c rescore_2024` shows promising result but hasn't been fully evaluated yet. It is recommended for AlphaFold models, NMR and cryo-EM structures since it doesn't depend on b-factor as a feature.
181183

182184

183-
184185
#### Run fpocket and rescore in one command
185186

186-
You can use `fpocket-rescore` command to run fpocket and then rescore its predictions automatically.
187+
You can use `fpocket-rescore` command to run [Fpocket](https://github.com/Discngine/fpocket) and then rescore its predictions automatically.
187188

188189
~~~bash
189-
prank fpocket-rescore test.ds
190+
prank fpocket-rescore test.ds # expects 'fpocket' command in PATH
190191
prank fpocket-rescore test.ds -fpocket_command "/bin/fpocket -w m" # specify custom fpocket command (optionally with arguments)
191-
prank fpocket-rescore test.ds -fpocket_keep_output 0 # don't keep fpocket output files
192+
prank fpocket-rescore test.ds -fpocket_keep_output 0 # delete fpocket output files
192193
~~~
193194

194-
In this case dataset file doesn't need to have the 2-column format.
195-
`prank fpocket-rescore` can be used as in-place replacement of `prank predict` command.
196-
Note: if you use `fpocket-rescore`, please cite fpocket paper as well.
195+
In this case, the dataset file can be a simple list of pdb/cif files since Fpocket predictions will pe calculated ad-hoc.
196+
`prank fpocket-rescore` will produce `predictions.csv` as well, so it can be used as an in-place replacement for `prank predict` in most scenarios.
197+
Note: if you use `fpocket-rescore`, please cite Fpocket paper as well.
198+
199+
197200

198201
## Build from sources
199202

0 commit comments

Comments
 (0)