Skip to content

Qrouger/PPIFold

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PPIFold

Automatised pipeline for massive PPI prediction and figure creation.

This python package is use to generate PPI prediction on lot of system, without lost time in generate initial file and sort result. It predict the better homo-oligomer for protein and better interface to interact with specific proteins.
This allow to predict massive multimer complexe with lot of PPI.

Requirements

Installation

You need to install AlphaPulldown 1.0.4 with his database, SignalpP5 and python 3.10.

Pipeline

pipeline

Initial File

The initial file need to be Uniprot ID, separate by a comma and in a ".txt" file.

Ex : UniprotID1,UniprotID2,UniprotID3...

Arguments

python main.py --txt_name string --data_dir Path --dir_alpha_analysis Path --make_multimers Boolean --env_multimer string --env_feature string --max_aa integer --use_signalP Boolean

Mandatory

--txt_name Name of the file who contains all Uniprot ID
--data_dir Path to the directory with all alphafold database
--dir_alpha_analysis Path to singularity image

Optional

--make_multimers This argument is True by default, if you just want make feature you have to set it on False
--env_feature The name of the conda environment need to make feature with AlphaPulldown, set by default on None
--env_multimer The name of the conda environment need to make multimer with AlphaPulldown, set by default on None
--max_aa The maximum lenght of a model generable by your GPU (in function of Vram), set by default on 2400 (24 Go)
--use_signalP Use SignalP if your protéins can be periplasmic, set by default on True

Result

This pipeline have a cutoff on PAE (10), iQ-score (25) and hiQ-score (50).

Figures

MSA depth
Residue interaction table
Distogram
Interaction network

Other Files

OOM file
A text file who contains too large interaction in function of --max_aa.

Bad MSA
A text file who contains proteins with MSA depth lower than 100 sequences.
/!\Result for proteins with less than 100 sequences in MSA is not accurate for validate or invalidate predict PPI.

Packages versions :
numpy: 2.0.1
matplotlib: 3.9.1
Bio: 1.7.1
networkx: 3.3

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages