Functional Annotation and Molecular modeling of Hypothetical Proteins (HPs) from P.aeruginosa plasmid pUM505: An In silico Approach

Srikant Awasthi; Pragya Saxena; Hillol Chakdar; Alok K Srivastava; Salman Akhtar*

doi:10.17577/IJERTV9IS050015

Volume 09, Issue 05 (May 2020)

Functional Annotation and Molecular modeling of Hypothetical Proteins (HPs) from P.aeruginosa plasmid pUM505: An In silico Approach

DOI : 10.17577/IJERTV9IS050015

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 225
Authors : Srikant Awasthi , Pragya Saxena , Hillol Chakdar , Alok K Srivastava, Salman Akhtar*
Paper ID : IJERTV9IS050015
Volume & Issue : Volume 09, Issue 05 (May 2020)
Published (First Online): 07-05-2020
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Functional Annotation and Molecular modeling of Hypothetical Proteins (HPs) from P.aeruginosa plasmid pUM505: An In silico Approach

Srikant Awasthi1-2, Pragya Saxena2, Hillol Chakdar2, Alok K Srivastava2 and Salman Akhtar1*

1Department of Bioengineering, Integral University, Lucknow, INDIA, 226026

2Microbial Genomics Laboratory, National Bureau of Agriculturally Important Microorganisms (NBAIM), Mau, Uttar Pradesh, INDIA, 275101

Dr. Salman Akhtar Associate Professor

Department of Bioengineering Integral University, Lucknow, India 226026

Abstract:- Structural and function annotation of P. aeruginosa plasmid pUM505isessentially required to facilitate the understanding of mechanisms of pathogenesis and biochemical pathways important for selecting novel therapeutic target. In present study, randomly selected twelve hypothetical protein sequence of P. aeruginosa plasmid pUM505 has been annotated using various In-Silico tools and databases to determine domain family, solubility of protein, ligand binding sites etc. Six out of 12 proteins have been putatively annotated, in which four have been annotated with high confidence. Physio-chemical characterization revealed two proteins are stable. The three-dimensional structure of two important annotated proteins were modeled and their ligand binding sites were identified. Domains and families for six proteins have been found. The analysis revealed that these proteins have antitoxin activity, integrase enzyme activity, conjugal DNA transfer activity, etc. Structural prediction of these proteins and detection of binding sites from this study would indicate a potential target aiding docking studies for therapeutic designing against various diseases.

Keywords: Hypothetical Protein, Functional Annotation, Molecular Modeling, Docking

INTRODUCTION

Pseudomonas aeruginosa, a gram-negative bacteriumis well known for its environmental versatility. Diverse growing habitat includes soil,coastal marine, plant and animal tissues (Khan et al., 2007). P. aeruginosais also well known for its multidrug resistant and is a global threat towards many infections disease. P. aeruginosais a major opportunistic pathogen in humans, causing serious complications caused by infections in patients particularly susceptible like people with immune system deficiencies, victims of skin burns, catheterized patients who suffer urinary tract infections and patients with respirators, causing nosocomial pneumonia (Lyczak et al., 2000). It is the major cause of mortality in patients with cystic fibrosis colonizing the lungs (Williams et al., 2010). Role of plasmids in antibiotic resistance are well establish.Plasmids are circular deoxyribonucleic acid molecules that exist in bacteria, usually independent of the chromosome. The study of plasmids is important to medical microbiology because plasmids can encode genes for antibiotic resistance or virulence factors (Wang et al., 1988). The pUM505 plasmid contains a genomic island with sequence similar to islands found in chromosomes of virulent P. aeruginosa clinical isolates. Plasmid pUM505 contains several genes that encode virulence factors, suggesting that the plasmid may contribute directly to bacterial virulence (RodrÃguezetal., 2016). The bacterium's virulence depends on a large number of cell-associated and extracellular factors. The virulence factors play an important pathological role in the colonization, the survival of the bacteria and the invasion of tissues(Wang and Gui, 2013).

Due to cost-efficiency throughput of genome sequencing has increased enormously resulting thousands of bacterial genomes now available and this number is increasing enormously day by day.Functional annotation of proteomes is a demanding problem(Roberts et al., 2004). A large fraction of proteins is still labeled as hypothetical protein,unknownfunction or with similar terms that imply that there is no functional indication for the ORF.Function annotation of putative uncharacterized HPs for their possible biological activity has emerged as an important focus for computational biology (Kumar et al., 2014; Loewenstein et al., 2009; Shahbaaz et al., 2013).The pUM505 sequence contained 138 complete coding regions, the majority of them encoded on the complementary DNA strand (75%), with respect to the predicted origin of replication (oriV). Most of the identified genes (46%) encode hypothetical proteins (HSPs). Proper structural and functional determination of this huge fraction (46%) is very important to reveal complete understanding of virulence mechanism in P. aeruginosa.Therefor an improved functional annotation of its proteome is of particular urgency. In present study, 12 randomly selected HPs from P.aeruginosa plasmid pUM505 have been annotated with the help of various bioinformatics resources. Moreover, two important annotated HPS have been structurally modeled and characterized.
MATERIALS AND METHODS
MetaPocket 2.0(http://projects.biotec.tu-dresden.de/metapocket/help.php)was used to find out the ligand binding sites.Proteins are primarily scanned for ligands and it uses the interaction energy between the protein and a simple van der Waals probe to locate vigorously favorable binding sites (Zhang et al., 2011).
RESULTS AND DISCUSSION
CONCLUSIONS

Our primary sequence-based analysis led to the identification of two HPs as biologically significant, which might be involved as enzymes (antitoxins, conjugal DNA transferase, and oxido-reductase etc.). Furthermore, we successfully predicted the structure of two HPs to describe their functions at the molecular level. The outcome of the present study may facilitate better understanding of the mechanism of virulence, drug resistance, pathogenesis, adaptability to host, tolerance for host immune response and drug discovery for treatment of infections caused by P. aeruginosa.

ACKNOWLEDGEMENTS

The authors gratefully acknowledge the financial assistance under project Application of Microorganisms in Agriculture and Allied Sectors (AMAAS) from Indian Council of Agricultural Research (ICAR), India. The authors further acknowledge the Integral University, Lucknow for providing the necessary support for the completion of this study and allotting its manuscript communication ID.

Conflict of interest: The authors declare that they have no conflict of interest.

REFERENCES

Arcus, V.L., McKenzie, J.L., Robson, J., Cook, G.M., 2011. The PIN-domain ribonucleasesand the prokaryotic VapBC toxin-antitoxin array.Protein Eng. Des. Sel. 24 (1-2), 3340.
Bateman,A., Coin, L., Durbin, R., Finn, R. D., Hollich, V., GriffithsJones,S., &Studholme, D. J., 2004. The Pfam protein families database.Nucleic acids research, 32(suppl 1), D138-D141.
Bork, P.,Koonin, E.V.,1996.Protein sequence motifs. Current opinion instructural biology, 6(3), 366-376.
Gardy, J. L., Spencer, C., Wang, K., Ester, M., Tusnady, G. E., Simon, I., Brinkman, F. S., 2003. PSORT-B: Improving protein subcellular localization prediction for Gram-negative bacteria. Nucleic acids research,31(13), 3613-3617.
DÃaz, D. A., Barreto, G. E., Santos, J. G., 2014. Structural and Functional Prediction of the Hypothetical Protein Pa2481 in Pseudomonas Aeruginosa Pao1. Advances in Computational Biology, Springer International Publishing (pp. 47-55).
Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S. E., Wilkins, M. R., Appel, R. D., Bairoch, A., 2005. Protein identification and analysis tools on the ExPASy server. Humana Press,(pp. 571-607).
Kelley, L. A., Sternberg, M. J., 2009. Protein structure prediction on the Web: a case study using the Phyre server. Nature protocols,4(3), 363-371.
Kiefer, F., Arnold, K., KÃ¼nzli, M., Bordoli, L., Schwede, T., 2009.The SWISS-MODEL Repository and associated resources. Nucleic acids research,37(suppl 1), D387-D392.
Khan, N. H., Ishii, Y., Kimata-Kino, N., Esaki, H., Nishino, T., Nishimura, M., Kogure, K., 2007. Isolation of Pseudomonas aeruginosa from open ocean and comparison with freshwater, clinical, and animal isolates. Microbial ecology, 53(2), 173-186.
Kumar K., Prakash A., Tasleem M., Islam A., Ahmad F., Hassan M.I., 2014. Functional annotation of putative hypothetical proteins from Candida

dubliniensis. Gene 543,93100.
Laskowski, R. A.,MacArthur, M. W., Moss, D. S., Thornton, J. M., 1993.PROCHECK: a program to check the stereochemical quality of proteinstructures. Journal of applied crystallography, 26(2), 283-291.
Laskowski, R. A., Rullmann, J. A. C., MacArthur, M. W., Kaptein, R., Thornton, J. M., 1996. AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. Journal of biomolecular NMR, 8(4), 477-486.
Loewenstein, Y., Raimondo, D., Redfern, O.C., Watson, J., Frishman D., Linial M., Orengo C., Thornton J., Tramontano A., 2009. Protein function annotation by homology-based inference. Genome Biology, 10,207.
Li, M., Wang, B., 2007. Homology modeling and examination of the effect of the D92E mutation on the H5N1 nonstructural protein NS1 effector domain. Journal of molecular modeling, 13(12), 1237-1244.
Lyczak. J. B., Cannon, C. L., Pier, G. B., 2000. Establishmentof Pseudomonas aeruginosa infection: lessons from a versatile opportunist. Microbiology Infection, 2,10511060.
Marchler-Bauer, A., Lu, S., Anderson, J. B., Chitsaz, F., Derbyshire, M. K., DeWeese-Scott, C.,Gwadz, M., 2001. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic acids research,39(suppl 1), D225-D229.
Mayer, L. W., 1988. Use of plasmid profiles in epidemiologic surveillance of disease outbreaks and in tracing the transmission of antibiotic resistance.Clinical microbiology reviews, 1(2), 228-243.
Moore, D., Maneewannakul, K., Maneewannakul, S., Wu, J. H., Ippen-Ihler, K., Bradley, D. E., 1990. Characterization of the F-plasmid conjugative transfer gene traU, Journal of Bacteriology,172, 4263-4270.
Powell, S., Szklarczyk, D., Trachana, K., Roth, A., Kuhn, M., Muller, J., Jensen, L. J.,2012.eggNOG v3. 0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic acids research, 40(D1), D284-D289.
Roberts, R. J., 2004. Identifying protein function -a call for community action. PloS 2, E42.
RodrÃguez-Andrade, E., HernÃ¡ndez-RamÃrez, K. C., DÃaz-PerÃ©z, S. P., DÃaz-MagaÃ±a, A., ChÃ¡vez-Moctezuma, M. P., Meza-Carmen, V., RamÃrez-DÃaz,

M. I., 2016. Genes from pUM505 plasmid contribute to Pseudomonas aeruginosa virulence. Antonievan Leeuwenhoek, 109(3), 389-396.
Shahbaaz M., Hassan, M. I., Ahmad, F., 2013. Functional annotation of conserved hypothetical proteins from HaemophilusinfluenzaeRd KW20. PLoS ONE 8, e84263.
Wang, H., Tu, F.,Gui, Z., 2013. Virulence factors in Pseudomonas aeruginosa: mechanisms and modes of regulation. Indian Journal of Microbiology, 2, 163-167.
Williams, B.J., Dehnbostel, J., Blackwell, T.S., 2010. Pseudomonas aeruginosa: host defence in lung diseases. Respirology 15, 1037 1056.
Yu, C. S., Cheng, C. W., Su, W. C., Chang, K. C., Huang, S. W., Hwang, J. K., Lu, C. H.,2014. CELLO2GO: a web server for protein subCELlularLOcalization prediction with functional gene ontology annotation. PloS one,9(6), e99368.
Zhang, Z., Li, Y., Lin, B., Schroeder, M., Huang, B., 2011. Identification of cavities on protein surface using multiple computational approaches for drug binding siteprediction. Bioinformatics, 27(15), 2083-2088.

FIGURE LEGENDS

Fig1. (a-b) (a)Three-dimensional structure of HP38 model and (b) their stereo-chemical property by Ramchandran plot. All the residues are in most favored region

Fig2 (a-b)(a)Three-dimensional structure of HP80 model and (b) their stereo-chemical propertyby Ramachandran plot. All the residues are in most favored region

Fig. 3 (a-b)The secondary structure of (a) HP38 and (b) HP80 showing helices, b-Sheets and b-hairpins. Fig. 4 (a-b)Model quality estimation plot obtained by ERRAT server for (a) model HP38 and (b) model HP80. Fig4(c-d).Active sites (shown in balls) identified in protein (c) HP38 and (d) HP80.

Fig1. (a-b):- (a)Three dimensional structure of HP38 model and (b) their stereo-chemical property by Ramchandran plot. All the residues are in most favored reg
1. (b)

Fig2 (a-b):-(a)Three dimensional structure of HP80 model and (b) their stereo-chemical propertyby Ramachandran plot.

Fig. 3 (a-b)The secondary structure of (a) HP38 and (b) HP80 showing helices, b-Sheets and b-hairpins.

(b)

Fig. 4 (a-b):-Model quality estimation plot obtained by ERRAT server for (a) model HP38 and (b) model HP80
1. (b)

Fig4(c-d).Active sites (shown in balls) identified in protein (c) HP38 and (d) HP80

(c) (d)

Tables

Table1. Predicted Functions of HPs in P.aeruginosa plasmid pUM505

S.No.	GenBank ID	Functional Annotation
S.No.	GenBank ID	EGGNOG	Pfam
1.	YP_004927991.1	Function Unknown	Not found
2.	YP_004928098.1	No Ortholog	Not found
3.	YP_004927980.1	Nucleoid-associated protein	NA-37 family
4.	YP_004928038.1	PIN3 domain protein	PIN3 family
5.	YP_004928012.1	Secreted protein	DUF2895
6.	YP_004927986.1	No Ortholog	Not found
7.	YP_004927975.1	Function Unknown	Not found
8.	YP_004927989.1	phage protein	HNH endonuclease
9.	YP_004928003.1	TraU	TraU family
1	YP_004928109.1	Function Unknown	DUF 1845 (family of unknown function)
1	YP_004927994.1	No Ortholog	Not found
1	YP_004928060.1	Function Unknown	DUF 1302 (family of unknown function)

Table2.Subcellular localization of HPs predicted by different bioinformatics tool

S.No	GeneBank ID	Sub-cellular localization			SignalPeptide	Secretory Protein
S.No	GeneBank ID	PSORT B	PSLpred	CELLO	SignalPeptide	Secretory Protein
1	YP_004927991.1	Cytoplasmic	Cytoplasmic protein	Cytoplasmic	No	No
2	YP_004928098.1	Cytoplasmic	Cytoplasmic protein	Cytoplasmic	No	No
3	YP_004928060.1	Outer membrane	Inner membrane protein	Outer membrane	Yes	No
4	YP_004928038.1	Cytoplasmic	Cytoplasmic protein	Cytoplasmic	No	No
5	YP_004927994.1	Cytoplasmic	Periplasmic protein	Cytoplasmic	No	No
6	YP_004927986.1	Unknown	Cytoplasmic	Cytoplasmic	No	No
7	YP_004927980.1	Cytoplasmic	Inner membrane protein	Cytoplasmic	No	No
8	YP_004927975.1	Cytoplasmic	Inner membrane protein	Cytoplasmic	No	Yes
9	YP_004927989.1	Cytoplasmic	Cytoplasmic protein	Cytoplasmic	No	No
10	YP_004928003.1	Unknown	Cytoplasmic protein	Extracellular	Yes	Yes
11	YP_004928012.1	Cytoplasmic	Inner-membrane protein	Cytoplasmic	No	No
12	YP_004928109.1	Cytoplasmic	Inner membrane protein	Cytoplasmic	No	No

Table 3. MetaPocket clusters and their unctional residues

HP Model	Pocket No.	z-score	Pocket Sites
HP38 (YP_004928038.1)	1	11.99	'GHE-1', 'SFN-1', 'LCS-1', 'FPK-1', 'PAS-2', 'CON-1'
	2	3.52	'PAS-1', 'GHE-2'
	3	1.49	'LCS-2'
	4	0.93	'FPK-2', 'PAS-3', 'LCS-3', 'CON-2', 'GHE-3'
	5	0.74	'FPK-3'
	6	0.18	'SFN-2'
	7	0.04	'SFN-3'
HP80 (YP_004927980.1)	1	18.38	'GHE-1', 'SFN-1', 'LCS-1', 'FPK-1', 'PAS-1', 'CON-1'
	2	4.56	'PAS-2','FPK-2', 'LCS-2','SFN-2', 'GHE-2'
	3	1.49	'FPK-3', 'SFN-3'
	4	-0.02	'PAS-3', 'LCS-3', 'GHE-3'

Supplementary Tables

Table 1. Supplementary table S1.Physiochemical characterization of HPs protein

S.No.	Gene bank Accession Number	Sequence length	M. wt	pI	R	+ R	EC	II	Protein Claas	AI	GRAVY
1.	YP_004927991.1	228	24996.02	4.80	32	22	50460	42.42	unstable	78.86	-0.375
2.	YP_004928098.1	110	12894.83	4.71	20	12	11920	59.63	unstable	101.18	-0.205
3.	YP_004928060.1	606	65668.79	4.58	66	41	106480	17.53	stable	77.81	-0.271
4.	YP_004928038.1	136	15199.66	6.73	17	17	9970	44.48	unstable	113.38	-0.267
5.	YP_004927994.1	245	27858.82	8.87	35	39	25565	57.76	unstable	94.86	-0.543
6.	YP_004927986.1	206	22816.97	6.12	21	17	40575	68.31	unstable	99.51	-0.127
7.	YP_004927980.1	340	38425.87	5.25	53	39	35870	48.97	unstable	76.12	-0.611
8.	YP_004927975.1	443	49901.31	7.18	56	56	52035	48.98	unstable	78.42	-0.691
9.	YP_004927989.1	242	27935.31	9.93	25	43	36690	45.12	unstable	81.03	-0.683
1	YP_004928003.1	312	33486.89	8.06	23	25	65360	24.16	Stable	76.38	-0.101
1	YP_004928012.1	219	25508.95	6.53	29	28	53650	47.42	unstable	76.58	-0.495
1	YP_004928109.1	289	32660.03	5.70	42	36	39545	51.49	unstable	84.15	-0.410

Functional Annotation and Molecular modeling of Hypothetical Proteins (HPs) from P.aeruginosa plasmid pUM505: An In silico Approach

Fig. 3 (a-b)The secondary structure of (a) HP38 and (b) HP80 showing helices, b-Sheets and b-hairpins.

Fig. 4 (a-b):-Model quality estimation plot obtained by ERRAT server for (a) model HP38 and (b) model HP80

Fig4(c-d).Active sites (shown in balls) identified in protein (c) HP38 and (d) HP80

Leave a Reply