The failure to produce and/or crystallize proteins is often due to their modular structure. There exists therefore considerable interest to develop strategies for tailoring proteins into crystallizable domains. In the framework of a Structural Genomics Project on soluble yeast proteins, we have tested the expression of numerous genetic constructs of our targets in order to produce and crystallize proteins and protein domains and solve their three-dimensional structure. In some cases, the choice of the domain boundaries was guided by prediction from sequence using various software packages, including Prelink, a home-made prediction method for detecting unfolded regions. In other cases, large numbers of constructs were generated using molecular biology or biochemical methods. In this paper, we analyze the results of the over-expression in E. coli and crystallization of these constructs, and compare these with the predictions that can be obtained from our software and from others.
Keywords: Protein domain, disorder prediction, protein expression, crystallization