|
Functional Genomics Characterization of Arabidopsis Genes
|
|
Large-scale fluorescent tagging of full-length genes to characterize native expression patterns and subcellular targeting of Arabidopsis proteins of unknown function A large gap remains in our understanding of the function of a very significant portion of Arabidopsis gene products. This project, if funded, will begin filling this gap by systematically analyzing a large number of the functionally-unassigned genes by Fluorescent Tagging of their Full-Length Protein products (FTFLP). The proposed research will generate important information and tools to characterize the Arabidopsis proteome by seeking three specific aims ;
Based on the most recent data on the Arabidopsis genome sequence and its annotations, we have identified 8,293 genes annotated as "unknown protein", "putative protein", or "waiting for functional annotation". We applied to this list a series of filters to identify most suitable candidate genes for characterization by our FTFLP approach : see Table 1. We sought to select a short list of 4,000 genes that are maximally diverse and therefore representative of most of the unassigned Arabidopsis sequences : see Table 2. UPDATE July 5, 2002 Upon recommendation from the grant reviewers, we are scaling down to work on ca. 800 genes as a pilot study. From the 4000 genes we chose in December 2001, we selected 800 on the following criteria: 1. must have a full length cDNA and 2. do not have any Gene Ontology annotations. To maximize the diversity of the genes in the set, we preferentially chose the genes that are single-copies. Therefore, the chosen list of 855 (Table 2) contain more single-copy genes and a bit more plant-specific genes than the list of 4000, but are proportional in all other characteristics (e.g. predicted location and protein domains). UPDATE August 28, 2002 The list of 855 was split into three files of ~286 genes for each group. In addition, the total number of associated ESTs and the mean intensity (along with standard deviation and coefficient of variance) of all the AFGC microarray experiments (ca. 560 hybridizations) are included to provide a 'rough' idea of level of expression. Table 1. Identification of the "long list" of candidate genes for FTFLP characterization
Table 2. Identification of the "short list" of candidate genes for FTFLP characterization
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||