Analyzing Virus Gene Expression Data to Understand Regulatory Interactions


Funding and Staffing Details

This project is funded by a 3-year BBSRC/EPSRC grant under the Bioinformatics Initiative. The project is a collaboration between Xiaohui Liu of the Intelligent Data Analysis Group in the Department of Information Systems and Computing at Brunel University, Nigel Martin of the Database and Web Technologies Group in the Department of Computer Science and Information Systems at Birkbeck, Paul Kellam of the Windeyer Institute of Medical Sciences at UCL, and Christine Orengo of the Biomolecular Structure & Modelling Group in the Department of Biochemistry and Molecular Biology at UCL.


Project Aims

The project seeks to understand how to determine the genetic network of molecular interactions by applying multivariate time series (MTS) methods to the modelling of gene expression data, particularly in the virology domain. The Viral Genomics and Bioinformatics Group at UCL have already successfully produced an array of all known and putative open reading frames of human herpesvirus 8, and in collaboration with the Database Technology and Bioinformatics Groups at Birkbeck College are developing a database of viral protein families (VIDA). This project will extend VIDA with an expression array component, examine the data by applying clustering algorithms and related data pre-processing techinques, and construct models for understanding gene regulation and interaction. Two multivariate time series methods will be used: the Vector Auto-Regressive Process and Dynamic Bayesian Networks. Finally, relevant protein structures, functions, and transcriptional control mechanisms will be used to validate the models.


Project Publications

Comparing, Contrasting and Combining Clusters in Viral Gene Expression Data, P Kellam, X Liu, N.J.Martin, C.Orengo, S.Swift, A.Tucker, Proc. 6th Workshop on Intelligent Data Analysis in Medicine and Pharmacology, 56-62, (2001).

A Framework for Modelling Virus Gene Expression Data, P Kellam, X Liu, N.J.Martin, C.Orengo, S.Swift, A.Tucker, Intelligent Data Analysis, 6(3), 218-227, (2002).

Consensus Clustering and Functional Interpretation of Gene Expression Data, S Swift, A Tucker, V Vinciotti, N Martin, C Orengo, X Liu, P Kellam, Genome Biology, 5:R94 (2004).