Motivations: Classification of biological samples for diagnostic purposes is a difficult task because of the many decisions involved on the number, type and functional manipulations of the input variables. This study presents a generally applicable strategy for systematic formulation of optimal diagnostic indexes. To this end, we develop a novel set of computational tools by integrating regression optimization, stepwise variable selection and cross-validation algorithms. Results: The proposed discrimination methodology was applied to plasma and tissue (liver) metabolic profiling data describing the time progression of liver dysfunction in a rat model of acute hepatic failure generated by D-galactosamine (GaIN) injection. From the plasma data, our methodology identified seven (out of a total of 23) metabolites, and the corresponding transform functions, as the best inputs to the optimal diagnostic index. This index showed better time resolution and increased noise robustness compared with an existing metabolic index, Fischer's BCAA/AAA molar ratio, as well as indexes generated using other commonly used discriminant analysis tools. Comparison of plasma and liver indexes found two consensus metabolites, lactate and glucose, which implicate glycolysis and/or gluconeogenesis in mediating the metabolic effects of GaIN.
ASJC Scopus subject areas
- Statistics and Probability
- Molecular Biology
- Computer Science Applications
- Computational Theory and Mathematics
- Computational Mathematics