Using synthetic data to evaluate multiple regression and principal component analyses for statistical modeling of daily building energy consumption

T. A. Reddy, D. E. Claridge

Research output: Contribution to journalArticle

41 Scopus citations


Multiple regression modeling of monitored building energy use data is often faulted as a reliable means of predicting energy use on the grounds that multicollinearity between the regressor variables can lead both to improper interpretation of the relative importance of the various physical regressor parameters and to a model with unstable regressor coefficients. Principal component analysis (PCA) has the potential to overcome such drawbacks. While a few case studies have already attempted to apply this technique to building energy data, the objectives of this study were to make a broader evaluation of PCA and multiple regression analysis (MRA) and to establish guidelines under which one approach is preferable to the other. Four geographic locations in the US with different climatic conditions were selected and synthetic data sequences representative of daily energy use in large institutional buildings were generated in each location using a linear model with outdoor temperature, outdoor specific humidity and solar radiation as the three regression variables. MRA and PCA approaches were then applied to these data sets and their relative performances were compared. Conditions under which PCA seems to perform better than MRA were identified and preliminary recommendations on the use of either modeling approach formulated.

Original languageEnglish (US)
Pages (from-to)35-44
Number of pages10
JournalEnergy and Buildings
Issue number1
StatePublished - 1994
Externally publishedYes


ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Building and Construction
  • Mechanical Engineering
  • Electrical and Electronic Engineering

Cite this