Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database

Apoorva Kaskhedikar; T Agami Reddy; George Runger

Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database

Apoorva Kaskhedikar, T Agami Reddy, George Runger

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Evaluating the extent to which an individual building consumes energy in excess of its peers is the first step in initiating energy efficiency improvements. Energy benchmarking offers such an assessment, albeit simplified. A widely used energy benchmarking tool is the Environmental Protection Agency's (EPA) Energy Star® Portfolio Manager (PM), which is based on the Commercial Buildings Energy Consumption Survey (CBECS) database. This tool relies on standard linear regression models between building energy use intensities (EUIs) and certain building shell characteristics and equipment types. The statistical significance of these models is rather poor, and they only include continuous building variables. In an effort to improve these models, we have investigated the use of a state-of-the-art, tree-based ensemble learning methodology, the random forest (RF) algorithm, to identify the most influential CBECS parameters that impact building EUI for medium office and school building types. Surprisingly, none of the building features turn out to be really influential. Many of the CBECS building features found to be ranked relatively high are not the ones included in the PM models, and the resulting RF models were only marginally better than the linear regression PM models. These findings cast doubts on the veracity (i.e., the accuracy and completeness) of the data contained in the CBECS database. More careful appraisal is warranted given the widespread use of this database for extracting meaningful generalized correlations between energy use and various building and related characteristics. The poor statistical significance of these correlations has important implications for ongoing building energy conservation policy instruments, such as energy reporting and disclosure legislation being enforced by numerous U.S. cities.

Original language	English (US)
Title of host publication	ASHRAE Transactions
Publisher	American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE)
Pages	17-28
Number of pages	12
ISBN (Electronic)	9781936504961
State	Published - 2015
Event	2015 ASHRAE Winter Conference - Chicago, United States Duration: Jan 24 2015 → Jan 28 2015

Publication series

Name	ASHRAE Transactions
Volume	121
ISSN (Print)	0001-2505

Other

Other	2015 ASHRAE Winter Conference
Country/Territory	United States
City	Chicago
Period	1/24/15 → 1/28/15

ASJC Scopus subject areas

Building and Construction
Mechanical Engineering

Cite this

Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database. / Kaskhedikar, Apoorva; Reddy, T Agami; Runger, George.
ASHRAE Transactions. American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE), 2015. p. 17-28 (ASHRAE Transactions; Vol. 121).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

@inproceedings{b5260222151748b6ac721e532026d13f,

title = "Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database",

abstract = "Evaluating the extent to which an individual building consumes energy in excess of its peers is the first step in initiating energy efficiency improvements. Energy benchmarking offers such an assessment, albeit simplified. A widely used energy benchmarking tool is the Environmental Protection Agency's (EPA) Energy Star{\textregistered} Portfolio Manager (PM), which is based on the Commercial Buildings Energy Consumption Survey (CBECS) database. This tool relies on standard linear regression models between building energy use intensities (EUIs) and certain building shell characteristics and equipment types. The statistical significance of these models is rather poor, and they only include continuous building variables. In an effort to improve these models, we have investigated the use of a state-of-the-art, tree-based ensemble learning methodology, the random forest (RF) algorithm, to identify the most influential CBECS parameters that impact building EUI for medium office and school building types. Surprisingly, none of the building features turn out to be really influential. Many of the CBECS building features found to be ranked relatively high are not the ones included in the PM models, and the resulting RF models were only marginally better than the linear regression PM models. These findings cast doubts on the veracity (i.e., the accuracy and completeness) of the data contained in the CBECS database. More careful appraisal is warranted given the widespread use of this database for extracting meaningful generalized correlations between energy use and various building and related characteristics. The poor statistical significance of these correlations has important implications for ongoing building energy conservation policy instruments, such as energy reporting and disclosure legislation being enforced by numerous U.S. cities.",

author = "Apoorva Kaskhedikar and Reddy, {T Agami} and George Runger",

note = "Publisher Copyright: {\textcopyright} 2015 ASHRAE.; 2015 ASHRAE Winter Conference ; Conference date: 24-01-2015 Through 28-01-2015",

year = "2015",

language = "English (US)",

series = "ASHRAE Transactions",

publisher = "American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE)",

pages = "17--28",

booktitle = "ASHRAE Transactions",

}

TY - GEN

T1 - Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database

AU - Kaskhedikar, Apoorva

AU - Reddy, T Agami

AU - Runger, George

PY - 2015

Y1 - 2015

N2 - Evaluating the extent to which an individual building consumes energy in excess of its peers is the first step in initiating energy efficiency improvements. Energy benchmarking offers such an assessment, albeit simplified. A widely used energy benchmarking tool is the Environmental Protection Agency's (EPA) Energy Star® Portfolio Manager (PM), which is based on the Commercial Buildings Energy Consumption Survey (CBECS) database. This tool relies on standard linear regression models between building energy use intensities (EUIs) and certain building shell characteristics and equipment types. The statistical significance of these models is rather poor, and they only include continuous building variables. In an effort to improve these models, we have investigated the use of a state-of-the-art, tree-based ensemble learning methodology, the random forest (RF) algorithm, to identify the most influential CBECS parameters that impact building EUI for medium office and school building types. Surprisingly, none of the building features turn out to be really influential. Many of the CBECS building features found to be ranked relatively high are not the ones included in the PM models, and the resulting RF models were only marginally better than the linear regression PM models. These findings cast doubts on the veracity (i.e., the accuracy and completeness) of the data contained in the CBECS database. More careful appraisal is warranted given the widespread use of this database for extracting meaningful generalized correlations between energy use and various building and related characteristics. The poor statistical significance of these correlations has important implications for ongoing building energy conservation policy instruments, such as energy reporting and disclosure legislation being enforced by numerous U.S. cities.

AB - Evaluating the extent to which an individual building consumes energy in excess of its peers is the first step in initiating energy efficiency improvements. Energy benchmarking offers such an assessment, albeit simplified. A widely used energy benchmarking tool is the Environmental Protection Agency's (EPA) Energy Star® Portfolio Manager (PM), which is based on the Commercial Buildings Energy Consumption Survey (CBECS) database. This tool relies on standard linear regression models between building energy use intensities (EUIs) and certain building shell characteristics and equipment types. The statistical significance of these models is rather poor, and they only include continuous building variables. In an effort to improve these models, we have investigated the use of a state-of-the-art, tree-based ensemble learning methodology, the random forest (RF) algorithm, to identify the most influential CBECS parameters that impact building EUI for medium office and school building types. Surprisingly, none of the building features turn out to be really influential. Many of the CBECS building features found to be ranked relatively high are not the ones included in the PM models, and the resulting RF models were only marginally better than the linear regression PM models. These findings cast doubts on the veracity (i.e., the accuracy and completeness) of the data contained in the CBECS database. More careful appraisal is warranted given the widespread use of this database for extracting meaningful generalized correlations between energy use and various building and related characteristics. The poor statistical significance of these correlations has important implications for ongoing building energy conservation policy instruments, such as energy reporting and disclosure legislation being enforced by numerous U.S. cities.

UR - http://www.scopus.com/inward/record.url?scp=84938850662&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84938850662&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84938850662

T3 - ASHRAE Transactions

SP - 17

EP - 28

BT - ASHRAE Transactions

PB - American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE)

T2 - 2015 ASHRAE Winter Conference

Y2 - 24 January 2015 through 28 January 2015

ER -

Use of random forest algorithm to evaluate model-based EUI benchmarks from CBECS database

Abstract

Publication series

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this