The data integrated in the SEíS semantic information system encompass energy certificates, building descriptions, simulation outcomes, energy monitoring, and climate data. The data sources have been the following:

ICAEN: Energy certification of buildings and their simulated performance during several stages of the building life cycle including design and refurbishment collected by the Catalan Energy Institute (ICAEN), a public administration. Every energy certification contains the energy rating of the building, energy consumptions, types of the HVAC (heating, ventilation, and air conditioning) systems, and geometric characteristics such as the built surface or the compactness (a ratio between surface of a building and its volume). ICAEN has provided access to more than 1800 energy certifications, 200 of them have been included in the dataset because they include simulation outputs. Currently, relevant attributes such as indicators of consumptions and emissions are not included in the ICAEN database. For compensating this lack, approximated values have been calculated from studies of energy consumption in the Spanish building sector1,and from standard values considered by national regulations2 and from the European standard ISO 13790:20083 as well.

LEAKO: Building monitoring data provided by Leako, a company from the Basque Country dedicated to the installation, distribution and HVAC control. The database contains energy consumption data (for example, thermal consumption for air and water heating, and water consumption) and indoor conditions (for example, air temperature) for several buildings.

Geographical Information National Institute (CNIG): Geographical data collected by the Geographical Information National Institute (CNIG) public institute. It contains information about the Spanish territory including population, areas, elevation, or Universal Transverse Mercator (UTM) coordinate. Nevertheless, these data do not include climate zones which are fundamental for the analysis of energy performance of buildings. For this reason, the climate zone has been estimated for each place using the Spanish Building Code4 since it provides the classification of climate zones per each provincial capital.

AEMET: Climate data from the Spanish Meteorological Agency including 10 minute data from around 250 weather stations of the Aemet's National Surface Weather Stations Network. The AEMET data sources has been linked to the RÉPENER dataset since the outdoor environmental properties of the buildings can be enhanced with the climate data.

1 IDAE. (2011). Proyecto SECH‐SPAHOUSEC. Análisis del consumo energético del sector residencial en España. Informe final. Madrid, Spain: IDEA

2 AICIA. (2009). Condiciones de aceptación de Procedimientos alternativos a LIDER y CALENER. Anexos. Madrid, Spain: IDAE. Retrieved March 4, 2013

3 ISO 13790: 2008 (E). Energy performance of buildings ‐ Calculation of energy use for space heating and cooling.

4 Código Técnico de la Edificación. (2006). Madrid, Spain: Ministerio de la Vivienda.