1. Introduction
Indoor air pollution is placed among the top five environmental public health risks that cause morbidity and mortality globally. The majority of people spend more than 90% of their time in indoor environments [1,2], and health problems and diseases associated with poor indoor air quality (IAQ) can cause a variety of adverse health effects to them [3,4]. The time spent indoors recently increased significantly in year 2020 due to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic when people are advised to ‘stay home stay safe’ to protect health workers [5,6].
Table 1 summarises common indoor air pollutants, their sources, and current guidelines to maintain IAQ. Air pollutants inside indoor environments can be generated from different sources, including occupants’ exhalation (carbon dioxide; CO2), activities such as cooking and smoking, emissions from building materials, etc. from which various air pollutants, such as carbon monoxide (CO), particulate matter (PM), and volatile organic compounds (VOCs) are released [2,7]. CO2 is not counted as an air pollutant, but its level represents the performance of ventilation systems, especially in wintertime, whereas high CO2 levels represent poor ventilation and possible accumulation of other indoor air pollutants [8,9,10]. Additionally, IAQ could be affected by local outdoor air pollutants, which can ingress into indoor environments (see Table 1).
Rapid developments in low-cost sensors (LCSs) and wireless communication technologies have become much more prominent in in everyday life [9,18,19,20,21,22]. LCSs have the potential to revolutionise insufficient IAQ monitoring systems with the prospect of delivering real-time air pollution data through sensor networks, which complement the established reference measurement methods defined in the EU Air Quality Directives (e.g., 2008/50/EC). Although there is no officially agreed definition of the term “low-cost” [23], “low-cost” has been identified by the United States Environmental Protection Agency (US EPA) as devices costing less than $2500 (USD); this is the limit often defining capital investment limits by LCS users [24]. Due to the importance of this technology in the future development of smart homes, the EU has invested millions of Euros on a number of sensor-based projects such as EuNetAir, IAQSense, and SENSIndoor and the similar investments can be seen elsewhere (e.g., USA, Australia). Smart homes can enable an adaptable living environment, e.g., in managing IAQ, to promote comfort and convenience to the occupants. The outcomes of these projects could contribute in the development of novel sensor systems, real-time air pollution monitors, air quality models, and standardised methods [25,26,27,28]. These projects have encouraged research institutions and businesses to take a greater interest in the advancements of sensing technology in IAQ-related research inside smart homes. In the subsequent text, we have interchangeably used the terms “sensor”, “sensor kit” and “LCS”.
Deploying a network of air pollution LCSs with the support of advanced communication technologies is sufficient to provide accurate information in understanding the spatio-temporal distribution of indoor air pollutants and assessment of personal exposures in smart homes. However, among the limited studies focused on LCS applications in indoor environments, studies mainly focused on general applications, benefits/challenges, future demands/directions of LCSs for specific indoor applications [11,23,29,30]. Moreover, their focus on data analysis was limited to changes in concentrations and no prediction or precautionary actions against the possible events were incorporated [31,32]. Although these studies presented promising results, their scope was not to consider the needs of smart homes. Given the scattered information and research gaps in the existing body of literature, this review aims to fill this knowledge gap by summarising the relevant knowledge in different research disciplines, synthesising the emerging themes and providing unique insights for making homes smart with respect to air quality. In particular, the specific objectives of this review are to (i) provide a comprehensive summary of common indoor air pollutants and pros/cons of LCSs manufactured for indoor applications, (ii) review and summarise the optimal deployment strategies of LCSs within a domestic context, (iii) discuss pre-/post-processing protocols to conduct reliable measurements, carry out data management and data processing, and generate useful information for occupants from the large datasets obtained by networked structures, (iv) evaluate the effectiveness of predictive modelling tools to obtain best-fit approaches with an adequate spatial resolution for estimating exposure to indoor air pollution using LCSs.
2. Scope and Outline
The scope of this review is limited to advanced automation technologies, including sensor selection, deployment strategies, data processing, and development of predictive models, which brings healthier indoor environments via monitoring and control of IAQ through the use of networked LCSs. Therefore, a discussion on how to improve IAQ for healthier environments, as well as considerations of ventilation settings, optimal selection of filtration units or air purifying systems are excluded from the scope of this study.
We searched peer-reviewed research articles focusing on the main keywords in various scientific electronic resources, such as Web of Science, ScienceDirect, Wiley, Springer, PubMed and those known to authors. The terminologies used in the search were either one or a combination of “smart homes”, “home automation”, “low-cost sensors”, “affordable pollution sensing”, “sensor deployment strategies”, “data collection”, “data assimilation”, “data processing”, “machine learning”, “data modelling”, “predictive modelling”, “indoor air pollution” and “indoor air quality”. Documents published in the technical reports or the following websites, WHO, UK Department for Environment, Food and Rural Affairs (DEFRA), European Commission, US EPA organisations, and different sensor manufacturers were also reviewed for their contents. Since the topic is broad and explored by private companies/organisations as well, studies/reports outside the aforementioned databases were reviewed as a part of a grey literature review using similar keywords through the Google search engine. Studies that did not have one of the quoted terms or explicitly focused on outdoor/ambient environments were discarded from the analysis. The resulting documents were manually screened and collected information fitting to the context was selected for discussion. The search was limited to English-based articles/reports.
As summarised in Figure 1, the article starts with a brief review of the common indoor air pollutants for developing the background context for the topic areas covered (Section 3), followed by the state-of-the-art air pollution sensing technologies for indoor environments by considering their performance and drawbacks (Section 4). The subsequent section (Section 5) explores optimal deployment strategies to better capture spatio-temporal distribution of indoor air pollutants and addresses how these strategies could be built to improve the accuracy and reliability of data. Section 6 summarises pre- and post-processing methods and tools in dealing with LCSs data. Section 7 describes suitability of advanced IAQ predictive models for indoor settings. Finally, a summary of topic areas covered, conclusions and future remarks are presented in Section 8.
3. Common Indoor Air Pollutants and Their Sources
IAQ is affected by diverse ranges of indoor sources as well as infiltration of outdoor air pollutants. Each source could impact the overall IAQ, depending on their intensity and the operational time (see Table 1). The most common indoor air pollutants arising from indoor occupants, activities and/or materials are CO2, CO, VOCs, and PM in different aerodynamic size fractions, including PM ≤ 2.5µm (PM2.5) and ≤10µm (PM10). Although there can be other pollutants, such as polycyclic aromatic hydrocarbons (PAHs; specifically, benzo[a]pyrene), nitrogen oxides (NOx = NO+NO2), ozone (O3), sulphur dioxide (SO2), formaldehyde (HCHO), radon and persistent organic pollutants (POPs), the presence of all of these components in one place is unlikely. In addition, under-controlled thermal comfort parameters, such as temperature, air velocity, relative humidity (RH), noise and lighting levels are other parameters that make the living environment pleasant for the occupants. Hence, a flow of clean air throughout a building environment is necessary to minimise the risk of accumulation of indoor air pollutants.
4. Sensor Technology
Assessing the existing IAQ and unexpected changes in its level through continuous measurement is necessary to know the status of IAQ and its effects on the occupants’ health. Sensing IAQ with the help of LCSs could be served as the core of smart homes and counted as one of the major components to maintain high-quality living standards. The desirable sensors in smart homes should: (i) be sensitive and selective to target pollutants for reliable sensing relevant to indoor environments that pose health risks to occupants; (ii) be durable with optimal performance over a long-term of deployment; (iii) be small in size, maintenance-free with low-power consumption; (iv) be adopted in complex sensor networks; and (v) work quietly with minimum operating noise [11,12,33,34,35,36,37]. These features enable air pollution sensors to be deployed with relative ease to locations where understanding air quality level could have a huge impact on human health. However, LCSs come with challenges, which may reduce user trust, accuracy and interpretability of recorded data [12,38]. If their quality remained unchanged under realistic conditions, they could become a game-changer in various IAQ measurements [39].
4.1. Electrochemical Sensors
Electrochemical technology is one of the oldest and perhaps widely used technologies for concentration measurements of gaseous pollutants using either potentiometric (measuring a difference of potentials) or amperometric (measuring current of a redox reaction) principles. Fundamentally, electrochemical sensors (ECs) require at least two electrodes (reference and counter electrodes) for operation, which operate based on a chemical reaction between a gaseous pollutant in the air and an electrode in an electrolyte. The sensors are coated with a catalyst that provides a high surface area, which promotes reactions [34]. The recent ECs contain a cell with three electrodes including, measuring, reference and counter electrodes, which host reduction/oxidation of chosen gases. In this technology, the sample gas diffuses through the sensor’s membranes towards the measuring electrode, which results in an electron transfer (produce an internal current). Recently, some sensor manufacturers (e.g., those of AlphaSense and Membrapor, Wallisellen, Switzerland) have upgraded ECs by adding the fourth electrode to monitor physical changes and measure drift [40].
ECs have a comparatively low-cost, high sensitivity/low cross-sensitivity, low detection limit (~sub-ppm), reasonable response time, and less power-intensive (µW) characteristics compared to traditional monitors [34]. Additionally, stability with acceptable drift values (between 2% and 15% per year) have been reported for the commercial ECs (e.g., Nemoto and SGX Sensortech) [40]. However, they are more complicated, vulnerable to poisoning, large in size, of shorter life span (~1–3 years), and more expensive than that of metal oxide semiconductor (MOx) gas sensors (see Section 4.2). As listed in Table 2, ECs have shown interference with the change in meteorology (e.g., air temperature), which is in the first-order impact on an electric output signal of gas concentration (ppb level) and second-order error on gas sensitivity. Low temperatures decrease the speed of reaction in electrochemical cells, which reduce the applicability to operate under cold environments (<10 °C). However, there is a solution to overcome the effects of temperature on background currents (zero currents) that would make a significant impact on measurements at low concentration levels [41].
4.2. Metal Oxide Semiconductor (MOx) Sensors
In MOx sensors, gaseous air pollutants react with the sensor surface and change it’s electrical (resistance or conductivity) properties [44,45]. Measuring the changes in electrical properties represent the concentration of the target pollutant in the air. Because of advances in fabrication methods and the simplicity of semiconductor sensor devices, MOx gas sensors are moderately low-priced compared to other technologies (cheaper than ECs). MOx sensors are robust, lightweight/long-lasting, sensitive to low-concentration gases (as low as ppb level), and less power intensive (less than 1 W) but higher than PIDs (photoionisation detectors; see Section 4.3) [46,47,48].
Simple and fast production processes on a large scale as well as simply controllable processes make MOx gas sensors a desirable technology for air quality monitoring. MOx gas sensors have been reported to be sensitive to a variety of air pollutants [48], with responses changing with the concentration of gaseous pollutants and device operating temperature [46]. MOx gas sensors have been implemented to measure/monitor trace amounts of gaseous pollutants, such as CO, CO2, O3, total VOCs, Ammonia (NH3) and NOx [46,49]. However, non-linear output signals, cross-sensitivity to other gases (especially to changes in environmental conditions and other VOC substances in complex mixtures), poisoned by certain or high doses of target gases (e.g., high concentration of certain organic compounds and gaseous sulphur-containing substances) have been discussed in the literature [12,48,50,51].
4.3. Photoionisation Detectors (PIDs)
The PID is another type of LCS, which uses high-energy photons (ultraviolet (UV) light) for ionisation of gaseous molecules [40]. The main principle is that the gas between the electrodes is ionized by UV light (in the energy scale of 10 eV) to produce charged ions. The resulting ions are proportional to the output signals as well as pollutant concentrations in the detector. Due to high sensitivity, PIDs are extensively used for the detection of VOCs, because each VOC component has its own ionisation potential (IP). IP range varies from easy to ionise substances (~7 eV) to extremely difficult to ionise substances (~12–16 eV). For example, PIDs effectively detect most hazardous gases, including VOCs (e.g., benzene = 9.25; hexane = 10.13; toluene = 8.82; and xylene = 8.56 eV) due to their low IPs, and offer a range of benefits, such as fast response, small size, ease of use/maintenance, and ability to detect low concentrations. However, PIDs cannot detect air constituents (O2 and N2), CO2, CO, SO2, CH4, and O3 due to their high IPs.
4.4. Optical Sensors
Optical sensors, also called light scattering sensors, are used for detection of PMs. Light-scattering PM sensors measure the optical properties of the particles as an ensemble, which offers fast and real-time responses, minimal drift and greatly reduces the cost and size of the sensors [52,53,54,55]. In addition to small size, low-energy consumption (less power supply voltage ~5 V) and ability to generate high-frequency output data during operations make optical sensors a good candidate in various applications [56,57]. Furthermore, variations in PM2.5 concentration measurement under low-concentrations (20–30 μg m−3) among different optical PM sensors against reference instruments could be a major drawback of sensors of this type. This is because the amount of scattered light is reliant on size, shape, density, and refractive index of particles [58]. Despite all these limitations, reliable functioning of optical PM sensors in indoor environments with small spatial scale was reported [59].
4.5. Sensor Selection
Putting multiple sensors together onto boards, calibrating and reshaping them as commercial products for indoor (or outdoor) applications has been a common practice. Such sensor-based products are becoming increasingly available, while the information around lifetime and maintenance are not clearly available. Table 2 (sensors) and Table 3 (commercial sensor-based products) summarise the specification of technologies in the market, whose performances have been evaluated by at least one indoor study. Moreover, the manufacturer’s specifications obtained from technical datasheets, such as type of pollutant, technology, measuring range, reported sensor lifetime, sampling mechanism, sampling interval, environmental operating range, and connectivity have been summarised in Table 3.
Studies showed that the sensor correlations against the research-grade instruments could vary before and/or after deployment even for identical sensors under identical conditions [60,61,62,63]. Furthermore, environmental conditions (temperature and RH) and cross-sensitivities of certain pollutants (e.g., NO2 gas on O3 sensors, NO gas on NO2 sensors, and hydrogen molecule on CO sensors) on sensor readings have been imperfectly addressed [34,38,64,65,66]. In other words, due to the lack of regulatory bodies, questions are raised about their reported values, reproducibility and comparability. However, significant progress has been made in this direction in the recent past. For example, the Air Quality Sensor Performance Evaluation Centre (AQ-SPEC) operated by South Coast Air Quality Management District (SCAQMD) [67,68], the US EPA, Air Sensor Toolbox [69], and the EU Joint Research Centre (EU JRC) [50,70] programs have been initiated to quantitatively evaluate the performance, stability and quality assurance/control (QA/QC) of sensor-based products. To tackle these issues in a more convenient way by not only considering in-field co-location, field normalisation or field calibration with reference instruments [71,72,73], recent studies have shown an alternative solution that can be utilised to improve the QA/QC of readings. Affordable laboratory facilities, such as the Envilution® chamber are currently offered by academic and research institutions to calibrate and evaluate the performance of LCSs before and after deployment under controlled environments [73]. Here, a controlled environment is defined as a situation where changes in environmental conditions and pollution concentrations, representing indoor environments, for testing LCSs can be simulated (controlled) inside the chamber. Therefore, LCSs performance can be assessed under a combination of indoor variations in environmental parameters and pollution concentrations. In-field co-location would be an alternative QA/QC measure after deployment. Moreover, routine calibration checks after deployment for simple networks along with advanced statistical techniques, e.g., data consistency checks, network correlations, and principal components analysis, in complex networks (Section 7) can boost the performance of this system to maintain long-term satisfactory performance. Such platforms, initiatives and programs offer support to obtain reliable data by the use of appropriate sensors, which could result in improving personal exposure estimates in home environments.
5. Deployment Strategies
Enclosed environments such as homes trap more polluted air than open environments due to the presence of indoor sources, lack of free-flow air circulation and inadequate ventilation. In addition, different exposure levels to indoor air pollutants have been reported for individuals even at the same location [32,74]. Unfortunately, the use of conventional monitoring devices are unable to satisfactorily capture a spatial variation and map instantaneous changes in IAQ because of the associated cost, non-scalability, and lack of spatio-temporal mapping of indoor air pollutants [11,30,75,76]. Considering these potentials and demands for technologies, the emergence of LCSs has changed the landscape of IAQ monitoring systems, where specific sensors and sensor-based products are manufactured and designed for indoor applications (see Table 2 and Table 3, respectively). Although air pollution sensors have some drawbacks (Table 2), relatively smaller changes in the environmental parameters and less complexity of indoor air flow patterns compared to outdoor environments could be beneficial for using them indoors. Table 4 summarises examples of the sensor applications in indoor environments, in which less attention has been given to strategies for spatio-temporal distribution of multiple air pollutants. The objectives of the reviewed studies were limited to the performance evaluation of sensors in enclosed environments, in which near-source air pollution monitoring systems or considering an adult’s breathing height as a common practice among the studies is inadequate to assess overall IAQ [35,77,78,79]. Considering indoor arrangements and the relationship between indoor-outdoor environments [80], here we focus our efforts in developing suitable strategies for indoor environments, while air pollution sensors are playing the major role in covering the area.
Air quality sensors should be deployed systematically across a location in order to (i) optimise the cost and the number of sensors according to building layout, space and room features, (ii) ensure the reliability level of the sensor network in case of sensor failure, (iii) provide acceptable spatial and temporal coverage of indoor air pollutants, and (iv) minimise the cost associated to computational analysis and prediction models [23,90,91,92]. There are common suggestions regarding deployment strategies in practical engineering applications, such as sensor selection as per common indoor sources, considering the impacts of outdoor air pollution on indoors, and deploying sensors along the wall with proper accessibility for calibration or maintenance [35,93]. However, sensor deployment strategies, especially for IAQ applications are usually determined based on objective functions and sensor applications [90,94]. In general, deployment strategies in indoor environments vary with time and space, which can be categorised into (i) engineering, and (ii) optimisation methods. In the engineering method, previous experiences and rules of thumb are incorporated. Uniform deployment of several sensors in space would be a common practice in engineering methods as can be seen in studies listed in Table 4, which may result in a fairly expensive and unfeasible output in some cases. Application of this method may result in lack of (i) spatio-temporal mapping, (ii) controlling the response time, and (iii) generalisability to multiple rooms/spaces [90,94,95]. To compensate for the limitations of this method, the optimisation method has recently developed, in which indoor airflow patterns in the deployment of sensors are taken into account [96,97,98,99,100]. In this method, modelling tools such as computational fluid dynamics (CFD), zonal model, and multi-zone airflow model are utilised along with genetic algorithm, artificial neural networks (ANNs), simulated annealing, and stochastic approximation methods to optimise objective, cost, or fitness functions based on the predefined goals [97,98,101,102]. Although this method could bring precision in choosing the optimal strategy, optimisation methods could be computationally intensive in the large deployment of sensors in multi-zone airflow and CFD-based simulations [95]. Nevertheless, to achieve optimal strategies regardless of sensor locations and to avoid occasional error in the prediction results of small sensor networks (a combination of 3 to 4 sensors as reported by Ren and Cao [79]), systematic sensor deployment methods, such as clustering model of fuzzy C-means (FCM) algorithm based on ANN [103] or based on the genetic algorithm [104] for the efficient prediction of indoor environments could be employed.
Although no standard values for IAQ exist and the idea of setting guideline values [13,105] is not new, we propose a simple deployment strategy for LCS deployment in typical indoor spaces after building the evidence-base from the relevant published literature (Figure 2). This basic strategy could be considered as a generalised plan, where developing an optimisation model is not computationally feasible and could include (i) deployment of environmental and pollutant sensors across the indoor space, whereas deploying height has to be set according to occupants’ height; and (ii) deploying sensors based on specifications discussed in Figure 2, in locations, where taking samples using sensors’ induction fan can represent the entire environment. In the absence of a legislative framework for regulating IAQ, such a strategy could help optimise the sampling that is representative of indoor environments and can be beneficial in planning appropriate mitigation steps for reducing the exposure from indoor air pollutants. However, an optimised network of air pollution LCSs needs to be supported by the appropriate data processing (Section 6) and predictive modelling (Section 7) to allow its interpretation, visualisation and conveying the meaningful messages to the users in a simple form.
6. Data Processing
A network of LCSs requires a substantial amount of pre- and post-processing of data before presenting to the users. Pre-processed data is recorded by LCSs, which utilises an initial calibration (pre-processed data). Post-processed data is transmitted by the sensor to a database, which subsequently undergoes QA/QC protocols before being made available to the users. Pre-processed data should often not be made available to the users until sufficient QA/QC has been performed. QA/QC is essential in LCS monitoring systems and refers to a set of activities and measures that are taken to ensure that the requirements, objectives and established quality standards with a pre-established level of performance and confidence being met. However, their role is not to guarantee that the data is of the highest possible quality, which is often unreachable and unfeasible. What is sought is to ensure that the data are accurate, reliable, fit and adequate for a particular purpose or application.
6.1. Pre-Processing of Low-Cost Sensor (LCS) Data
LCSs are manufactured to measure numerous parameters, including but not limited to (i) date/time; (ii) environmental parameters (e.g., temperature (°C), relative humidity (RH, %), barometric pressure); (iii) gaseous pollutants (concentration by molar ratio or mass); and (iv) particle concentrations, segregated size fractions in different size bins (µg m−3). The amount of data produced by LCSs is often orders of magnitude greater than traditional measurement techniques. For example, at an acquisition rate of 1 Hz, the total number of measurements could be 86,400 per day per single measurement. If one considers monitoring of six indoor parameters at a minute sampling frequency, it will have 8,640 samples in one day per location. Considering a network with multiple locations, it brings challenges to data management and processing [106,107,108].
Handling of large volume datasets requires an infrastructure to process data. To do so, several tools have been developed to address the processing of such large multivariable dataset, with good performance. One of the available tools is the Apache Spark framework (
On top of big data frameworks—a descriptor for very large and multivariate time-series datasets produced by LCS systems—different kinds of tool were designed to tackle specific applications. In the internet of things (IoT) area, multivariate time-series are continuously needed to be pre-processed to guarantee its fitness for their expected usage. Currently, the combination of big data frameworks along with time-series databases, data collectors, data monitors, and data visualisers, has boosted the ability to use data from LCSs to generate useful and reliable IAQ information. For time-series database management and streaming, the open-source InfluxDB platform [115] offers a variety of tools and mechanisms to deal with LCSs time-series datasets [82,116,117]. The capability of open-source InfluxDB has been proved for its time-series functionality, keeping costs as low as possible, making querying archives simple, and connectivity to data collectors like Telegraf and to graphing software like Grafana low-effort [116]. The Telegraf tool [118] offers a plugin architecture that supports the connection between a broad range of data sources to collect and report metrics and events. Grafana [119] has emerged as one of the most used platforms by the industry, offering a rich and extendable web interface to build dashboards on top of data sources and collectors, catch errors and monitor readings, bring compatibility with several languages, tools and frameworks [116].
In summary, pre-processed datasets always involve three problems: the quality of data, high dimensionality, and the growing amount of data. The measurements provided by LCSs are only useful when these issues are overcome. With an increase in interest surrounding big data and its applications, many open-source frameworks have been developed as discussed with the capability to process and store large amounts of time-dependent data. These tools help LCS networks to effectively propagate and batch-process data enabling users to conduct a wide range of experiments concurrently with real-time monitoring of the results.
6.2. Post-Processing of LCS Data
Post-processing techniques, such as outlier detection, data cleaning and gap-filling methodologies could help to determine missing, duplicated, inconsistent datasets, and eliminate high-frequency noises to improve the quality of measured data [120,121]. To meet the demands for higher data quality in LCS systems, Mahajan and Kumar [106] presented a toolbox, known as Sense Your Data: Sensor toolbox. This web-based tool provides easy and efficient functions to analyse air pollution data for both researchers as well as the general public. The tool offers data plotter (including data summary), anomaly/outlier removal and gap-filling. The three different algorithms implemented in this tool for data processing are: (i) autoregressive integrated moving average (ARIMA) additive for tasks related to prediction/forecasting [122]; (ii) K-nearest neighbour (K-NN) for anomaly detection [120,123]; and (iii) the ANN model for air pollution time-series data dealing with forecasting [122] and gap-filling [120]. The two algorithms for gap filling are: (i) Interpolation using the “imputeTS” package [124] to fill the missing values in the dataset; and (ii) Kalman filter to estimate past, present and future values even when the precise nature of the system is unknown [125].
Other anomaly detection techniques that are specialised in time-series data are the SAX algorithm (symbolic aggregate approximation); [126]) and the cluster-based algorithm for anomaly detection in time-series using Mahalanobis distance (C-AMDATS; [127]). SAX addresses the detection of anomalies in time-series datasets using the concept of discords, which transforms a time-series into a sequence of characters (i.e., a string) using clustering techniques [128]. C-AMDATS, in turn, is an unsupervised learning technique that uses clustering methods and the covariance matrix to compute the Mahalanobis distance, to determine how a certain pattern differs from the others, and to calculate the most anomalous using an anomaly rank index. It is a multivariate technique and its performance has been evaluated as the best results compared to the SAX algorithm using urban air pollution data [127]. Recently, a lightweight python library called Luminol for time series data analysis was developed, which implements several anomaly detection algorithms [129]. Luminol owns a series of applications ranging from detecting and correcting network anomalies—the amount of writing, requests, etc.—to health, sensors and IoT applications, which could be valuable and important for post-processing time-series data from networked LCSs.
7. Predictive Modelling
Developing a predictive model that can forecast the changes in IAQ and occupants’ exposure is crucial to obtain concentration profiles of air pollutants in indoor spaces [130]. Predictive modelling is a commonly used technique, which employs analysis of historical/current data and generation of a model to help predicting future outcomes. With the availability of IAQ data collected using the LCSs, sophisticated techniques can be employed to develop a predictive model. In the subsequent sections, we review and consolidate the techniques used for predictive modelling and bring the prevalent best practice and knowledge to develop optimal indoor models, in which previously discussed topics are used as the foundation in the model development.
7.1. Types of Indoor Air Quality (IAQ) Predictive Models
IAQ modelling is a non-invasive and inexpensive method to better estimate spatio-temporal distribution of indoor air pollutants. IAQ is commonly predicted using mechanistic (white box) or statistical (black box) models. Mechanistic models utilise detailed input parameters which apply fate and transport of indoor air pollutants via diffusion, convective mass transfer, and sorption of pollutants. Mechanistic models can be applied on unoccupied microenvironments where detailed indoor/outdoor target air pollutants, building layout and ventilation conditions are available or under-controlled. Mechanistic models have been implemented in several studies to predict indoor PMs [131,132,133] as well as VOCs [134,135,136,137]. Mechanistic models can be categorised as single compartment mass balance-based model and CFD model, as described here:
The single compartment mass balance-based model is a common mechanistic model that has been widely used in studies to explore IAQ with proper validation against real-world data [138,139,140]. Liu and Zhai [94] integrated a probability-based adjoint inverse method into the single compartment mass balance-based model to back-track indoor pollution sources. In the model, interpolation was used to obtain the pollutant concentrations at the locations among sensors, where sensor readings are assumed to be always accurate. However, this is not the right assumption in the case of LCSs due to drift error. For example, the uncompensated drift error and standard deviation of a VOC sensor in many environments were about 0.8 and 0.3 ppm per 4 months, respectively [141]. Therefore, Xiang et al. [142] improved the mass balance-based model by considering LCS specifications and optimally compensating drift errors. The corrected model was composed of an optimal indoor concentration prediction and estimation model, which was supported by a hybrid sensor network synthesis algorithm.
CFD is a well-known mechanistic model that is restrictive in nature due to its exceptional complexity and dependency on many assumptions, approximations, and real observations. Empirical models can be integrated into detailed mathematical models to enhance the accuracy of predictions. CFD supported models by empirical/physics-based models require additional resources and pre-existing knowledge during model development [143,144,145].
In statistical models, model parameters are identified using experimental data and the model structure is inferred by applying statistical methods. While mechanistic and empirical/physics-based models are complicated to develop and there are no established mechanisms, statistical models can help especially in case of dealing with large datasets [146]. This technique can deliver reliable outcomes, but the complete lack of physical insights is a significant drawback. Statistical models have been developed in which they appear to be less resource-demanding compared to other models. In fact, statistical models need the use of consistent input data streams via data loggers or pollution monitors, thereby, the absence of input information flow could endanger the accuracy of the model [144,147,148].
In addition to traditional statistical models, such as kriging or Gaussian process regression, the use of machine-learning techniques gained increased attention in statistical IAQ predictions. The common statistical machine learning-based models are multiple linear regression, partial least squares, generalized linear model, decision trees (classification and regression trees), Bayesian hierarchical model, generalized boosting model, support vector machine, random forests, and ANN [72,146,149,150,151,152,153]. Although discussing the details of these methods is not the primary objective of this study, we showcase the most applicable models that can be of use in building predictive models using LCSs.
Linear regression is a statistical method that captures the linear relationships of independent variables to predict the value of a dependent variable, such as forecasting air pollution [154,155]. Partial least square model and generalised linear model provide a general framework for handling regression models for normal/non-normal data that can be applied in IAQ applications [156,157]. Decision trees are simple but successful techniques that predict the target value via learning simple decision rules [152,158]. Bayesian hierarchical modelling is a statistical model that utilises Bayes’ theorem for estimations. The hierarchical approach facilitates the understanding of multi-parameter problems and developing computational strategies [159,160]. A generalised boosting model is a combination of decision tree-based algorithms and boosting techniques, which frequently fit decision trees to improve the accuracy of the model [152]. Support vector machine regression is the proposed method to deal with non-linear problems [72,161]. Random forest or random decision forests regression model is a simple, flexible and most used machine learning algorithm, which can be utilised in both classification and regression applications [162,163,164,165]. ANN is the most commonly used machine learning technique for solving complex problems [70,72,166,167,168,169]. ANN has shown the capability of estimating IAQ with an acceptable range of 0.62 < R2 < 0.79 only with one hidden layer [167,170,171]. However, there are few emerging applications of deep neural networks (DNN), like recurrent neural networks (RNN), long short-term memory (LSTM), and gated recurrent units (GRU), that need exploration [146,172,173,174].
7.2. Best-Fit Approaches for LCS IAQ Modelling
Table 5 presents a summary of modelling studies for residential settings that various machine learning techniques are used for the prediction of IAQ parameters. The development of machine learning and statistical models in recent years (Section 7.1) has offered significant benefits in the prediction of complex indoor environments [72,146,165]. The development of these predictive models would require large scale data collection provided by the sensor network, adequate computing infrastructure for data processing, analysis and model construction. Although the applications of predictive models are vast, the limited efforts on implementation of ANN, multiple linear regression, and random forest regression models showed acceptable performances in predicting indoor variables. Nevertheless, further efforts should be undertaken to enhance the performance of these tools in predicting all known indoor air pollutants (Figure 2) using continuously generated data by sensor networks rather than focusing only on proxies.
Based on the review of various predictive modelling techniques, it has been found that the statistical models based on machine learning (Table 5) could provide a good fit for indoor air pollution prediction in smart homes. This technique provides a powerful tool for modelling the behaviour of indoor built environments with a complex interplay of the response and predictor variables. The predictive model should also be able to optimally maintain its stability in dealing with inaccurate readings and source generation rate estimates by applying proper weighting factors (a function of sensor drift and source generation rates) to improve the overall prediction accuracy. To do so, mechanistic model techniques are utilised to provide the basis for the selection of appropriate parameters for machine learning models on theoretical physics-based principles. However, uncertainty and potential disadvantages of mechanistic models as highlighted in the previous section could endanger the feasibility of the model in multiple buildings or the case of an occupied building.
8. Conclusions and Future Remarks
People spend a significant amount of their time in indoor environments, where they are most probably exposed to at least one IAQ problem. IAQ remains mostly unregulated and maintaining safe IAQ during the long-term stay at homes to tackle the novel coronavirus pandemic, or similar outbreaks is more challenging [184]. Smart homes equipped with air quality LCSs and integrated processing/predicting tools can offer a healthy environment to occupants. Although technologies in this field are continuously evolving, emerging knowledge among the researchers in different fields is sparse, and smart home components are considered separately due to diversity in the research field. Here, we reviewed the standard protocols needed to be met to satisfy the indoor measurement challenges. Then we reviewed data assimilation and data processing tools and predictive modelling techniques to estimate indoor exposure. From the study, the following conclusions are drawn:
Indoor pollutants are released from different sources at different concentration levels, thereby, selection of LCSs should be in the way that they can serve the task according to the target pollutants and concentrations. The accuracy and diversity of LCSs used in indoor environments is an important focus in deployment strategies of LCSs in smart homes. Proper deployment height is also suggested due to variation in exposure heights among the occupants.
Deployment of networked LCSs to map spatio-temporal distribution of indoor air pollutants is necessary to optimise the number of deployed LCSs, obtain meaningful data, reducing the computational time/cost, and data handling without losing accuracy. There are limited studies on long-term deployments of sensor networks, especially in indoor residential environments.
The lack of data reliability and QA/QC is counted as the most important challenge associated with LCSs. We emphasised an important role of laboratory calibration of LCS. Relying only on initial LCS calibration, which is a prevalent practice in reviewed studies, for long-term deployment should be complemented by routine performance testing to the success of networked sensors. Such performance evaluations can allow maintaining data quality, oversee manufacturing variability, sensor stability, drift and ageing over time.
Several open-source tools have been developed for data processing to give network providers the tools to deploy large-scale networks with little overhead. As LCSs record large amounts of time-series data, open-source tools such as InfluxDB and Grafana are necessary to be able to capture and process recorded measurements as well as allow easy visualisations for both the network operator and the occupants. Considering home-specific internal data servers can offer additional security from the external threats.
A wide range of data processing tools are available with many capabilities, including data cleaning, data plotting and different types of anomaly detection. These tools can increase the confidence and reliability of the data, improving the services provided by the network providers and improving the experience for the occupants.
There is an increasing trend towards the application of machine learning-based statistical models due to the availability of a continuous flow of IAQ data using LCSs. However, there are several limitations of exclusive data-based studies due to the lack of established knowledge related to the selection of desirable parameters, appropriate performance metrics, and the application of different models for different scenarios. Therefore, the best way forward would be to further advance the knowledge of statistical models for IAQ prediction by carrying out larger-scale deployments and considering a wider range of indoor pollutants that are backed by the theoretical principles from mechanistic models for modelling the underlying micro-environmental principles and mechanisms.
Making homes smarter is becoming an integral component of the smart city concept. According to the Allied Business Intelligence (ABI) Research report on smart homes [185], almost 300 million smart homes are set to be installed around the world by 2022. Having smart homes in terms of IAQ is not a distant dream. This review reveals the benefits of using technological advancement in estimating the effects of long-term exposure to indoor air pollutants and determining new prevention strategies and control measures on health conditions in smart homes. It contributes to future generations of smart buildings as well as designing of smart cities and embracing smart technologies for IAQ monitoring by the general public and adopted in their routine lifestyle. Some of the ongoing projects such as the MyGlobalHome [186] aim to develop such advanced property development platform by connecting developers to consumers of sustainable and connected homes and seek to bridge a gap between the smart technology developers and property developers. The efforts by the aforementioned projects along with the support of ongoing research activities concerning air quality sensors could result in appreciable health benefits to smart home occupants.
Author Contributions
Conceptualization, P.K.; methodology, H.O. and P.K.; writing—original draft preparation, H.O. and P.K.; writing—review and editing, H.O., P.K., J.H., M.G. and E.G.S.N.; supervision, P.K.; project administration, P.K.; funding acquisition, P.K.; resources, P.K. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by an Innovate UK funded project ‘MyGlobalHome’ prototype project and the pilot demonstrator under the Technology Strategy Board File References: 104782 and 106168, respectively.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
We thank the reviewers for their time to help with the review of the article.
Conflicts of Interest
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures and Tables
Figure 1. Essential steps toward a successful implementation of smart indoor sensor network in achieving appreciable indoor air quality (IAQ) and health benefits to home occupants.
Figure 2. Schematic diagram of a simple home deployment strategy for LCSs as per location, including proposed environmental and air pollution sensors (green boxes) and their associated ranges (a blue box) in a typical indoor space. The representative image of a home building was obtained from free sources using Google image search engine.
The unhealthy exposure thresholds defined for the common indoor and outdoor air pollutants [11,12,13].
Pollutants | Indoor Air | Outdoor Air | References |
---|---|---|---|
Benzene (C6H6) |
Carcinogenic compounds, no safe level of exposure |
5 (annual) European Union (EU) |
[13,14] |
CO |
100 (15 min–once per day) |
10 (max daily 8 h mean) EU |
[13,14] |
CO2 |
<1000 (hygienically harmless) |
405 (by |
[15] |
HCHO [µg m−3] | 100 (30 min) WHO | N/A | [13] |
Naphthalene [µg m−3] | 10 (annual) WHO | N/A | [13] |
NO2 |
200 (1 h) WHO |
200 (1 h) EU/WHO |
[13,14] |
O3 |
N/A | 120 (max daily 8 h mean) EU |
[14,16] |
PAH (benzo[a]pyrene) |
All indoor exposures relevant to health, |
1 (annual) EU |
[13,14] |
PM2.5 |
10 (annual) WHO |
10 (annual) WHO |
[13,14,16] |
PM10 |
20 (annual) WHO |
20 (annual) WHO |
[13,14,16] |
Tetrachloroethylene |
250 (annual) | N/A | [13] |
Trichloroethylene |
Carcinogenicity with risk of 4.3 × 10−7 at 1 µg m−3 | N/A | [13] |
TVOCs a |
<0.3 (no hygienic objections) |
N/A | [17] |
Note: N/A refers to not available; AIR refers to German Committee on Indoor Guide Values, formerly known as “Ad-hoc AG”. a Total VOCs, defined by the International Organisation for Standardisation (ISO) 16000-6.
Table 2The current available air quality LCSs characteristics, advantages and disadvantages [39,40,42,43]. Examples of different deployed environmental and air pollution LCSs suggested for indoor environments are presented at the end of table.
Sensor |
Known for | Summary of Pros and Cons |
---|---|---|
Electrochemical |
NO2, SO2, O3, NO, CO, |
√ Good sensitivity, from mg m−3 (potentiometric) to µg m−3 (amperometric). |
MOx |
CO, CO2, H2, O3, NH3, |
√ Good sensitivity, from mg m−3 to µg m−3 (ppb level) and relatively long lifetime (>5 years). |
PID |
VOCs 1 | √ Small in size with moderate price (approximately 400€ for a sensor to ~5000€ for a handheld device). |
Optical particle counter |
PMs | √ Fast response time (in a second). |
Optical | CO and CO2 | √ Good sensitivity for CO2 (350–2000 ppm). |
1 Photoionisation detectors (PIDs) demonstrate a better sensitivity than electrochemical cells for volatile organic compounds (VOCs) (range from 100 ppb and 20 ppm).
|
Specification of sensor-based product specifications (both single- and multiple-purpose units) reported by manufacturers available in the market that could be used for IAQ and/or personal indoor exposure monitoring systems. The authors highly suggest the buyers to check the up-to-date specifications of the sensors prior to selection and do not endorse any brand or a product.
Single-Purpose Units Designed for IAQ | |||
Sensor name | Pollutant | Technology | Specific Practical Features |
Aeroqual S500 (OZU) | Can be used with a wide range of gas sensor heads (e.g., CO, CO2, O3, VOCs, PM2.5 and PM10). | A sensitive MOx that relies |
Battery: Yes (12Vdc 2700 mA.h) |
AirAssure by TSI | Real-time measurements of PM2.5 mass concentrations. | Enable a light-scattering photometer that detects and measures PM2.5 between 5 and 300 µg m−3. | Power supply: Yes (24 V, 5 W max) |
AirBeam2 |
Measures PM1, PM2.5 and PM10, temp. and RH. | Use a light-scattering method to measure PMs. Particle sensor (Plantower PMS7003); RH sensor (Honeywell HIH-5030-001); Temp. sensor (Microchip MCP9700T-E/TT) | Battery: Yes (up to 10 h battery life) |
Alphasense OPC- Particulate Monitor | Measures PM1, PM2.5 and PM10. |
Use laser beams to detect |
Power supply: No battery, 175 mA |
AS-LUNG |
Real-time measurements of PM1, PM2.5 and PM10 in µg m−3 as well as CO2 concentrations. | Use Plantower PMS3003 laser particle counter sensors, which come factory calibrated. | Battery: No (Yes for the station) |
Cair | Detect dust particle concentration of a given size range in pcs ft−3. |
Count particle via laser beams | Power supply: No battery, yes (USB 5 V) |
Cairsense micro-sensors | Offers a separate range of air quality gas sensors, including T, RH, NO2, NH3, CO, O3+NO2, NH3, H2S+CH4S, HCHO, SO2, PMs, and non-methane VOCs. | See Technical Data of each sensor kit for detailed specifications. | Battery: Yes |
Dylos - DC1700-PM | Measures both PM2.5 and PM10 number (>0.5 µm and >2.5 µm) and mass concentrations interchangeably. | Use a true laser particle counter, where laser beams detect particles going past by their reflectivity. | Battery: Yes (up to 6 h of continuous use) |
Eco Witt WH43 | Designed to provide real-time measurements of PM2.5 mass concentrations. | Use Honeywell HPM Series Particulate Sensor to detect/count particles using light-scattering between 0–999 µg m−3. | Battery: Yes |
Laser Egg | A handheld device that provides real-time measurements of PM2.5 and PM10. | Use light-scattering to measure particles between 0.3 and 10 micron within 10–100 ms in aerodynamic diameter. | Battery: Yes |
Micro Aeth | Model AE51 aethalometer, BC aerosol monitor that measures 0–1 mg BC m−3 | Measures the rate of change in absorption of transmitted light due to a continuous collection of aerosol deposit on T60 (Teflon coated glass filter). | Battery: Yes |
MicroPEM by RTI | A portable sensing device that measures PM2.5 and PM10. | It combines real-time nephelometry and integrated referee filter PM measurements. The device carries an impactor and a light-scattering particle detector. | Battery: Yes (up to 40 h of continuous operation) |
Naneos - Partector | A portable, battery-powered instrument that measures the lung deposited surface area (LDSA) of nanoparticles. | Measures nanoparticle surface area based on a non-contact electrical detection principle. | Battery: Internal rechargeable Li:Ion battery (15 h) |
POM * by 2B Technologies | Personal Ozone Monitor (POM) 4 ppb–10 ppm, Resolution 0.1 ppb | Absorption of ultraviolet light |
Battery lifetime: 5–8 h |
PurpleAir |
An OPC, which measures PM1, PM2.5 and PM10 mass concentrations from the counts. | Use Plantower PMS5003 laser particle counter (maximum range ≥1000 μg m−3), where laser beams detect particles going past by their reflectivity. | Power supply: 5 V DC, 3 A |
PurpleAir PA-I-Indoor | Use PMS1003 laser particle counters (maximum range ≥1000 μg m−3) where laser beams detect particles going past by their reflectivity. | Power supply: 5 V DC, 3 A |
|
Samyoung S&C |
Designed to provide real-time measurements of PM2.5 mass concentrations. | The PM2.5 sensor provides mass concentration over 0.3 µm sized particles through Samyoung S&C’s proprietary optical structure with Infrared emitting diode. | Power supply: Yes |
Sensirion - SPS30 Eval Kit | Real-time PMs mass/number concentrations. |
Based on laser scattering (1 to 1000 μg m−3) by using advanced particle size binning | Power supply: Yes |
Speck by Airviz | Detects fine PM (between 0.5 and 3.0 micron) in indoor environments. | Equipped with an optical sensor (DSM501A) that counts the number of particles per litre of air (ppl). It can estimate the particle mass per cubic meter of air |
Power supply: Micro USB, 5 V 500 mA |
Multipurpose units designed for IAQ | |||
Sensor name | Pollutants | Technology | Specific Practical Features |
+IQAir - AirVisual Pro | A handheld device that provides real-time measurements of PM (0.3–2.5 μm) and CO2 (400–10,000 ppm). | Use a light-scattering method to measure PMs. | Battery: Rechargeable Li:Ion (up to 4 h on a single charge) |
Air Fruit | A handheld device that provides real-time measurements of PM2.5, CO2, temp. and RH. | Use light-scattering to measure PM2.5. |
Power supply: 5 V USB cable |
Air Quality Egg V2 2020 | Used for measurements of CO2, SO2, CO, O3, PM (PM1, PM2.5 and PM10) and NO2 (not all together). Each set of sensors also monitors temp., pressure and RH. | Dual Plantower PMS5003 sensor ranges between 0.3 and 10 μm. | Power supply: No battery, 5 V USB or Micro-USB |
Airthings Wave Plus | A smart air quality monitor capable of measuring temp., RH, TVOCs, air pressure, radon and CO2. | Sensor specifications (except for CO2 which is NDIR) are not included in the product sheet. |
Battery: Yes, 2 AA 1.5 V |
AirThinx IAQ | Real-time measurements of PM1, PM2.5 and PM10 in µg m−3. It also provides temp., RH, pressure, CO, CH2O and TVOC measurements. Holds Conformitè Europëenne (CE), Federal Communications Commission (FCC), PCS Type Certification Review Board (PTCRB) certificates. | Equipped with a factory calibrated Plantower PMS5003 laser particle counter. |
Power supply: Yes (5 V DC) |
Awair | Real-time measurements of temp., RH, CO2, PM2.5 and chemicals (VOC). It needs Wi-Fi for setup. | Senso specs: |
Battery: No |
Blueair Aware | The Blueair Aware is a standalone air quality monitor to measure T (0 to 50 °C), RH (25% to 75%), CO2 (450 to 5000 ppb), PM2.5 (1 to 500 μg m−3), and TVOC (125 to 1000 ppb). | N/R | Power supply: Yes (Non-detachable USB cable) |
Edimax | The Edimax Edigreen Home sensor measures PM2.5 and PM10 in µg m−3, CO2, HCHO, TVOC, temp. and RH. | Use a Plantower PMS5003 laser particle counter, which comes factory calibrated. |
Power supply: Yes (USB power adapter) |
Huma-i (HI-300A) | Advanced Portable Air Quality Monitor Indoor and Outdoor Measures temp., RH, CO2, VOC, |
CO2 (400~5000 ppm) |
Battery: Yes, built-in Li-polymer @ 650 mAh/3.7 V |
IDEAL AS10 | The IDEAL AS10 indoor air sensor measures the air composition, indoor climate and possible environmental impacts, all in real time. | It measures PM2.5 and PM10 (0–1000 μg m−3), VOCs (0–32,768 ppb), temp (–10 to +50 °C), RH (20–90%) and air pressure (20–110 hPa). | Battery: No |
Laser Egg +chemical or +CO2 | A handheld device that provides real-time measurements of temp., RH, PM2.5 and VOCs or CO2. | Use light-scattering to measure particles between 0.3 and 2.5 micron. | Battery: Yes |
Magnasci SRL - uRADMonitor A3 (HW105) | Measures 8 air quality parameters including PM2.5, CO2, VOC, HCHO, temp., RH, barometric pressure and Gamma/X-ray radiation. |
Use laser scattering sensor to detect PMs; a NDIR sensor to measure CO2; an EC for HCHO, a Bosch BME 680 sensor for temp., RH, barometric pressure and VOC; and an S129BG Geiger Tube to detect gamma and X-ray radiation. | Battery: No |
SainSmart - Pure Morning P3 | Measures PM2.5, HCHO, CO2, air temp. (in °C) and RH (%). | Equipped with a Plantower PMS5003 laser particle counter. | Battery: No |
Temtop M2000 | Measures real-time reading of HCHO, PM2.5, PM10, |
PM2.5 range: 0~999 μg m−3 |
Battery: Yes |
uHoo | Carries eight dedicated sensors for VOCs (10–10,000 ppb), PM2.5 (0–200 µg m−3), CO (0–1000 ppm), CO2 (400–10,000 ppm), O3 (10–10,000 ppb), temp. (–40 to 85 °C), RH (0 to 100%) and air pressure (300–1100 mbar). | Battery: No |
* Designated by US EPA as a Federal Equivalent Method (FEM: EQOA-0815-227). Note 1: Sensors’ lifetime of the reviewed kits is not reported in the technical documents. Note 2: N/S and N/R stand for not specified and not reported, respectively.
Table 4Some recent LCS deployments indoors across the world.
Pollutants/Range of Operation | Sensor Type | Room Size | Reference Instrument | Sensor Placement/ |
Sampling |
Correlation Factor R2/Agreement | Duration | References |
---|---|---|---|---|---|---|---|---|
PM1, PM2.5, PM10 | PM-Model-II particle counters (Plantower PMS3003) | 88 m2 | Particle monitor (Thermo Scientific Model FH 62 C14) | An apartment in Beijing | 1 min resolution | 79% of the spatiotemporal variation based on a regression model | 10 days | [81] |
PM1, PM2.5, PM10 | Dylos DC1100 Pro and Plantower PMS sensor (AirU) | Two homes, 306.6 m2 and 140 m2 | GRIMM, DustTrak, and MiniVol | Two households in Salt Lake City, USA | 1 min | See various R2 in the manuscript | One-week calibration, several weeks for sampling | [82] |
PM2.5 and PM10 from 21 common residential sources | Air Quality Egg 2018; IQAir AirVisual Pro (AVP); Awair 2nd Edition; Kaiterra Laser Egg; PurpleAir Indoor; and Ikair | N/R | Grimm Mini Wide-Range Aerosol Spectrometer Model 1371 | N/R | 5 min | R2 ≥ 0.83 | 48 h | [83] |
CO (0-29), CO2 (0-3600), PM10/2.5 (0-1) and VOC (0-46) * | Aeroqual Series 500 with different sensor heads. gas-sensitive electrochemical (GSE), NDIR, laser particle counter, and PID types | Floor area of merely 9.3 m2 | Not applicable | Subdivided unit (SDU) in Hong Kong | 60, 120, 5, and 30 sec | No significant correlation between indoor and outdoor pollutants in case of CO (3.58%), PM10 (0.96%), and PM2.5 (7.11%). | 48 h in each SDU in the summer of 2018 | [84] |
Noise (35–120 dB), T (0–50 °C), RH (0–100%), CO (0–1000 ppm), CO2 (0–5000 ppm), NO (0–20 ppm), NO2 (0–20 ppm) and PM2.5 (0.38–17 μm) | Netatmo Weather Station, Onset Temperature, Alphasense (COB4, NOB4, NO2B43F, and OPC-N2), and Harvard miniPEM | In a residential building | RTI MicroPEM for PM only (5 min avg.) | Boston, MA, |
1 min | Carried out only for PM2.5 |
Multiple 1-week sessions | [71] |
CO, NO2, NO, O3, PM2.5 (PAM, Model AS520) | 4-electrode ECs |
A living |
BLUME instrumentation uses chemiluminescence to measure NO2 and NO, UV absorption for O3, non-dispersive infrared absorption for CO, and particle light-scattering for PM2.5 (model EDM180, Grimm Aerosol Technik, Ainring, Germany). | The indoor instrument was placed into the home’s living room, which was either adjacent to the back garden or separated from it by a room in between. | Maximum: 1 Hz. |
For the inorganic gases (0.92 < R2 < 0.96) |
Simultaneous indoor pollutant measurements |
[85] |
Ammonia (1–500 ppm), CH4 (>1000 ppm), C3H8 (>1000 ppm), C4H10 (>1000 ppm), CO (1–1000 ppm), Ethanol (10–500 ppm), H2 (1–1000 ppm), NO2 (0.05–10 ppm) | MOx sensors (MICS series) | Real-time IAQ monitoring in a home using iAir | N/R | Guarda, |
30 sec | N/R | N/R | [86] |
CO and PM2.5 | HAPEX and TZOA-R for PM2.5 |
1 m from an indoor fireplace and 0.6 m above the ground | DustTrak DRX (Model 8534) |
Non-smoking private single-family house/Spain | 5 min | R2 up to 85 | 5 days | [87] |
CO and PM2.5 | HAPEX (PM2.5) and |
Main living area at least at 1 m above the ground | SKC pump (Model Universal PCXR8) | 4 households located in 4 villages/India |
5 min | N/R | 1 week | [87] |
PM2.5 (25 µg m−3), tVOC (300 ppb), CO2 (1300 ppm), T (40 °C), and RH (60%) | Foobot kit | An occupied bedroom (floor area |
GrayWolf TG-502 TVOC, IQ-410, and PC-3016A | Glasgow, UK | 5 min | A significant agreement with the GrayWolf |
81 h 25 min (from 28 August 23:50 LT to 1 September 2017 11:25 LT) | [88] |
PM, CO, O3, NO2, noise, temp. |
A dust sensor (Sharp, Model DN7C3CA006, Osaka, Japan) |
DataRAM 1500 Aerosol Monitor (Thermo Fisher Scientific., pDR, Shoreview, MN, USA) for PM |
Within the fabrication area of a manufacturing facility | 5 min | 0.98 to 0.99 for particle mass densities |
Three months | [89] | |
PM | Wireless PM sensor, Sharp GP2Y1010AU0F | Approximately 29 m2 of floor area | TSI Sidepak AM 150 (TSI Inc., Minnesota, USA) | 2 kitchens in Raipur, India | Sidepak 1 Hz and sensors 0.25 Hz | 0.71 | Multiple days at the two households | [35] |
Light (0.1 to 40,000 Lux), T (−55–80 °C), RH (0–100%), CO2 (0–10,000 ppm) | TAOS TSL2561, Onset HOBO (NTC thermistor, Sensirion SHT15), SenseAir K-30 | IAQ in an educational building | N/R | Two locations at Illinois Institute of Technology in Chicago, IL | 1 min | N/R | 1 week | [33] |
1 Automatically control building climate control systems (air purifiers, kitchen hoods, bathroom or whole house fans, operable windows, or dampers in the mechanical room), when measured pollutant levels are higher than acceptable levels. * Concentration values are in mg m−3. Note: N/R represents not reported.
Table 5A summary of recent IAQ modelling studies that use various machine learning techniques for the prediction of IAQ parameters. Studies carried out in offices, schools and commercial buildings are excluded as their layouts could be different from residential buildings.
Input to the Model | Model Output | Environment, Location, Date, Time | Model | Correlation | Recommendations | Reference |
---|---|---|---|---|---|---|
Time, in/out temp. and RH | Temp. and RH | A test house (147.8 m2 × 3 m) in Finland | Artificial neural networks or ANN (non-linear autoRegressive with eXternal input (NNARX) model and genetic algorithm were employed to construct networks) | Correlation coefficients 0.998 and 0.997 for temp. and RH | Three-layer feed-forward ANN is capable of predicting any nonlinear relations even in a complex situation where (i) some impact factors are still unclear and (ii) some important information is unavailable. | [175] |
Resident activities | CO2 | Two smart apartments and a smart workplace, Washington State University, Pullman, WA, USA | Naïve Bayes, ANN and Decision tree | N/S | The decision tree algorithm did perform best in many examined cases. | [176] |
Indoor temp. |
CO2 | 8 apartments (4 bedrooms and 6 living rooms) located in Kuopio, Finland, from May to October 2011. Measurements were taken every 10 s. | ANN (based on multilayer perceptron network) | R2 ≤ 0.39 ± 0.02 | The prediction of CO2 is difficult, if it is based only on measurements of RH and temp. | [168] |
Temp., internal PM2.5 sources, window opening | PM2.5 | Dwelling (single-storey flats in England during October to May) | ANN (feed-forward) | R2 between 0.84 to 0.90 | ANN is able to accurately predict IAQ from a reduced set of input variables. | [177] |
Temp. and RH | CO2 | Two rooms named as R203 and R204 of Smart Home | Decision tree regression/random forest | Accuracy of 46.25 ppm | It is possible to use the Random Forest method with sufficient accuracy in CO2 estimation on the basis of the internal and external temp./RH, the time and date as the input parameters. | [178] |
Particle deposition parameters | PM2.5 | N/R | ANN (multilayer |
N/R | ANN gives an average relative error of <5% | [179] |
PM2.5, PM10, |
Indoor airborne culturable bacteria | Data were measured in various buildings in Baoding, a city that suffers from PM2.5 pollution in China. | General |
N/R | A machine learning-based method can estimate the concentration of indoor airborne culturable bacteria. |
[180] |
Ambient PM2.5 with 10 and 80 min of lag time | PM2.5 | Indoor and ambient PM2.5 in 13 households in Beijing, China. | Exponential regression | R2~0.87 | The PM2.5 concentrations can be predicted based on ambient measurements. |
[181] |
Temp. |
CO2 | In a room for almost a week (starting from 11 February 2015, to 18 February 2015) in Mons, Belgium | ANN (multilayer |
<17 ppm |
Open-loop and five-steps-ahead prediction networks had better MSE performances, higher sensitivity and specificity values vs. open- and closed-loop models. |
[166] |
One dependent variable and 87 potential predictor variables | PM2.5 | 7-day PM2.5 measurements inside the homes of pregnant women from January 2014 to December 2015 in Ulaanbaatar, Mongolia | Multiple linear regression |
Multiple linear regression (R2 = 50.2%) and random forest regression (R2 = 48.9%) | The improved performance of blended multiple linear regression/random forest regression models in predicting indoor air pollution. | [164] |
PM10, PM2.5, CO2, temp., and RH | PM1 | Real-time daily IAQ monitoring in 10 households during March 2014 to July 2014, India. | Multiple linear regression | R2 = 0.81–0.98 | Multiple linear regression models were found to perform satisfactorily as indicated by 0.92 < index of agreement < 0.99 and 0.81 < R2 < 0.98. | [182] |
Ambient PM2.5 and questionnaire-elicited information | PM2.5 | Daily average of PM2.5 during 3 consecutive days in summer and winter for 116 households in Hong Kong | Linear mixed regression | R2 = 0.61 by cross-validation | The fitted linear mixed-effects model is moderately predictive for the observed indoor PM2.5. | [183] |
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The evolution of low-cost sensors (LCSs) has made the spatio-temporal mapping of indoor air quality (IAQ) possible in real-time but the availability of a diverse set of LCSs make their selection challenging. Converting individual sensors into a sensing network requires the knowledge of diverse research disciplines, which we aim to bring together by making IAQ an advanced feature of smart homes. The aim of this review is to discuss the advanced home automation technologies for the monitoring and control of IAQ through networked air pollution LCSs. The key steps that can allow transforming conventional homes into smart homes are sensor selection, deployment strategies, data processing, and development of predictive models. A detailed synthesis of air pollution LCSs allowed us to summarise their advantages and drawbacks for spatio-temporal mapping of IAQ. We concluded that the performance evaluation of LCSs under controlled laboratory conditions prior to deployment is recommended for quality assurance/control (QA/QC), however, routine calibration or implementing statistical techniques during operational times, especially during long-term monitoring, is required for a network of sensors. The deployment height of sensors could vary purposefully as per location and exposure height of the occupants inside home environments for a spatio-temporal mapping. Appropriate data processing tools are needed to handle a huge amount of multivariate data to automate pre-/post-processing tasks, leading to more scalable, reliable and adaptable solutions. The review also showed the potential of using machine learning technique for predicting spatio-temporal IAQ in LCS networked-systems.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 Global Centre for Clean Air Research (GCARE), Department of Civil and Environmental Engineering, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford GU2 7XH, UK;
2 Global Centre for Clean Air Research (GCARE), Department of Civil and Environmental Engineering, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford GU2 7XH, UK;
3 Computer Science and Information Systems, BITS Pilani-Hyderabad Campus, Pilani 500078, India;
4 Post-Graduate Program on Computational Modelling and Industrial Technology, SENAI CIMATEC University Centre, 41650-010 Salvador, BA, Brazil;