
An Open, Low-Cost IoT Weather and Air-Quality Monitoring Node: Design, Methods, and Evaluation

DOI : https://doi.org/10.5281/zenodo.18103624


 


Rafid Mohammed Khaleefah

Information and Communication Technology Centre, University of Basrah. Basrah, Iraq.

Abstract– This paper presents the design, implementation, and evaluation of a low-cost IoT weather and air-quality monitoring node built around commodity sensors (BMP280, MQ-135). Telemetry is published over MQTT and HTTP(S), and ThingSpeak dashboards support online visualization and MATLAB analytics. We document the bill of materials, firmware workflow, data schemas, and an evaluation method for accuracy, latency, availability, and energy, and we position the node against the state-of-the-art by comparing it to recent IoT pipelines [1]-[7]. Field trials collected multi-day traces showing mean absolute error below typical low-cost sensor envelopes for temperature and humidity (≤3 %RH), stable pressure estimation, and end-to-end latency of 1.1-1.9 s at 15-60 s sampling cadences. We release circuits, code, and cloud recipes for researchers building low-cost meteorological stations [8]-[12].

I. Introduction

Low-cost IoT weather stations combine commodity sensors, wireless connectivity, and cloud platforms to deliver continuous measurement, quality control (QC), and near-real-time visualization. By streaming telemetry from heterogeneous devices into a coherent data fabric, they enable analytics ranging from simple persistence models to learned forecasters. What is often missing in the literature, however, is an end-to-end, reproducible account of wiring, firmware state transitions, payload schemas, and failure recovery that also quantifies the trade-offs among publishable KPIs.

Reference-grade stations recommended by the World Meteorological Organization are costly to maintain and are often deployed kilometers apart. This spacing cannot resolve micro-meteorological variability in urban canyons and coastal zones, and national networks cannot support hyper-local applications such as scheduling outdoor events and triaging heat-health risks [6],[8],[12]. In response, nodes built on commodity sensors and embedded microcontrollers cost orders of magnitude less than reference systems [12] and suit single-investigator deployments, such as campus labs, where budgets are constrained and where audits by peer reviewers matter. The node described here can be extended with particulate-matter sensing.

This paper contributes the design and evaluation of a full stack (hardware, firmware, telemetry, QC, and visualization) organized around five design principles, including schema-first data, robust telemetry, and affordability. We provide a detailed bill of materials, a documented topic hierarchy and payload schema, and shareable dashboards [1],[3],[4]. We show that, with modest calibration, temperature, humidity, and pressure measurements from low-cost sensors fall within datasheet envelopes, and that MQTT's session semantics deliver reliable transport for frequent, small telemetry frames [1]. We position the prototype against recent systems [6],[8],[12], highlighting where our approach aligns with and extends prior work.

II. Related Work

Device integration. Prior work demonstrates integrated low-cost nodes for environmental measurements. [5] deploys a multi-sensor suite in a harsh environment; [3] emphasizes a portable Arduino-based design; other designs target remote-area operation via cellular links; [6] pairs particulate-matter sensors with a LoRaWAN backhaul validated at city scale; and low-cost Arduino builds [9] stress component costs and reproducibility. The review in [8] surveys low-cost IoT weather prediction and monitoring.

Networking and cloud ingestion. MQTT and HTTP(S) are the dominant transports. MQTT adds little per-message overhead, supports QoS levels and retained "last-will" messages, and is preferred for frequent small payloads. HTTP(S) is straightforward to integrate with cloud platforms like ThingSpeak but incurs greater per-request overhead [1].

III. System Architecture and Methods

Hardware and assembly. The core is a Wi-Fi-enabled microcontroller (ESP8266/ESP32 class) wired to low-cost sensors: a BMP280 for pressure and temperature, an MQ-135 for air quality, and a digital humidity sensor. Sensors are sampled at a configurable cadence (15-60 s), calibrated readings are encoded as JSON, and frames are published over MQTT or HTTPS.

Cloud ingestion and QC. The back-end stores raw frames and a curated, QC'd series. QC applies (i) hard range filters; (ii) a Hampel outlier filter (window 9-11 samples, 3-3.5σ); (iii) step-test flags (e.g., fan proximity); (iv) resampling to a uniform cadence with forward fill for short gaps; and (v) derivation of heat index and dew point (Magnus formula). Basic availability and latency metrics are computed per hour and per day from device/server timestamps. Simple persistence and damped-trend baselines provide reference forecasts for future edge-modelling work [2].
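The persistence and damped-trend reference forecasts mentioned above can be sketched in a few lines of Python; function names and smoothing parameters are illustrative, not taken from the paper's codebase:

```python
from typing import List

def persistence_forecast(history: List[float], horizon: int = 1) -> List[float]:
    """Naive baseline: every future value equals the last observation."""
    return [history[-1]] * horizon

def damped_trend_forecast(history: List[float], horizon: int = 1,
                          alpha: float = 0.5, beta: float = 0.3,
                          phi: float = 0.9) -> List[float]:
    """Holt's linear exponential smoothing with a damped trend."""
    level, trend = history[0], 0.0
    for x in history[1:]:
        prev_level = level
        level = alpha * x + (1 - alpha) * (level + phi * trend)
        trend = beta * (level - prev_level) + (1 - beta) * phi * trend
    # h-step-ahead forecasts damp the trend geometrically
    out, damp = [], 0.0
    for h in range(1, horizon + 1):
        damp += phi ** h
        out.append(level + damp * trend)
    return out
```

The damped trend avoids the runaway extrapolation of a plain linear trend, which suits slowly varying micro-meteorological series.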

Fig. 2. MQTT topic hierarchy used in this study, showing root/domain/device segments and two publishing options (JSON vs. individual streams), with QoS1 and retained status.
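A minimal sketch of how a node might publish into the hierarchy of Fig. 2. The topic segments, broker hostname, and device ID are illustrative assumptions, and the publishing part requires the third-party paho-mqtt package and a reachable broker:

```python
import json
import time
from typing import Optional

ROOT = "weather"      # hypothetical root segment
DOMAIN = "basrah"     # hypothetical domain segment

def topic_for(device_id: str, stream: Optional[str] = None) -> str:
    """Build root/domain/device[/stream] topics per the Fig. 2 hierarchy."""
    base = f"{ROOT}/{DOMAIN}/{device_id}"
    return f"{base}/{stream}" if stream else base

def json_payload(seq: int, temp_c: float, rh: float, press_hpa: float) -> str:
    """One JSON frame carrying all measurements (the 'JSON' option in Fig. 2)."""
    return json.dumps({"seq": seq, "ts": int(time.time()),
                       "temp_c": temp_c, "rh": rh, "press_hpa": press_hpa})

if __name__ == "__main__":
    import paho.mqtt.client as mqtt
    client = mqtt.Client()
    # Retained last-will marks the node offline if the session dies unexpectedly.
    client.will_set(topic_for("node01", "status"), "offline", qos=1, retain=True)
    client.connect("broker.example.org", 1883, keepalive=60)
    client.publish(topic_for("node01", "status"), "online", qos=1, retain=True)
    client.publish(topic_for("node01"), json_payload(1, 24.5, 41.0, 1009.8), qos=1)
```

The retained status topic is what makes simple fleet-health dashboards possible: any late subscriber immediately sees each node's last known state.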

On the HTTP path, measurements are mapped to ThingSpeak channels/fields, mindful of rate limits (≈15 s minimum per channel).
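A sketch of this HTTP path, assuming the public ThingSpeak update endpoint and a hypothetical API key; the local guard simply sleeps so that successive POSTs to the same channel respect the ~15 s minimum:

```python
import time
import urllib.parse
import urllib.request

THINGSPEAK_URL = "https://api.thingspeak.com/update"
MIN_INTERVAL_S = 15.0   # per-channel minimum update interval

_last_post = 0.0        # monotonic time of the previous POST

def map_fields(temp_c: float, rh: float) -> dict:
    """Map measurements onto the channel's field1/field2 slots."""
    return {"field1": f"{temp_c:.2f}", "field2": f"{rh:.1f}"}

def post_update(api_key: str, fields: dict) -> bytes:
    """POST one update, blocking locally so the channel rate limit is respected."""
    global _last_post
    wait = MIN_INTERVAL_S - (time.monotonic() - _last_post)
    if wait > 0:
        time.sleep(wait)
    data = urllib.parse.urlencode({"api_key": api_key, **fields}).encode()
    with urllib.request.urlopen(THINGSPEAK_URL, data=data, timeout=10) as resp:
        _last_post = time.monotonic()
        return resp.read()   # the new entry id, or b"0" if the update was rejected
```

Usage: `post_update("YOUR_WRITE_KEY", map_fields(24.5, 41.2))`, where the key placeholder is, of course, illustrative.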

Fig. 3. ThingSpeak channel and field configuration (site: Baghdad), showing the creation of a ThingSpeak channel and the mapping of fields.

Fig. 4. MATLAB Analysis output showing humidity as dew point and a gauge visualization.

Both paths can be enabled simultaneously; a monotonic sequence and server timestamps allow reconciliation of duplicates. TLS is enforced for HTTPS; MQTT can run over TLS where a broker supports it; otherwise, we rely on WPA2 network security and the absence of sensitive personal data in payloads.
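The reconciliation rule can be sketched as follows. The frame layout is an assumption; the text specifies only deduplication via the monotonic sequence number and server timestamps:

```python
from typing import Dict, List

def reconcile(frames: List[dict]) -> List[dict]:
    """Merge dual-path frames, keeping the earliest server receipt per seq.

    Each frame is a dict with 'seq', 'server_ts', and 'path' ('mqtt'/'https').
    """
    best: Dict[int, dict] = {}
    for f in frames:
        cur = best.get(f["seq"])
        if cur is None or f["server_ts"] < cur["server_ts"]:
            best[f["seq"]] = f
    return [best[s] for s in sorted(best)]
```

Keeping the earlier receipt also yields a free per-frame latency comparison between the two transports.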

Calibration and co-location. We recommend a two-point humidity check (e.g., saturated salt solutions) and a single-point temperature offset (ice-water bath proximity) prior to deployment, followed by a 24-h co-location with a known-good station. Linear corrections (gain/offset) are persisted in non-volatile memory and versioned in metadata. Pressure is compared against a METAR reference with altitude correction. These practices reflect guidance and experience summarized across [5],[10],[12].
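A sketch of the two-point fit and the versioned linear correction. The saturated-salt reference values used in the usage note below (LiCl ≈ 11.3 %RH, NaCl ≈ 75.3 %RH near room temperature) are a common choice and an assumption here, not the paper's exact procedure:

```python
import json
from typing import Tuple

def fit_two_point(raw_lo: float, ref_lo: float,
                  raw_hi: float, ref_hi: float) -> Tuple[float, float]:
    """Solve gain/offset so both reference points are reproduced exactly."""
    gain = (ref_hi - ref_lo) / (raw_hi - raw_lo)
    return gain, ref_lo - gain * raw_lo

def apply_cal(raw: float, gain: float, offset: float) -> float:
    """Linear correction persisted in non-volatile memory: gain * raw + offset."""
    return gain * raw + offset

def cal_record(channel: str, gain: float, offset: float,
               valid_from: str, version: int) -> str:
    """Coefficients plus valid-from metadata, versioned into every message."""
    return json.dumps({"channel": channel, "gain": gain, "offset": offset,
                       "valid_from": valid_from, "cal_version": version})
```

For example, `fit_two_point(raw_at_licl, 11.3, raw_at_nacl, 75.3)` yields the RH gain/offset pair to store.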

Evaluation plan. We quantify accuracy (MAE), latency (timestamp delta), packet delivery ratio (PDR), availability (connected fraction), and energy draw. We vary sampling cadence (15-60 s) and assess both indoor and outdoor contexts, annotating disturbances (door/window events, sun exposure) for interpretation. To support replication, we publish the message schema, QC thresholds, and plotting scripts alongside raw CSV exports.
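These KPIs can be computed directly from paired device/server logs; a minimal sketch with illustrative function names:

```python
import statistics
from typing import List

def mae(obs: List[float], ref: List[float]) -> float:
    """Mean absolute error against the co-located reference."""
    return sum(abs(a - b) for a, b in zip(obs, ref)) / len(obs)

def median_latency(device_ts: List[float], server_ts: List[float]) -> float:
    """End-to-end latency: server receive timestamp minus device timestamp."""
    return statistics.median(s - d for d, s in zip(device_ts, server_ts))

def pdr(delivered: int, published: int) -> float:
    """Packet delivery ratio, in percent."""
    return 100.0 * delivered / published

def availability(connected_s: float, window_s: float) -> float:
    """Fraction of the experiment window with a live connection."""
    return connected_s / window_s
```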

Field definitions, units, and update intervals used in this study.

Field | Unit | Description | Update Interval
field1: Temperature | °C (also logged in °F) | Ambient air temperature; converted to °C for analytics | 15-60 s
field2: Humidity | %RH | Relative humidity at sensor location | 15-60 s

Data sets and annotations. For each run, we archived device-side logs (boot banners, Wi-Fi RSSI, retries), server-side receive times, and QC’d series. Events such as window opening, fan activation, or sunlight exposure were annotated in a lab log to contextualize steps in the traces. Sampling cadences of 15, 30, and 60 s were tested for 2-6 h sessions, yielding thousands of samples per run.

Reference instrumentation. A calibrated hand-held thermo-hygrometer (±0.3 °C, ±2 %RH) was used for spot checks; barometric pressure was cross-checked with airport METAR reports corrected for site elevation. While not a fully traceable reference, this arrangement supports the practical MAE calculations reported below and mirrors co-location practice in [5],[10].

A. Statistical Methods and Derived Indices

QC uses a Hampel filter with window w and threshold k. Given a series x_t, the median m_t and median absolute deviation MAD_t are computed within the window; points where |x_t − m_t| > k·1.4826·MAD_t are flagged. Dew point is estimated with the Magnus formula: Td = (b·γ)/(a − γ), with γ = ln(RH/100) + a·T/(b + T), a = 17.62, b = 243.12 °C; heat index follows NOAA's regression for T > 27 °C and RH > 40%. Latency is computed as server_receive_ts − device_ts; availability is the fraction of time since the first sample with non-missing status heartbeats.
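The Magnus dew-point estimate transcribes directly into code (coefficients a = 17.62, b = 243.12 °C from the text):

```python
import math

A, B = 17.62, 243.12  # Magnus coefficients; B is in deg C

def dew_point_c(temp_c: float, rh_pct: float) -> float:
    """Td = B*gamma/(A - gamma), gamma = ln(RH/100) + A*T/(B + T)."""
    gamma = math.log(rh_pct / 100.0) + A * temp_c / (B + temp_c)
    return B * gamma / (A - gamma)
```

A useful sanity check: at saturation (RH = 100%) the formula collapses to Td = T, so the dew point equals the air temperature exactly.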

  1. Results

    Accuracy. Across three indoor and two outdoor sessions, temperature MAE versus the handheld reference ranged 0.4-0.6 °C after a single-point offset; relative humidity MAE was 2.0-3.0 %RH with a two-point adjustment; pressure bias remained within 1.0-1.5 hPa versus the METAR baseline. These values fall within expected datasheet envelopes for low-cost sensors when properly shielded and calibrated [5],[12].

Latency and reliability. Median end-to-end latency (device timestamp to server receive) was 1.1-1.4 s for MQTT (QoS1) and 1.4-1.9 s for HTTPS POST at 30-60 s cadence on an uncongested Wi-Fi network. Packet delivery ratio exceeded 99.2% for MQTT and 98.5% for HTTPS. Reconnection after transient AP loss favored MQTT due to keep-alive and session semantics; HTTPS incurred TLS handshakes more frequently. These findings echo prior comparative studies [1].

Table III. Summary of key quantitative metrics comparing MQTT and HTTPS.

Metric | MQTT (QoS1) | HTTPS
Temperature MAE (°C) | 0.6 | 0.6
Humidity MAE (%RH) | 3.0 | 3.0
Pressure bias (hPa) | 1.5 | 1.5
Median latency (s) | 1.4 | 1.9
PDR (%) | 99.2 | 98.5
Availability (%) | 99.0 | 99.0

Notes: MAE computed against the handheld/reference observations; median latency measured from device timestamp to cloud persistence; PDR = delivered/published × 100%; availability reflects end-to-end connectivity over the experiment window.

    Availability and behavior. The node sustained >99% connectivity over multi-hour runs. Outdoor diurnal cycles exhibited ~8-10 °C amplitude with humidity inversions (higher RH at night). Indoor perturbations (door opening, fan use) appeared as clear steps and transient micro-gust signatures in the humidity channel. Pressure traces were smooth, modulated by synoptic-scale changes, validating sensor stability.

Alerts were implemented via PushBullet: when a derived metric exceeded a configured threshold, a MATLAB routine posted a notification to a device/channel, which immediately appeared on the paired smartphone, as shown in Fig. 6.

    Fig. 5. Plot of the weather channel data in an hourly average format for the last 24 hours in Baghdad (BGD).

    Fig. 6. Alert pipeline: QC’d telemetry triggers a MATLAB threshold evaluator, which posts to a PushBullet device/channel and raises an immediate smartphone notification (non-breach events are logged only).

    A. Error Budget Analysis

    Temperature error arises from sensor quantization, calibration residuals, and radiation bias inside the shield. Humidity error includes hysteresis and temperature-dependent bias; pressure error is dominated by absolute offset and weather-system variability. In our runs, the dominant contributors were radiation bias (outdoors) and RH hysteresis after rapid humidity steps. Practical mitigations include aspirated shields, slower sampling during stabilization, and periodic re-zeroing in a sealed bag with desiccant for the RH sensor.

  2. Discussion

    Measurement fidelity. The observed MAEs suggest that with modest screening and co-location, common low-cost sensors yield decision-quality meteorological data for many applications. Remaining biases reflect micro-environmental differences and radiation loading; we recommend an aspirated shield for outdoor deployments and redundant humidity sensing to detect drift [12].

    Protocol choice. For frequent, small payloads, MQTT is technically superior to HTTPS due to reduced header overhead and QoS/retained semantics; HTTPS remains attractive when integrating with purely REST-based platforms or when messages are infrequent and human-readable logs are desired [1]. Our dual-path design allows easy A/B measurement and graceful degradation.

    Analytics stack. A pragmatic QC pipeline (range checks, Hampel filtering, resampling) catches the majority of spurious readings while preserving sharp changes that carry physical meaning (e.g., step changes when a window opens). Edge prediction is feasible on ESP32-class devices, but we recommend starting with persistence/damped-trend baselines and migrating compute to a nearby SBC (e.g., Raspberry Pi) if model complexity increases [2],[7].

Comparison to prior work. Our measurements align with [3],[4],[6] in demonstrating that low-cost, open designs can support actionable monitoring when paired with diligent QC and calibration. For PM sensing extensions, best practice includes frequent zero checks, humidity compensation, and periodic co-locations [10],[11], practices compatible with our pipeline but left for future work.

    1. Platform Comparison

ESP8266 remains cost-efficient but has less RAM/CPU headroom than ESP32, which eases TLS and local analytics. Raspberry Pi-class SBCs are over-provisioned for a single station but ideal as nearby edge aggregators hosting the MQTT broker and dashboards. On the cloud side, ThingSpeak excels for rapid prototyping with MATLAB analytics; custom stacks (MQTT+InfluxDB/TimescaleDB+Grafana) suit multi-node fleets; Ubidots and similar SaaS options trade configurability for speed of deployment.

Table IV. Platform comparison for IoT weather pipelines: a qualitative comparison of integration effort, latency, and analytics.

Platform | Ease of Integration | Latency (qual.) | Analytics
ThingSpeak | High (REST/MQTT; built-in MATLAB) | Low (small-scale, near real-time) | MATLAB Analysis
AWS IoT Core | Medium (IAM/services setup) | Low-Moderate | Kinesis/Timestream/SageMaker
Azure IoT Hub | Medium (enterprise-oriented) | Low-Moderate | Azure Stream Analytics/ML

  3. Recommendations

• Prefer MQTT (QoS1) for routine telemetry; keep HTTPS as a fall-back path.
    • Enforce schema versioning and include calibration coefficients and valid-from metadata in every message.
    • Use shielded enclosures; for outdoors, adopt multi-plate or aspirated shields; avoid direct solar load.
    • Perform two-point RH and single-point °C checks before deployment; revisit co-location quarterly.
    • Budget for PM sensor aging, humidity correction, and periodic field checks if air-quality is added [10],[11].
    • Centralize metrics dashboards (latency, PDR, uptime) to catch regressions quickly.
    • Archive raw frames and QC’d series; publish CSV and analysis notebooks for transparency.

• PM calibration: the PM module is not validated against a reference; co-locate with a reference counter and apply humidity correction.
• LoRaWAN backhaul: rural deployment range is limited without one; add a LoRaWAN gateway and compare duty-cycle constraints.

      1. Implementation Checklist

        • Assemble the node and verify sensor IDs;
        • Flash firmware;
        • Configure Wi-Fi, API keys, and MQTT topics;
        • Perform two-point RH and one-point °C checks;
        • Run a 24-h co-location;
        • Enable dual-path telemetry;
        • Review dashboards for latency/PDR outliers;
        • Archive raw/QC’d data and calibration coefficients;
        • Prepare a short deployment report with photos and site notes.
  4. Threats to Validity

Internal validity threats include the short co-location periods and reliance on handheld references; although calibrated, they are not traceable standards. External validity is limited by testing under favorable Wi-Fi conditions and a single geographic site; results may differ under congested networks or extreme climates. Construct validity could be impacted by unmodeled airflow and radiation bias; the recommended aspirated shield should mitigate these effects in future work.

Table V. Threats to Validity (Summary). Threat categories, evidence, and mitigations summarized from the Discussion.

5. Risk Assessment and Mitigation

Operational risks include power loss, Wi-Fi outages, API key leakage, and sensor drift. Mitigations: buffer a few samples locally; implement exponential back-off; rotate keys; store credentials in a separate header; alarm on stale data; schedule quarterly calibration checks; and maintain a spare node for hot-swap during failures.

  6. Practical Lessons Learned

(1) Shielding dominates temperature fidelity outdoors; cheap shields work indoors but underperform in full sun. (2) Dual-path telemetry simplifies A/B testing and helps diagnose backend issues; keep sequence numbers and timestamps consistent. (3) MQTT retain/last-will messages are invaluable for simple fleet health dashboards. (4) RH sensors exhibit hysteresis; slow ramp tests expose this behavior and guide calibration. (5) Document everything: wiring photos, sketch versions, and channel mappings; reviewers and future maintainers will thank you for this step.

Threat Category | Evidence | Mitigation
Internal validity | Short co-locations; handheld reference not traceable | Longer co-locations; borrow a traceable instrument
External validity | Single site (Baghdad) and favourable Wi-Fi conditions | Multi-site trials; congestion stress tests
Construct validity | Radiation bias and airflow effects on T/RH | Aspirated shield; redundant RH probes; step tests

  7. Conclusion and Future Work

We engineered and evaluated a reproducible IoT weather/air-quality node that delivers accurate measurements and robust telemetry using commodity components. The contributions include a full BOM, a firmware state machine and schema, dual-path telemetry, and a pragmatic QC/analytics stack. Results meet practical accuracy envelopes and corroborate findings across the literature [1]-[12]. Future work will extend the node with wind/rain sensors and PM modules, integrate an edge-forecasting block (e.g., NARX/XGBoost) [7], add LoRaWAN for rural backhaul [6], and conduct multi-week co-locations against reference stations to refine calibration models. We could also extend the node toward PWV estimation by adopting surface-based algorithms reported for BME280-equipped low-cost AWSs [9].

Table VI. Study limitations, with impact and proposed workarounds.

Limitation | Impact | Workaround / Future fix
No wind/rain sensors in current build | Cannot relate micro-climate dynamics to wind/rain events | Add anemometer + tipping-bucket rain gauge
No PM calibration | PM module not validated against a reference | Co-locate with a reference counter; humidity correction
No LoRaWAN backhaul | Limited rural deployment range | Add LoRaWAN gateway; compare duty-cycle constraints

  8. Detailed Future Work Plan

Phase 1 (months 1-2): integrate a PM sensor (SDS011 or PMS7003), add humidity correction, and validate against a reference counter for 7-10 days. Phase 2 (months 3-4): implement edge forecasting on an ESP32-S3 or Raspberry Pi Zero using persistence, ARIMA, and a compact NARX/XGBoost baseline; evaluate RMSE/MAE vs. naive baselines on rolling windows. Phase 3 (months 5-6): extend the backhaul with LoRaWAN for two remote nodes, compare coverage and duty-cycle implications, and perform a week-long outage-tolerance study. Phase 4 (month 7): automate calibration reminders and dashboard alarms for stale data and abnormal drifts.

  9. Expanded Recommendations

Document site metadata (GPS, elevation, shading, nearby heat sources); keep a per-node CHANGELOG; version firmware and schema jointly; prefer UTC everywhere; use idempotent HTTP endpoints or deduplicate on the server using seq+ts; visualize latency and PDR on the same time axis as temperature/humidity to spot coupling; treat dashboards as part of the scientific artifact; export figure PNG/SVG directly from code to avoid manual plotting errors; store raw data indefinitely and derive curated datasets programmatically to ensure provenance.

10. Appendix B – Algorithms and Pseudocode for the QC Pipeline

Function QC(series, w, k): for each channel c in {temp, rh, press}: resample to cadence τ; compute rolling median m_t and MAD_t; for each t: if |x_t − m_t| > k·1.4826·MAD_t then flag; compute dew_point(T, RH) and heat_index(T, RH); return curated series with flags.
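The Hampel step of this routine can be sketched in Python; truncating (rather than padding) the window at the series edges is an implementation assumption:

```python
import statistics
from typing import List

def hampel_flags(series: List[float], w: int = 11, k: float = 3.0) -> List[bool]:
    """Flag points where |x_t - m_t| > k * 1.4826 * MAD_t in a centered window."""
    half = w // 2
    flags = [False] * len(series)
    for t in range(len(series)):
        window = series[max(0, t - half): t + half + 1]  # truncated at the edges
        m = statistics.median(window)
        mad = statistics.median([abs(x - m) for x in window])
        # mad == 0 means a locally constant signal; nothing to flag there
        if mad > 0 and abs(series[t] - m) > k * 1.4826 * mad:
            flags[t] = True
    return flags
```

Because the filter compares against a local median, genuine step changes (e.g., a window opening) survive while isolated spikes are flagged.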

Parameter choices used in this study: window w = 11 samples, k = 3.0; τ ∈ {15, 30, 60} s.

Appendix C – Firmware State Machine Pseudocode: state BOOT → CONNECT (Wi-Fi, NTP) → SAMPLE (read sensors, apply cal) → ENCODE (JSON) → PUBLISH (MQTT QoS1; HTTPS POST) → SLEEP(τ − elapsed). On failure, exponential back-off with jitter; a watchdog resets on stall.
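The failure path's back-off with jitter can be sketched as the "full jitter" variant, where each delay is drawn uniformly below an exponentially growing, capped ceiling; base and cap values here are illustrative:

```python
import random
from typing import List, Optional

def backoff_delays(attempts: int, base: float = 1.0, cap: float = 60.0,
                   rng: Optional[random.Random] = None) -> List[float]:
    """'Full jitter' back-off: delay_n ~ Uniform(0, min(cap, base * 2**n))."""
    rng = rng or random.Random()
    return [rng.uniform(0.0, min(cap, base * 2.0 ** n)) for n in range(attempts)]
```

The jitter decorrelates retries across nodes so that a fleet recovering from the same AP outage does not hammer the broker in lockstep.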

  11. Reproducibility Statement

    We provide message schemas, QC thresholds, and plotting scripts. All parameters required to reproduce figures (window sizes, thresholds, sampling cadences) are fixed in the text to enable exact replication by reviewers. Any future revisions will bump the schema version and enumerate changes in a CHANGELOG.

  12. Ethical and Responsible Deployment

Weather and air-quality data can influence public behavior. Publish prominent caveats about low-cost sensor limitations, calibration dates, and intended use. Avoid implying regulatory equivalence with reference-grade stations. When sharing geolocated data, consider jittering coordinates or aggregating temporally/spatially to protect privacy of building occupants.

References

  1. Fahim, M., El Mhouti, A., Boudaa, T., Jakimi, A., Modeling and implementation of a low-cost IoT-smart weather monitoring station and air quality assessment based on fuzzy inference model and MQTT protocol, Modeling Earth Systems and Environment, 2023. doi:10.1007/s40808-023-01701-w.
2. İşler, B., Kaya, Ş. M., Kılıç, F. R., Fog-Enabled Machine Learning Approaches for Weather Prediction in IoT Systems: A Case Study, Sensors, 2025. doi:10.3390/s25134070.
  3. Michailidis, I., et al., An Arduino-Based, Portable Weather Monitoring System, Electronics, 2025. doi:10.3390/electronics14122330.
  4. Albuali, A., et al., Scalable Lightweight IoT-Based Smart Weather Measurement System, Sensors, 2023. doi:10.3390/s23125569.
  5. Costa Branco, P. J., et al., Low-Cost IoT-Based Sensor System: A Case Study on Harsh Environmental Monitoring, Sensors, 2021. doi:10.3390/s21010214.
  6. Johnston, S. J., et al., City Scale Particulate Matter Monitoring Using LoRaWAN Based Air Quality IoT Devices, Sensors, 2019. doi:10.3390/s19010209.
  7. Moursi, A. S., et al., An IoT enabled system for enhanced air quality monitoring and prediction on the edge, Complex & Intelligent Systems, 2021. doi:10.1007/s40747-021-00476-w.
  8. Nagarajaiah, H., et al., Low-Cost IoT Weather Prediction and Monitoring – A Review, Processes, 2024. doi:10.3390/pr12091961.
9. Suparta, W., Warsita, A., Ircham, A low-cost development of automatic weather station based on Arduino…, IJEECS, 2021. doi:10.11591/ijeecs.v24.i2.pp744-753.

10. Tagliabue, L. C., et al., Development of IoT-Based Particulate Matter Monitoring System for Smart Construction, IJERPH, 2021. doi:10.3390/ijerph182111510.

11. Kalia, P., Ansari, M. A., IoT based air quality and PM concentration monitoring system, Materials Today: Proceedings, 2020. doi:10.1016/j.matpr.2020.02.179.

  12. Ioannou, K., et al., Low-Cost Automatic Weather Stations in the Internet of Things, Information, 2021. doi:10.3390/info12040146.