https://doi.org/10.25678/000CBS
You're currently viewing an old version of this dataset. To see the current version, click here.

Data for: Online monitoring of greywater reuse system using excitation-emission matrix (EEM) and K-PARAFACs

Code and data associated with the manuscript: Online monitoring of greywater reuse system using excitation-emission matrix (EEM) and K-PARAFACs.
Abstract:
A currently increasing interest in water reuse is met with the concern about water quality. Excitation-emission matrix (EEM) measurements, which are widely implemented in laboratory analysis, emerge as a promising tool for characterizing both microbial and chemical water qualities in the online monitoring of water reuse systems. However, the robustness of EEM measurements has been rarely validated in actual online monitoring campaigns where predictions are made for new samples independent of those used to establish EEM analysis models, including the popular parallel factor analysis (PARAFAC). In this study, two strategies of conducting PARAFAC were examined for the online monitoring of a greywater reuse system using two EEM datasets from two monitoring periods for model establishment and model testing respectively. With the first strategy that is commonly used in laboratory analyses, an entire EEM datasets from one period was used to establish one PARAFAC model, and the maximum fluorescence intensity (Fmax) of a PARAFAC component was used to predict total cell count (TCC) in another period. However, under the disturbance of dissolved organic matter (DOM) fluorescence in the background, Fmax gave unreliable predictions in model testing. To address this problem, a second and novel strategy was proposed using an EEM clustering and PARAFAC component shift mining technique. This unsupervised algorithm, named K-PARAFACs, automatically groups EEMs into K clusters and on each cluster establishes a cluster-specific PARAFAC model with distinct component shapes. With this method, multiple PARAFAC models were established on one EEM dataset, with each model representing samples with certain TCC ranges and DOM compositions. In model testing, these cluster-specific PARAFAC models served as EEM classifiers. A new sample was not characterized by Fmax but by the cluster-specific model that best fitted the EEM signal of the sample with the least numerical error. The proposed strategy demonstrates its robustness by successfully predicting the TCC trend in test datasets. Our findings suggest that K-PARAFACs is a promising tool that enables robust qualitative monitoring of water reuse systems with background DOM variability.

Data and Resources

This package has no data

Citation

This Data Package

The associated article

Hu, Y., Morgenroth, E., & Jacquin, C. (2025). Online monitoring of greywater reuse system using excitation-emission matrix (EEM) and K-PARAFACs. Water Research, 268, 122604. https://doi.org/10.1016/j.watres.2024.122604

Metadata

  Publication Data Package for:
Open Data Open Data
Author
  • Hu, Yongmin
  • Morgenroth, Eberhard
  • Jacquin, CĂ©line
Keywords Water reuse,Online monitoring,Microbial quality,PARAFAC,Dissolved organic matter,Excitation emission matrix,UV absorbance
Variables
  • dissolved_organic_carbon
  • flow_cytometric_cell_counts
  • fluorescence
Timerange
  • 2022-01 TO 2022-02
  • 2022-09 TO 2022-10
Review Level domain specific
Curator Hu, Yongmin
Contact Morgenroth, Eberhard <Eberhard.Morgenroth@eawag.ch>
DOI 10.25678/000CBS