https://doi.org/10.25678/000DX1

Hyperspectral data-cubes and reference pollutants of 302 urban wastewater samples

Overview of the experiment

We conducted this experiment to build a dataset of hyperspectral data-cubes of wastewater samples, paired with reference laboratory analyses of several wastewater pollutants. The goal was to train data-driven models that predict pollution levels in a sample from its hyperspectral data-cube. Over ten days, we collected samples from four treatment facilities around Melbourne, Australia: three urban wastewater treatment facilities and one stormwater treatment facility. Sampling took place between 4 August 2024 and 15 August 2024. After sampling, we analysed the samples in the laboratory for reference physical and chemical pollutants and acquired hyperspectral images. To extend the dataset, we also created combined stormwater and wastewater samples, for which we measured a hyperspectral data-cube and some reference pollutants. This repository also includes background information about data pre-processing and validation.

Repository organization: How to use the data?

The repository is organized into numbered folders. Most folders contain a readme.md file in Markdown format that explains their contents. All data are stored in non-proprietary formats: CSV for most files, and ENVI format for the hyperspectral acquisitions (readable in Python). Raw data are kept in their original format and sometimes lack metadata such as units or column descriptions; this information is provided in the corresponding readme.md files. Pre-processed data, by contrast, have consistent column names that include units. Jupyter notebooks for pre-processing and validating the data are included.
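
As a rough illustration of what the ENVI format involves, the sketch below parses the text of an ENVI header (.hdr) file using only the Python standard library and derives the cube dimensions, the wavelength list and the expected size of the accompanying binary file. The header text, its values and the function names parse_envi_header and describe_cube are hypothetical and chosen for illustration; they are not taken from this repository, whose headers may contain different fields.

```python
import re

# Bytes per value for the standard ENVI "data type" codes.
ENVI_BYTES_PER_VALUE = {1: 1, 2: 2, 3: 4, 4: 4, 5: 8, 12: 2, 13: 4, 14: 8, 15: 8}

# Hypothetical header text, for illustration only (not taken from the dataset).
EXAMPLE_HDR = """ENVI
description = {
  hypothetical example header}
samples = 4
lines = 3
bands = 5
header offset = 0
data type = 4
interleave = bil
byte order = 0
wavelength = {
 400.0, 500.0, 600.0,
 700.0, 800.0}
"""

# One entry is "key = value"; a value wrapped in braces may span several lines.
_ENTRY = re.compile(r"^[ \t]*([^=\n]+?)[ \t]*=[ \t]*(\{.*?\}|[^\n]*)", re.S | re.M)


def parse_envi_header(text):
    """Parse the text of an ENVI .hdr file into a dict of string values."""
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "ENVI":
        raise ValueError("not an ENVI header: the first line must be 'ENVI'")
    fields = {}
    for key, value in _ENTRY.findall("\n".join(lines[1:])):
        value = value.strip()
        if value.startswith("{"):
            # Drop the braces and collapse line breaks inside the value.
            value = " ".join(value[1:-1].split())
        fields[key.lower()] = value
    return fields


def describe_cube(fields):
    """Summarise the cube dimensions and the expected binary file size."""
    shape = (int(fields["lines"]), int(fields["samples"]), int(fields["bands"]))
    itemsize = ENVI_BYTES_PER_VALUE[int(fields["data type"])]
    wavelengths = [
        float(w) for w in fields.get("wavelength", "").split(",") if w.strip()
    ]
    return {
        "shape": shape,  # (lines, samples, bands)
        "interleave": fields.get("interleave", "").lower(),
        "expected_bytes": shape[0] * shape[1] * shape[2] * itemsize
        + int(fields.get("header offset", 0)),
        "wavelengths": wavelengths,
    }


info = describe_cube(parse_envi_header(EXAMPLE_HDR))
print(info)
```

Comparing expected_bytes with the actual size of the binary file is a quick integrity check after download. For loading the pixel data itself, an existing reader such as the Spectral Python package or GDAL is likely a better choice than a hand-written parser.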

Citation

Lechevallier, P., Zhu, W., Shi, L., McCarthy, D., & Rieckermann, J. (2025). Hyperspectral data-cubes and reference pollutants of 302 urban wastewater samples (Version 1.0). Eawag: Swiss Federal Institute of Aquatic Science and Technology. https://doi.org/10.25678/000DX1

Metadata

Open Data
Author
  • Lechevallier, Pierre
  • Zhu, Wenchang
  • Shi, Luke
  • McCarthy, David
  • Rieckermann, Jörg
Keywords Wastewater, Hyperspectral imaging, Hyperspectral data-cube, Reference pollution, turbidity, TSS, ammonium, DOC, UV-vis absorbance spectra, VNIR reflectance spectra, TP, TN, EC
Variables
  • ammonium-nitrogen
  • dissolved_organic_carbon
  • electric_conductivity
  • total_nitrogen
  • total_phosphorus
  • total_solids
  • turbidity
Systems
  • Urban Drainage Systems
Timerange
  • 2024-08-04 TO 2024-08-15
Geographic Name(s)
  • Western suburbs of Melbourne, Australia.
Review Level none
Curator Lechevallier, Pierre
Contact Lechevallier, Pierre <pierre.lechevallier@eawag.ch>
DOI 10.25678/000DX1