Data classification methodology for electronic noses using uniform manifold approximation and projection and extreme learning machine
Cite as:
hdl:2117/363208
Document type: Article
Defense date: 2021-12-22
Publisher: Multidisciplinary Digital Publishing Institute (MDPI)
Rights access: Open Access
Except where otherwise noted, content on this work is licensed under a Creative Commons license: Attribution 4.0 International
Project: DESARROLLO Y VALIDACION DE SISTEMAS DE MONITORIZACION INTELIGENTE, ESTRATEGIAS DE CONTROL DEL PITCH Y DE AMORTIGUACION ESTRUCTURAL PARA AEROGENERADORES OFFSHORE FLOTANTES (AEI-DPI2017-82930-C2-1-R)
SIMULACION IN VIVO DEL EFECTO DE LA HIPOXIA Y LA DOSIS DEL FARMACO EN EL CRECIMIENTO DEL GLIOBLASTOMA (AEI-PGC2018-097257-B-C33)
Abstract
The classification and use of robust methodologies in sensor array applications of electronic noses (ENs) remain an open problem. Among the several steps used in the developed methodologies, data preprocessing improves the classification accuracy of this type of sensor. Data preprocessing methods, such as data transformation and data reduction, enable the treatment of data with anomalies, such as outliers and features that do not provide quality information; in addition, they reduce the dimensionality of the data, thereby facilitating the tasks of a machine learning classifier. To help solve this problem, in this study, a machine learning methodology is introduced to improve signal processing and develop methodologies for classification when an EN is used. The proposed methodology involves a normalization stage to scale the data from the sensors, using both the well-known min-max approach and the more recent mean-centered unitary group scaling (MCUGS). Next, a manifold learning algorithm for data reduction is applied using uniform manifold approximation and projection (UMAP). The dimensionality of the data at the input of the classification machine is reduced, and an extreme learning machine (ELM) is used as a machine learning classifier algorithm. To validate the EN classification methodology, three datasets of ENs were used. The first dataset was composed of 3600 measurements of 6 volatile organic compounds performed by employing 16 metal-oxide gas sensors. The second dataset was composed of 235 measurements of 3 different qualities of wine, namely, high, average, and low, as evaluated by using an EN sensor array composed of 6 different sensors. The third dataset was composed of 309 measurements of 3 different gases obtained by using an EN sensor array of 2 sensors. A 5-fold cross-validation approach was used to evaluate the proposed methodology. A test set consisting of 25% of the data was used to validate the methodology with unseen data.
The results showed an average classification accuracy of 1 (that is, every sample was classified correctly) when the MCUGS, UMAP, and ELM methods were used. Finally, the effect of the number of target dimensions chosen for the data reduction was assessed, using the highest average classification accuracy as the criterion.
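As a rough illustration of the scaling, hold-out, and ELM steps named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes NumPy only, uses synthetic three-class data with six features in place of the EN datasets, applies min-max scaling only (MCUGS is not reproduced), and omits both the UMAP reduction and the 5-fold cross-validation. The names `min_max_scale`, `ELMClassifier`, and `make_toy_data` are hypothetical, as are the hidden-layer size and the tanh activation.

```python
import numpy as np


def min_max_scale(X_train, X_test):
    """Scale each feature to [0, 1] using training-set statistics only."""
    lo = X_train.min(axis=0)
    span = X_train.max(axis=0) - lo
    span[span == 0] = 1.0  # guard against constant features
    return (X_train - lo) / span, (X_test - lo) / span


class ELMClassifier:
    """Single-hidden-layer ELM: fixed random hidden weights,
    output weights obtained by least squares (pseudo-inverse)."""

    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        # One-hot encode the targets.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        # Hidden weights and biases are drawn once and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        self.beta = np.linalg.pinv(self._hidden(X)) @ T
        return self

    def predict(self, X):
        scores = self._hidden(X) @ self.beta
        return self.classes_[np.argmax(scores, axis=1)]


def make_toy_data(n_per_class=100, seed=0):
    """Synthetic stand-in for sensor data: 3 well-separated classes, 6 features."""
    rng = np.random.default_rng(seed)
    centers = 6.0 * np.tile(np.eye(3), 2)  # shape (3, 6)
    X = np.vstack([c + rng.normal(size=(n_per_class, 6)) for c in centers])
    y = np.repeat(np.arange(3), n_per_class)
    return X, y


X, y = make_toy_data()

# Hold out 25% of the samples as unseen test data.
idx = np.random.default_rng(1).permutation(len(y))
n_test = len(y) // 4
test_idx, train_idx = idx[:n_test], idx[n_test:]

X_train, X_test = min_max_scale(X[train_idx], X[test_idx])
model = ELMClassifier(n_hidden=50, seed=2).fit(X_train, y[train_idx])
accuracy = float(np.mean(model.predict(X_test) == y[test_idx]))
print(f"hold-out accuracy: {accuracy:.3f}")
```

In a fuller pipeline, a dimensionality-reduction step such as `umap.UMAP(n_components=...)` from the umap-learn package would sit between the scaling and the classifier, fitted on the training portion only.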
Citation: Leon-Medina, J.X. [et al.]. Data classification methodology for electronic noses using uniform manifold approximation and projection and extreme learning machine. "Mathematics", 22 December 2021, vol. 10, no. 1, article 29.
ISSN: 2227-7390
Publisher version: https://www.mdpi.com/2227-7390/10/1/29
Files | Description | Size | Format |
---|---|---|---|
2022_62_MATHEMATICS_leo_par_ana_tib_poz_UMAP.pdf | Main article | 7.150 MB | PDF |