TY  - JOUR
A1  - Abbaspour-Gilandeh, Yousef
A1  - Molaee, Amir
A1  - Sabzi, Sajad
A1  - Nabipour, Narjes
A1  - Shamshirband, Shahaboddin
A1  - Mosavi, Amir
T1  - A Combined Method of Image Processing and Artificial Neural Network for the Identification of 13 Iranian Rice Cultivars
JF  - agronomy
N2  - Due to the importance of identifying crop cultivars, the advancement of accurate assessment of cultivars is considered essential. The existing methods for identifying rice cultivars are mainly time-consuming, costly, and destructive. Therefore, the development of novel methods is highly beneficial. The aim of the present research is to classify common rice cultivars in Iran based on color, morphologic, and texture properties using artificial intelligence (AI) methods. In doing so, digital images of 13 rice cultivars in Iran in three forms of paddy, brown, and white are analyzed through pre-processing and segmentation of using MATLAB. Ninety-two specificities, including 60 color, 14 morphologic, and 18 texture properties, were identified for each rice cultivar. In the next step, the normal distribution of data was evaluated, and the possibility of observing a significant difference between all specificities of cultivars was studied using variance analysis. In addition, the least significant difference (LSD) test was performed to obtain a more accurate comparison between cultivars. To reduce data dimensions and focus on the most effective components, principal component analysis (PCA) was employed. Accordingly, the accuracy of rice cultivar separations was calculated for paddy, brown rice, and white rice using discriminant analysis (DA), which was 89.2%, 87.7%, and 83.1%, respectively. To identify and classify the desired cultivars, a multilayered perceptron neural network was implemented based on the most effective components. The results showed 100% accuracy of the network in identifying and classifying all mentioned rice cultivars. Hence, it is concluded that the integrated method of image processing and pattern recognition methods, such as statistical classification and artificial neural networks, can be used for identifying and classification of rice cultivars.
KW  - Maschinelles Lernen
KW  - Machine learning
KW  - food informatics
KW  - big data
KW  - artificial neural networks
KW  - artificial intelligence
KW  - image processing
KW  - rice
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200123-40695
UR  - https://www.mdpi.com/2073-4395/10/1/117
VL  - 2020
IS  - Volume 10, Issue 1, 117
PB  - MDPI
ER  - 
TY  - JOUR
A1  - Saadatfar, Hamid
A1  - Khosravi, Samiyeh
A1  - Hassannataj Joloudari, Javad
A1  - Mosavi, Amir
A1  - Shamshirband, Shahaboddin
T1  - A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning
JF  - Mathematics
N2  - The K-nearest neighbors (KNN) machine learning algorithm is a well-known non-parametric classification method. However, like other traditional data mining methods, applying it on big data comes with computational challenges. Indeed, KNN determines the class of a new sample based on the class of its nearest neighbors; however, identifying the neighbors in a large amount of data imposes a large computational cost so that it is no longer applicable by a single computing machine. One of the proposed techniques to make classification methods applicable on large datasets is pruning. LC-KNN is an improved KNN method which first clusters the data into some smaller partitions using the K-means clustering method; and then applies the KNN for each new sample on the partition which its center is the nearest one. However, because the clusters have different shapes and densities, selection of the appropriate cluster is a challenge. In this paper, an approach has been proposed to improve the pruning phase of the LC-KNN method by taking into account these factors. The proposed approach helps to choose a more appropriate cluster of data for looking for the neighbors, thus, increasing the classification accuracy. The performance of the proposed approach is evaluated on different real datasets. The experimental results show the effectiveness of the proposed approach and its higher classification accuracy and lower time cost in comparison to other recent relevant methods.
KW  - Maschinelles Lernen
KW  - Machine learning
KW  - K-nearest neighbors
KW  - KNN
KW  - classifier
KW  - big data
KW  - clustering
KW  - cluster shape
KW  - cluster density
KW  - classification
KW  - reinforcement learning
KW  - data science
KW  - computation
KW  - artificial intelligence
KW  - OA-Publikationsfonds2020
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200225-40996
UR  - https://www.mdpi.com/2227-7390/8/2/286
VL  - 2020
IS  - volume 8, issue 2, article 286
PB  - MDPI
ER  - 
TY  - JOUR
A1  - Lashkar-Ara, Babak
A1  - Kalantari, Niloofar
A1  - Sheikh Khozani, Zohreh
A1  - Mosavi, Amir
T1  - Assessing Machine Learning versus a Mathematical Model to Estimate the Transverse Shear Stress Distribution in a Rectangular Channel
JF  - Mathematics
N2  - One of the most important subjects of hydraulic engineering is the reliable estimation of the transverse distribution in the rectangular channel of bed and wall shear stresses. This study makes use of the Tsallis entropy, genetic programming (GP) and adaptive neuro-fuzzy inference system (ANFIS) methods to assess the shear stress distribution (SSD) in the rectangular channel.
To evaluate the results of the Tsallis entropy, GP and ANFIS models, laboratory observations were used in which shear stress was measured using an optimized Preston tube. This is then used to measure the SSD in various aspect ratios in the rectangular channel. To investigate the shear stress percentage, 10 data series with a total of 112 different data for were used. The results of the sensitivity analysis show that the most influential parameter for the SSD in smooth rectangular channel is the dimensionless parameter B/H, Where the transverse coordinate is B, and the flow depth is H. With the parameters (b/B), (B/H) for the bed and (z/H), (B/H) for the wall as inputs, the modeling of the GP was better than the other one. Based on the analysis, it can be concluded that the use of GP and ANFIS algorithms is more effective in estimating shear stress in smooth rectangular channels than the Tsallis entropy-based equations.
KW  - Maschinelles Lernen
KW  - smooth rectangular channel
KW  - Tsallis entropy
KW  - genetic programming
KW  - artificial intelligence
KW  - machine learning
KW  - big data
KW  - computational hydraulics
Y1  - 2021
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20210504-44197
UR  - https://www.mdpi.com/2227-7390/9/6/596
VL  - 2021
IS  - Volume 9, Issue 6, Article 596
PB  - MDPI
CY  - Basel
ER  - 
TY  - JOUR
A1  - Mosavi, Amir
A1  - Shamshirband, Shahaboddin
A1  - Esmaeilbeiki, Fatemeh
A1  - Zarehaghi, Davoud
A1  - Neyshabouri, Mohammadreza
A1  - Samadianfard, Saeed
A1  - Ghorbani, Mohammad Ali
A1  - Nabipour, Narjes
A1  - Chau, Kwok-Wing
T1  - Comparative analysis of hybrid models of firefly optimization algorithm with support vector machines and multilayer perceptron for predicting soil temperature at different depths
JF  - Engineering Applications of Computational Fluid Mechanics
N2  - This research aims to model soil temperature (ST) using machine learning models of multilayer perceptron (MLP) algorithm and support vector machine (SVM) in hybrid form with the Firefly optimization algorithm, i.e. MLP-FFA and SVM-FFA. In the current study, measured ST and meteorological parameters of Tabriz and Ahar weather stations in a period of 2013–2015 are used for training and testing of the studied models with one and two days as a delay. To ascertain conclusive results for validation of the proposed hybrid models, the error metrics are benchmarked in an independent testing period. Moreover, Taylor diagrams utilized for that purpose. Obtained results showed that, in a case of one day delay, except in predicting ST at 5 cm below the soil surface (ST5cm) at Tabriz station, MLP-FFA produced superior results compared with MLP, SVM, and SVM-FFA models. However, for two days delay, MLP-FFA indicated increased accuracy in predicting ST5cm and ST 20cm of Tabriz station and ST10cm of Ahar station in comparison with SVM-FFA. Additionally, for all of the prescribed models, the performance of the MLP-FFA and SVM-FFA hybrid models in the testing phase was found to be meaningfully superior to the classical MLP and SVM models.
KW  - Bodentemperatur
KW  - Algorithmus
KW  - Maschinelles Lernen
KW  - Neuronales Netz
KW  - firefly optimization algorithm
KW  - soil temperature
KW  - artificial neural networks
KW  - hybrid machine learning
KW  - OA-Publikationsfonds2019
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200911-42347
UR  - https://www.tandfonline.com/doi/full/10.1080/19942060.2020.1788644
VL  - 2020
IS  - Volume 14, Issue 1
SP  - 939
EP  - 953
ER  - 
TY  - JOUR
A1  - Hassannataj Joloudari, Javad
A1  - Hassannataj Joloudari, Edris
A1  - Saadatfar, Hamid
A1  - GhasemiGol, Mohammad
A1  - Razavi, Seyyed Mohammad
A1  - Mosavi, Amir
A1  - Nabipour, Narjes
A1  - Shamshirband, Shahaboddin
A1  - Nadai, Laszlo
T1  - Coronary Artery Disease Diagnosis: Ranking the Significant Features Using a Random Trees Model
JF  - International Journal of Environmental Research and Public Health,  IJERPH
N2  - Heart disease is one of the most common diseases in middle-aged citizens. Among the vast number of heart diseases, coronary artery disease (CAD) is considered as a common cardiovascular disease with a high death rate. The most popular tool for diagnosing CAD is the use of medical imaging, e.g., angiography. However, angiography is known for being costly and also associated with a number of side effects. Hence, the purpose of this study is to increase the accuracy of coronary heart disease diagnosis through selecting significant predictive features in order of their ranking. In this study, we propose an integrated method using machine learning. The machine learning methods of random trees (RTs), decision tree of C5.0, support vector machine (SVM), and decision tree of Chi-squared automatic interaction detection (CHAID) are used in this study. The proposed method shows promising results and the study confirms that the RTs model outperforms other models.
KW  - Maschinelles Lernen
KW  - Machine learning
KW  - Deep learning
KW  - coronary artery disease
KW  - heart disease diagnosis
KW  - health informatics
KW  - data science
KW  - big data
KW  - predictive model
KW  - ensemble model
KW  - random forest
KW  - industry 4.0
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200213-40819
UR  - https://www.mdpi.com/1660-4601/17/3/731
VL  - 2020
IS  - Volume 17, Issue 3, 731
PB  - MDPI
ER  - 
TY  - JOUR
A1  - Harirchian, Ehsan
A1  - Lahmer, Tom
A1  - Buddhiraju, Sreekanth
A1  - Mohammad, Kifaytullah
A1  - Mosavi, Amir
T1  - Earthquake Safety Assessment of Buildings through Rapid Visual Screening
JF  - Buildings
N2  - Earthquake is among the most devastating natural disasters causing severe economical, environmental, and social destruction. Earthquake safety assessment and building hazard monitoring can highly contribute to urban sustainability through identification and insight into optimum materials and structures. While the vulnerability of structures mainly depends on the structural resistance, the safety assessment of buildings can be highly challenging. In this paper, we consider the Rapid Visual Screening (RVS) method, which is a qualitative procedure for estimating structural scores for buildings suitable for medium- to high-seismic cases. This paper presents an overview of the common RVS methods, i.e., FEMA P-154, IITK-GGSDMA, and EMPI. To examine the accuracy and validation, a practical comparison is performed between their assessment and observed damage of reinforced concrete buildings from a street survey in the Bingöl region, Turkey, after the 1 May 2003 earthquake. The results demonstrate that the application of RVS methods for preliminary damage estimation is a vital tool. Furthermore, the comparative analysis showed that FEMA P-154 creates an assessment that overestimates damage states and is not economically viable, while EMPI and IITK-GGSDMA provide more accurate and practical estimation, respectively.
KW  - Maschinelles Lernen
KW  - Machine learning
KW  - Erdbeben
KW  - buildings
KW  - earthquake safety assessment
KW  - earthquake
KW  - extreme events
KW  - seismic assessment
KW  - natural hazard
KW  - mitigation
KW  - rapid visual screening
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200331-41153
UR  - https://www.mdpi.com/2075-5309/10/3/51
VL  - 2020
IS  - Volume 10, Issue 3
PB  - MDPI
ER  - 
TY  - JOUR
A1  - Kargar, Katayoun
A1  - Samadianfard, Saeed
A1  - Parsa, Javad
A1  - Nabipour, Narjes
A1  - Shamshirband, Shahaboddin
A1  - Mosavi, Amir
A1  - Chau, Kwok-Wing
T1  - Estimating longitudinal dispersion coefficient in natural streams using empirical models and machine learning algorithms
JF  - Engineering Applications of Computational Fluid Mechanics
N2  - The longitudinal dispersion coefficient (LDC) plays an important role in modeling the transport of pollutants and sediment in natural rivers. As a result of transportation processes, the concentration of pollutants changes along the river. Various studies have been conducted to provide simple equations for estimating LDC. In this study, machine learning methods, namely support vector regression, Gaussian process regression, M5 model tree (M5P) and random forest, and multiple linear regression were examined in predicting the LDC in natural streams. Data sets from 60 rivers around the world with different hydraulic and geometric features were gathered to develop models for LDC estimation. Statistical criteria, including correlation coefficient (CC), root mean squared error (RMSE) and mean absolute error (MAE), were used to scrutinize the models. The LDC values estimated by these models were compared with the corresponding results of common empirical models. The Taylor chart was used to evaluate the models and the results showed that among the machine learning models, M5P had superior performance, with CC of 0.823, RMSE of 454.9 and MAE of 380.9. The model of Sahay and Dutta, with CC of 0.795, RMSE of 460.7 and MAE of 306.1, gave more precise results than the other empirical models. The main advantage of M5P models is their ability to provide practical formulae. In conclusion, the results proved that the developed M5P model with simple formulations was superior to other machine learning models and empirical models; therefore, it can be used as a proper tool for estimating the LDC in rivers.
KW  - Maschinelles Lernen
KW  - Gaussian process regression
KW  - longitudinal dispersion coefficient
KW  - M5 model tree
KW  - random forest
KW  - support vector regression
KW  - rivers
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200128-40775
UR  - https://www.tandfonline.com/doi/full/10.1080/19942060.2020.1712260
VL  - 2020
IS  - Volume 14, No. 1
SP  - 311
EP  - 322
PB  - Taylor & Francis
ER  - 
TY  - JOUR
A1  - Band, Shahab S.
A1  - Janizadeh, Saeid
A1  - Saha, Sunil
A1  - Mukherjee, Kaustuv
A1  - Khosrobeigi Bozchaloei, Saeid
A1  - Cerdà, Artemi
A1  - Shokri, Manouchehr
A1  - Mosavi, Amir Hosein
T1  - Evaluating the Efficiency of Different Regression, Decision Tree, and Bayesian Machine Learning Algorithms in Spatial Piping Erosion Susceptibility Using ALOS/PALSAR Data
JF  - Land
N2  - Piping erosion is one form of water erosion that leads to significant changes in the landscape and environmental degradation. In the present study, we evaluated piping erosion modeling in the Zarandieh watershed of Markazi province in Iran based on random forest (RF), support vector machine (SVM), and Bayesian generalized linear models (Bayesian GLM) machine learning algorithms. For this goal, due to the importance of various geo-environmental and soil properties in the evolution and creation of piping erosion, 18 variables were considered for modeling the piping erosion susceptibility in the Zarandieh watershed. A total of 152 points of piping erosion were recognized in the study area that were divided into training (70%) and validation (30%) for modeling. The area under curve (AUC) was used to assess the effeciency of the RF, SVM, and Bayesian GLM. Piping erosion susceptibility results indicated that all three RF, SVM, and Bayesian GLM models had high efficiency in the testing step, such as the AUC shown with values of 0.9 for RF, 0.88 for SVM, and 0.87 for Bayesian GLM. Altitude, pH, and bulk density were the variables that had the greatest influence on the piping erosion susceptibility in the Zarandieh watershed. This result indicates that geo-environmental and soil chemical variables are accountable for the expansion of piping erosion in the Zarandieh watershed.
KW  - Maschinelles Lernen
KW  - Bayes-Verfahren
KW  - Naturkatastrophe
KW  - random forest
KW  - support vector machine
KW  - geoinformatics
KW  - OA-Publikationsfonds2020
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20210122-43424
UR  - https://www.mdpi.com/2073-445X/9/10/346
VL  - 2020
IS  - volume 9, issue 10, article 346
SP  - 1
EP  - 22
PB  - MDPI
CY  - Basel
ER  - 
TY  - JOUR
A1  - Nabipour, Narjes
A1  - Mosavi, Amir
A1  - Baghban, Alireza
A1  - Shamshirband, Shahaboddin
A1  - Felde, Imre
T1  - Extreme Learning Machine-Based Model for Solubility Estimation of Hydrocarbon Gases in Electrolyte Solutions
JF  - Processes
N2  - Calculating hydrocarbon components solubility of natural gases is known as one of the important issues for operational works in petroleum and chemical engineering. In this work, a novel solubility estimation tool has been proposed for hydrocarbon gases—including methane, ethane, propane, and butane—in aqueous electrolyte solutions based on extreme learning machine (ELM) algorithm. Comparing the ELM outputs with a comprehensive real databank which has 1175 solubility points yielded R-squared values of 0.985 and 0.987 for training and testing phases respectively. Furthermore, the visual comparison of estimated and actual hydrocarbon solubility led to confirm the ability of proposed solubility model. Additionally, sensitivity analysis has been employed on the input variables of model to identify their impacts on hydrocarbon solubility. Such a comprehensive and reliable study can help engineers and scientists to successfully determine the important thermodynamic properties, which are key factors in optimizing and designing different industrial units such as refineries and petrochemical plants.
KW  - Maschinelles Lernen
KW  - Machine learning
KW  - Deep learning
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200113-40624
UR  - https://www.mdpi.com/2227-9717/8/1/92
VL  - 2020
IS  - Volume 8, Issue 1, 92
PB  - MDPI
ER  - 
TY  - JOUR
A1  - Shamshirband, Shahaboddin
A1  - Joloudari, Javad Hassannataj
A1  - GhasemiGol, Mohammad
A1  - Saadatfar, Hamid
A1  - Mosavi, Amir
A1  - Nabipour, Narjes
T1  - FCS-MBFLEACH: Designing an Energy-Aware Fault Detection System for Mobile Wireless Sensor Networks
JF  - Mathematics
N2  - Wireless sensor networks (WSNs) include large-scale sensor nodes that are densely distributed over a geographical region that is completely randomized for monitoring, identifying, and analyzing physical events. The crucial challenge in wireless sensor networks is the very high dependence of the sensor nodes on limited battery power to exchange information wirelessly as well as the non-rechargeable battery of the wireless sensor nodes, which makes the management and monitoring of these nodes in terms of abnormal changes very difficult. These anomalies appear under faults, including hardware, software, anomalies, and attacks by raiders, all of which affect the comprehensiveness of the data collected by wireless sensor networks. Hence, a crucial contraption should be taken to detect the early faults in the network, despite the limitations of the sensor nodes. Machine learning methods include solutions that can be used to detect the sensor node faults in the network. The purpose of this study is to use several classification methods to compute the fault detection accuracy with different densities under two scenarios in regions of interest such as MB-FLEACH, one-class support vector machine (SVM), fuzzy one-class, or a combination of SVM and FCS-MBFLEACH methods. It should be noted that in the study so far, no super cluster head (SCH) selection has been performed to detect node faults in the network. The simulation outcomes demonstrate that the FCS-MBFLEACH method has the best performance in terms of the accuracy of fault detection, false-positive rate (FPR), average remaining energy, and network lifetime compared to other classification methods.
KW  - Vernetzung
KW  - wireless sensor networks
KW  - machine learning
KW  - Funktechnik
KW  - Sensor
KW  - Maschinelles Lernen
KW  - Internet of Things
KW  - OA-Publikationsfonds2019
Y1  - 2019
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:gbv:wim2-20200107-40541
UR  - https://www.mdpi.com/2227-7390/8/1/28
VL  - 2020
IS  - Volume 8, Issue 1, article 28
PB  - MDPI
ER  -