UB Weimar OPUS 4.6.3 | 54 Informatik

54 Informatik

54.00 Informatik: Allgemeines (7)
54.01 Geschichte der Informatik (1)
54.04 Ausbildung, Beruf, Organisationen
54.08 Informatik in Beziehung zu Mensch und Gesellschaft (7)
54.10 Theoretische Informatik (1)
54.20 Datenverarbeitungsanlagen: Allgemeines
54.21 Rechnerperipherie, Datenkommunikationshardware
54.22 Datenspeicher
54.23 Rechnerhardware
54.25 Parallele Datenverarbeitung
54.26 Mikrocomputer (1)
54.27 Prozessrechner
54.28 Nichtelektronische Datenverarbeitung
54.29 Datenverarbeitungsanlagen: Sonstiges
54.30 Systemarchitektur: Allgemeines
54.31 Rechnerarchitektur
54.32 Rechnerkommunikation (9)
54.33 Computerbewertung
54.38 Computersicherheit (5)
54.39 Systemarchitektur: Sonstiges (8)
54.50 Programmierung: Allgemeines (2)
54.51 Programmiermethodik (1)
54.52 Software engineering (10)
54.53 Programmiersprachen
54.54 Betriebssysteme
54.55 Auszeichnungssprachen
54.59 Programmierung: Sonstiges
54.61 Datenverarbeitungsmanagement
54.62 Datenstrukturen (1)
54.64 Datenbanken (3)
54.65 Webentwicklung, Webanwendungen
54.70 Computermethodik: Allgemeines (1)
54.71 Logikprogrammierung
54.72 Künstliche Intelligenz (14)
54.73 Computergraphik (19)
54.74 Maschinelles Sehen (14)
54.75 Sprachverarbeitung
54.76 Computersimulation (8)
54.79 Computermethodik: Sonstiges
54.80 Angewandte Informatik (18)
54.81 Anwendungssoftware (1)
54.82 Textverarbeitung (3)
54.84 Webmanagement
54.87 Multimedia (3)
54.88 Computer in der Freizeit (5)
54.89 Angewandte Informatik: Sonstiges (178)
54.99 Informatik: Sonstiges (1)

3 search hits

1 to 3

Sort by

A Hybrid Clustering and Classiﬁcation Technique for Forecasting Short-Term Energy Consumption (2018)

Mosavi, Amir ; Torabi, Mehrnoosh ; Hashemi, Sattar ; Saybani, Mahmoud Reza ; Shamshirband, Shahaboddin

Electrical energy distributor companies in Iran have to announce their energy demand at least three 3-day ahead of the market opening. Therefore, an accurate load estimation is highly crucial. This research invoked methodology based on CRISP data mining and used SVM, ANN, and CBA-ANN-SVM (a novel hybrid model of clustering with both widely used ANN and SVM) to predict short-term electrical energy demand of Bandarabbas. In previous studies, researchers introduced few effective parameters with no reasonable error about Bandarabbas power consumption. In this research we tried to recognize all efﬁcient parameters and with the use of CBA-ANN-SVM model, the rate of error has been minimized. After consulting with experts in the ﬁeld of power consumption and plotting daily power consumption for each week, this research showed that ofﬁcial holidays and weekends have impact on the power consumption. When the weather gets warmer, the consumption of electrical energy increases due to turning on electrical air conditioner. Also, con-sumption patterns in warm and cold months are different. Analyzing power consumption of the same month for different years had shown high similarity in power consumption patterns. Factors with high impact on power consumption were identiﬁed and statistical methods were utilized to prove their impacts. Using SVM, ANN and CBA-ANN-SVM, the model was built. Sine the proposed method (CBA-ANN-SVM) has low MAPE 5 1.474 (4 clusters) and MAPE 5 1.297 (3 clusters) in comparison with SVM (MAPE 5 2.015) and ANN (MAPE 5 1.790), this model was selected as the ﬁnal model. The ﬁnal model has the beneﬁts from both models and the beneﬁts of clustering. Clustering algorithm with discovering data structure, divides data into several clusters based on similarities and differences between them. Because data inside each cluster are more similar than entire data, modeling in each cluster will present better results. For future research, we suggest using fuzzy methods and genetic algorithm or a hybrid of both to forecast each cluster. It is also possible to use fuzzy methods or genetic algorithms or a hybrid of both without using clustering. It is issued that such models will produce better and more accurate results. This paper presents a hybrid approach to predict the electric energy usage of weather-sensitive loads. The presented methodutilizes the clustering paradigm along with ANN and SVMapproaches for accurate short-term prediction of electric energyusage, using weather data. Since the methodology beinginvoked in this research is based on CRISP data mining, datapreparation has received a gr eat deal of attention in thisresear ch. Once data pre-processing was done, the underlyingpattern of electric energy consumption was extracted by themeans of machine learning methods to precisely forecast short-term energy consumption. The proposed approach (CBA-ANN-SVM) was applied to real load data and resulting higher accu-racy comparing to the existing models. 2018 American Institute of Chemical Engineers Environ Prog, 2018 https://doi.org/10.1002/ep.12934

Analyzing and Predicting Quality Flaws in User-generated Content: The Case of Wikipedia (2013)

Anderka, Maik

Web applications that are based on user-generated content are often criticized for containing low-quality information; a popular example is the online encyclopedia Wikipedia. The major points of criticism pertain to the accuracy, neutrality, and reliability of information. The identification of low-quality information is an important task since for a huge number of people around the world it has become a habit to first visit Wikipedia in case of an information need. Existing research on quality assessment in Wikipedia either investigates only small samples of articles, or else deals with the classification of content into high-quality or low-quality. This thesis goes further, it targets the investigation of quality flaws, thus providing specific indications of the respects in which low-quality content needs improvement. The original contributions of this thesis, which relate to the fields of user-generated content analysis, data mining, and machine learning, can be summarized as follows: (1) We propose the investigation of quality flaws in Wikipedia based on user-defined cleanup tags. Cleanup tags are commonly used in the Wikipedia community to tag content that has some shortcomings. Our approach is based on the hypothesis that each cleanup tag defines a particular quality flaw. (2) We provide the first comprehensive breakdown of Wikipedia's quality flaw structure. We present a flaw organization schema, and we conduct an extensive exploratory data analysis which reveals (a) the flaws that actually exist, (b) the distribution of flaws in Wikipedia, and, (c) the extent of flawed content. (3) We present the first breakdown of Wikipedia's quality flaw evolution. We consider the entire history of the English Wikipedia from 2001 to 2012, which comprises more than 508 million page revisions, summing up to 7.9 TB. Our analysis reveals (a) how the incidence and the extent of flaws have evolved, and, (b) how the handling and the perception of flaws have changed over time. (4) We are the first who operationalize an algorithmic prediction of quality flaws in Wikipedia. We cast quality flaw prediction as a one-class classification problem, develop a tailored quality flaw model, and employ a dedicated one-class machine learning approach. A comprehensive evaluation based on human-labeled Wikipedia articles underlines the practical applicability of our approach.

Modeling Non-Standard Text Classification Tasks (2013)

Lipka, Nedim

Text classification deals with discovering knowledge in texts and is used for extracting, filtering, or retrieving information in streams and collections. The discovery of knowledge is operationalized by modeling text classification tasks, which is mainly a human-driven engineering process. The outcome of this process, a text classification model, is used to inductively learn a text classification solution from a priori classified examples. The building blocks of modeling text classification tasks cover four aspects: (1) the way examples are represented, (2) the way examples are selected, (3) the way classifiers learn from examples, and (4) the way models are selected. This thesis proposes methods that improve the prediction quality of text classification solutions for unseen examples, especially for non-standard tasks where standard models do not fit. The original contributions are related to the aforementioned building blocks: (1) Several topic-orthogonal text representations are studied in the context of non-standard tasks and a new representation, namely co-stems, is introduced. (2) A new active learning strategy that goes beyond standard sampling is examined. (3) A new one-class ensemble for improving the effectiveness of one-class classification is proposed. (4) A new model selection framework to cope with subclass distribution shifts that occur in dynamic environments is introduced.

1 to 3

Universitätsbibliothek
Weimar Open Access

54 Informatik

Refine

Document Type

Author

Institute

Keywords

Year of publication

3 search hits

UniversitätsbibliothekWeimar Open Access

54 Informatik

Refine

Document Type

Author

Institute

Keywords

Year of publication

3 search hits

Universitätsbibliothek
Weimar Open Access