Zheng classification with missing feature values using. Computing statistical information on probabilistic data has attracted a lot of attention recently, as the data generated from a wide range of data sources are inherently fuzzy or uncertain. An extended rough set model for generalized incomplete. Rough set theory can be considered as a tool to study the uncertain, indescendent data by classifying the set into ternary. In this paper, we study an important statistical query on probabilistic data. Attribute reduction based on rough set theory starts from an information system that contains data about the objects of interest, which are characterized by a finite set of attributes. This paper deals with knowledge acquisition in incomplete information systems using rough set theory. This paper discusses the information fusion and uncertainty measure based on rough set theory. Multigranulations rough set method of attribute reduction in. Models and attribute reductions covers theoretical study of generalizations of rough set model in various incomplete information systems. Multigranulation rough sets in incomplete information system.
The attribute sets along with the objects in an information system. Accordingly, an elementary set is any set of objects that are not different, a sharp precise set any union of some elementary sets, otherwise the set is rough imprecise, vague. It merges these values together to create a possibly smaller. After giving definitions and concepts of knowledge dependency and knowledge dependency degree for incomplete information system in tolerance rough set model by distinguishing decision attribute containing missing attribute value or not, the result of maintaining reflectivity, transitivity, augmentation, decomposition law and merge law for. Attribute reduction is one of the most important problems in rough set theory. Rough set theory rst, first introduced by pawlak 1,2, is a powerful mathematical tool to. Where m index termsalgorithm, incomplete information system, minimal granule, multigranulation, rough set model. However, classical rough set theory cannot cope with the incomplete information systems where some attribute values are missing. Zheng classification system with missing feature values in tcm can be viewed as a 3tuple s.
Rough set theory overlaps with many other theories such that fuzzy sets, evidence theory, and statistics. This relationship of not distinguishing is a mathematical basis for the theory of rough sets. Analysis of an incomplete information system using the. Procedia apa bibtex chicago endnote harvard json mla ris xml iso 690 pdf downloads 350. We can combine the rules which have the same generalized decisions as. On the unknown attribute values in learning from examples. Clark ross consider and play the opening to schoenbergs three piano pieces, op. The indiscernibility relation is a fundamental concept of the rough set theory. For simplicity and clarity, we combine the same descriptions for the objects in the original data. A granular computing approach to decision analysis using rough set theory iftikhar u. This fundamental insight into mechanism design with incomplete information has allowed many allocation problems to be analyzed and forms the. This is due to the volume, complexity, and heterogeneity of such datasets, as well as fundamental gaps in our knowledge of highdimensional processes where distance measures degenerate curse of dimensionality 1, 2. From a practical point of view, it is a good tool for data analysis.
Incompleteness in xml was addressed recently 4, 7, 15, and one issue that required a complete reworking was the concept of certain answers for queries that return xml documents i. As such, organizations would benefit from partitioning the electorate to not duplicate. Some of them may be shared outside the team, but only in a processed, noneditable form pdf, all the docs are assumed being able to be exported to this format. An incremental approach to attribute reduction from dynamic incomplete decision systems in rough set theory. Combining system theory, process mining and fuzzy logic authors. A modified rough set approach to incomplete information systems. Combining both kinds of data can result in an improve.
Sikder department of computer and information science, cleveland state university, cleveland, oh 44115, usa abstract this paper presents a granular computing approach to decision analysis using rough set theory and its variable precision extension. In this paper it is applied a rough set approach that takes into account an incomplete information system to study the steadystate security of an electric power system. By combining these properties, one can construct distinct rough set models. We study the method of multisource fusion in incomplete multisource systems. Algorithms of minimal mutual compatible granules and. Research article a modified rough set approach to incomplete information systems. Extended tolerance relation to define a new rough set. Knowledge dependency degree in trsm and its application to. A novel threeway decision model based on incomplete information system. Rough set theory is an extension of set theory which proposed by pawlak 1991 for describe and classify the incomplete or insufficient information. O is a nonempty finite set of objects at is a nonempty finite set of attributes, such that for any a. The rough set theory has been conceived as a tool to conceptualize, organize and analyze various types of data, in particular, to deal with inexact, uncertain or vague knowledge.
Knowledge reduction algorithm the rough set theory is a mathematical tool which can quantitatively analyze the imprecise, inconsistent and incomplete information and knowledge. Choi kaist logic and set theory october 7, 2012 1 26. Theoretical study on a new information entropy and its use in attribute reduction ping luo1. To merge these notions into a joint theory that combines their mutual strengths has been the object of a. Rough set theory rst is an extension of set theory for study of the intelligent systems characterized by insuf. A rough setbased incremental approach for learning knowledge in. Choi department of mathematical science kaist, daejeon, south korea fall semester, 2012 s. Dynamic faq retrieval with rough set theory dengyiv chiu, peishin chen, and yachen pan faculty of information management, chung hua university, hsinchu, taiwan 300, r. After nearly thirty years of development, rough set theory has been widely used in the fields of. The divide and conquer method is a typical granular computing method using multiple levels of abstraction and granulations. Rough set theory is an extension of set theory which proposed by pawlak 1991 for. Incomplete information system and rough set theory models and. Information attribute reduction based on the rough set theory.
Also it verifies logic, and allows inconsistent data and no certainty to the discovery of incomplete implications. The concept of similarity classes in incomplete information systems is first proposed. Feature subset selection using rough sets for high. A modified rough set approach to incomplete information. A noisetolerant approach to fuzzyrough feature selection.
It first discusses some rough set extensions in incomplete information systems. Despite the importance of theory, questions relating to its form and structure are neglected in comparison with questions relating to epistemology. Y ang, expansions of rough sets in incomplete information systems, incomplete information system and rough set theory, science press beijing and springerv erlag berlin. Rough set approaches to incomplete information systems. Uncertainty measure based on evidence theory scientific. Pdf the original rough set model is concerned primarily with the approximation of sets described by single binary relation on the universe. The traditional rough set theory is a powerful tool to deal with complete information system, and its performance to process incomplete information system is weak, m. Introduction rough set theory rst for short 1 is put forward by pawlak in 1982, which, as an generalization of set theory for. Roughsetbased decision model for incomplete information systems.
U, g x, f denote the value that x holds on feature f. A granular computing approach to decision analysis using. All eight possible extended rough set models in incomplete information systems are proposed. How important is it to be able to easily compare and merge. Next, probability of matching is defined from data in information systems and then measures the degree of tolerance. The minimal reducts for the incomplete information system iis are as follows. In early eighties, pawlak 22 introduced the theory of rough sets as an extension of set theory for the study of intelligent systems characterized by insufficient and incomplete information 22, 23, 26. Extended tolerance relation to define a new rough set model in. An overview of dna sequencing michigan state university. Zheng and wang developed a rough set and rule tree based incremental knowledge. There have been efforts in studying incomplete information systems for data classification which are based on the extensions of rough set theory.
Two kinds of partitions, lower and upper approximations, are then formed for the mining of certain and association rules in incomplete decision tables. Pdf evolutionary computation for rough set models in. It discusses not only the regular attributes but also the criteria in the incomplete information systems. Logic and information stanford encyclopedia of philosophy. Moreover, concepts of lower and upper approximations are studied as well as their properties. This paper discusses and proposes a rough set model for an incomplete information system, which defines an extended tolerance relation using frequency of attribute values in such a system. Incomplete information and certain answers in general data. Fuzzy sets 1 and rough sets 2 address two important characteristics of imperfect data and knowledge. Data mining with rough set using mapreduce prachi patil student of me computer. Database integration is a growing and increasingly important field in both research and in dustry. By analyzing existing extended models and technical methods of rough set theory, the strategy of model extension is found to be suitable for processing incomplete information systems instead of filling possible values for missing attributes. Next, probability of matching is defined from data in information systems and. Theoretical study on a new information entropy and its use.
The original rough set theory 1, 2 deals with precise. And uncertainty measure is to reflect the uncertainty of an information system. Globalization has created new trends such as market consolidation, vertical market strategies and mergers in the business world. He was choked with indignation and sorrow, as though his good qualities had been stripped from him by a rough hand, like medals. The paper introduces a rough set model to analyze an information system in which some conditions and decision data are. A novel decisionmaking approach to fund investments based on. There is no unifying theory, single method, or unique set of tools for big data science. Besides it is mathematical tool that overcome the uncertainties and doubts. Rough set theory, originated by pawlak 20,21, has become a. At, where is called the domain of an attribute a, is called an information vector of x any attribute domain v. Pdf a new rough set approach to knowledge discovery in. Rough set extensions in incomplete information systems. The aim of this paper is to present a dominancebased fuzzy rough set approach to incomplete intervalvalued information systems. An algorithm to solve attribute reduction in an incomplete decision table is designed.
Roughsetbased decision model for incomplete information. Incomplete information system and rough set theory. Pdf multigranulation decisiontheoretic rough sets in. Theory behind shotgun sequencing haemophilus influenzae 1. Knowledge acquisition in incomplete information systems. Compute the number of supporting objects for each reduct after combining the identical. The discretization algorithm for rough data and its. Nb note bene it is almost never necessary in a mathematical proof to remember that a function is literally a set of ordered pairs. Based on different types of rough set models, the book presents the practical approaches to compute several reducts in terms of these models. If we wish to understand how it is organized, we could begin by looking at the melody, which seems to naturally break. Situation theory barwise and perry 1983, devlin i 991 provides formal mechanisms by way of constraints between situation types that made it possible to merge. Rough set approach to incomplete information systems.
Big data with rough set using map reduce authorstream. Parallel computation of rough set approximations in information. Rough set theory is one of the best methods to process this kind of data. Finding frequent items in probabilistic data proceedings.
Big data with rough set using map reduce authorstream presentation. Information retrieval, machine learning, and data mining. Journal of chemical and pharmaceutical research, 2014, 66. The discretization algorithm for rough data and its application to intrusion detection. Summary we explore faq frequently asked questions retrieval by applying hierarchical agglomerative clustering method and rough set theory.
One moment he was one person, an instant later he was another. Importance of diffing and merging for design specifications documentation. A relative tolerance relation of rough set for incomplete. The essay addresses issues of causality, explanation, prediction, and generalizati on that underlie an understanding of theory. A fuzzy dominance relation which aims to describe the degree of dominance in terms of pairs of objects is proposed. Decision rough set models as new research areas have already commenced to become new attractive topics 17. Grzymalabusse, on the unknown attribute values in learning from examples, in. An incremental approach to attribute reduction from. Therefore, it is necessary to develop a theory which enables classifications of. In this context, this papers proposal aims to address the limitations of rough set theory.
463 391 458 146 1387 847 1227 130 1026 490 1376 396 1354 506 1516 351 1284 1208 1269 554 470 753 215 854 858 1024 464 520 807 685 1432 932 781