Skip to main content
In this paper we address the issue of privacy preserving data mining. Specifically, we consider a scenario in which two parties owning confidential databases wish to run a data mining algorithm on the union of their databases, without... more
    • by 
    •   15  
      Information SystemsData MiningSoft ComputingPure Mathematics
Naive Bayes is one of most effective classification algorithms. In many applications, however, a ranking of examples are more desirable than just classification. How to extend naive Bayes to improve its ranking performance is an... more
    • by 
    •   10  
      Data MiningMarkov ProcessesMonte Carlo MethodsDecision Trees
Constraint satisfaction is becoming the paradigm of choice for solving many real-world problems. To date, most approaches to constraint satisfaction have focused on solving a problem using some form of backtrack search. Furthermore, the... more
    • by 
    •   5  
      Product ConfigurationDecision TreeConstraint SolvingConstraint Satisfaction
There is growing interest in scaling up the widely-used decision-tree learning algorithms to very large data sets. Although numerous diverse techniques have been proposed, a fast tree-growing algorithm without substantial decrease in... more
    • by 
    •   7  
      Text ClassificationScaling upTime ComplexityNaive Bayes
It has been observed that traditional decision trees produce poor probability estimates. In many applications, however, a probability estimation tree (PET) with accurate probability estimates is desirable. Some researchers ascribe the... more
    • by 
    •   6  
      Artificial IntelligenceClassificationDecision TreeNaive Bayes
    • by 
    • Decision Tree Learning
The driving force behind the evolution of computing from automating automation [3] to automating learning [8] can be attributed to Machine Learning algorithms. By being able to generalize from examples, today's computers have the ability... more
    • by 
    •   6  
      Machine LearningEntropySupervised LearningInformation Gain
Accurate probability estimation generated by learning models is desirable in some practical applications, such as medical diagnosis. In this paper, we empirically study traditional decision-tree learning models and their variants in terms... more
    • by 
    •   10  
      Machine LearningDecision TreeLearning ModelSupport vector machine
In data mining, large differences in prior class probabilities known as the class imbalance problem have been reported to hinder the performance of classifiers such as decision trees. Dealing with imbalanced and cost-sensitive data has... more
    • by 
    •   8  
      Data MiningDecision TreesKnowledge DiscoveryDecision Tree
Classification decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud detection, etc. Highly parallel algorithms for constructing classification decision trees are desirable... more
    • by 
    •   13  
      Information SystemsData MiningParallel ProcessingDecision Tree
In this paper, we present "K-Means+ID3," a method to cascade k-Means clustering and the ID3 decision tree learning methods for classifying anomalous and normal activities in a computer network, an active electronic circuit, and a... more
    • by 
    •   11  
      Anomaly DetectionComputer NetworkNearest NeighborDecision Tree
CRIME is one of the major problems encountered in any society and universities together with other higher institutions of learning are not exceptions. Thus, there is an urgent need for security agents and agencies to battle and eradicate... more
    • by 
    •   4  
      EngineeringComputer ScienceDecision TreeDecision Tree Learning
The required learning time and curse of dimensionality restrict the applicability of Reinforcement Learning(RL) on real robots. Difficulty in inclusion of initial knowledge and understanding the learned rules must be added to the... more
    • by 
    •   5  
      Reinforcement LearningOptimization ProblemPrior KnowledgeDecision Tree Learning
    • by 
    •   7  
      Reinforcement LearningTransfer LearningSupervised LearningFunction approximation
This paper develops a new algorithm for inducing cost-sensitive decision trees that is inspired by the multi-armed bandit problem, in which a player in a casino has to decide which slot machine (bandit) from a selection of slot machines... more
    • by  and +1
    •   2  
      Cost sensitive learningDecision Tree Learning
This paper describes a hybrid methodology that integrates genetic algorithms and decision tree learning in order to evolve useful subsets of discriminatory features for recognizing complex visual concepts. A genetic algorithm (GA) is used... more
    • by 
    •   11  
      Evolutionary ComputationPattern RecognitionGenetic AlgorithmFeature Selection
This paper introduces a new methodology for discovering patterns in foodborne disease outbreaks using a data-driven approach. Specifically, our approach uses three data mining methods, namely attribute selection, decision tree learning,... more
    • by 
    •   10  
      Data MiningFood EngineeringSalmonella enteritidisEscherichia coli
There had been an enormous increase in the crime in the recent past. Crimes are a common social problem affecting the quality of life and the economic growth of a society. With the increase of crimes, law enforcement agencies are... more
    • by 
    •   10  
      Computer ScienceInformation RetrievalInformation SecurityInformation Management
Diabetes Melitus (DM) merupakan penyakit kronis yang banyak diderita oleh penduduk Indonesia, penyakit ini disebabkan karena kadar glukosa dalam darah di atas nilai normal. Penyakit ini termasuk penyakit yang rumit dan mematikan, oleh... more
    • by 
    •   4  
      Computer ScienceArtificial IntelligenceDecision TreeDecision Tree Learning
In this paper we present a novel, customizable IE paradigm that takes advantage of predicate-argument structures. We also introduce a new way of automatically identifying predicate argument structures, which is central to our IE paradigm.... more
    • by 
    •   3  
      Information ExtractionArgument StructureDecision Tree Learning
Decision tree learning algorithms have been successfully used in knowledge discovery. They use induction in order to provide an appropriate classification of objects in terms of their attributes, inferring decision tree rules. This paper... more
    • by  and +1
    •   4  
      Knowledge DiscoveryDecision TreeClassification RulesDecision Tree Learning
We address the problem of designing a machine learning tool for the automatic diagnosis of Parkinson's disease that is capable of providing an explanation of its behavior in terms that are easy to understand by clinicians. For this... more
    • by 
    •   3  
      Computer ScienceDecision TreeDecision Tree Learning
Abstract: In this work, decision tree learning algorithms and fuzzy inferencing systems are applied for galaxy morphology classification. In particular, the CART, the C4.5, the Random Forest and fuzzy logic algorithms are studied and... more
    • by 
    •   6  
      Machine LearningFuzzy LogicRandom ForestSpiral Galaxies
    • by 
    •   68  
      EngineeringStatisticsSpanishSignal Processing
    • by 
    •   11  
      Computer ScienceArtificial IntelligenceMachine LearningData Mining
Properly addressing the discretization process of continous valued features is an important problem during decision tree learning. This paper describes four multi-interval discretization methods for induction of decision trees used in... more
    • by 
    •   4  
      Machine LearningDecision TreeNeural NetDecision Tree Learning
    • by 
    •   20  
      Expert SystemsControl TheoryQueueing theoryVirtual Machines
In this work, decision tree learning algorithms and fuzzy inferencing systems are applied for galaxy morphology classification. In particular, the CART, the C4.5, the Random Forest and fuzzy logic algorithms are studied and reliable... more
    • by 
    •   6  
      Machine LearningFuzzy LogicRandom ForestSpiral Galaxies
Artykuł ten jest czwartym z serii artykułów poświęconych przeglądowej prezentacji poszczególnych metod sztucznej inteligencji (AI) prezentowanych jako wyspy archipelagu, co zostało skomentowane i wyjaśnione w pierwszym artykule tego... more
    • by 
    •   8  
      Computer ScienceArtificial IntelligenceNatural Language ProcessingZipf Law
Classification based on decision trees is one of the important problems in data mining and has applications in many fields. In recent years, database systems have become highly distributed, and distributed system paradigms, such as... more
    • by 
    •   15  
      GeneticsDistributed DatabaseData MiningDistributed Algorithms
This paper presents the results of an explorative study on predicting aspects of playing behavior for the major commercial title Tomb Raider: Underworld (TRU). Various supervised learning algorithms are trained on a large-scale set of... more
    • by 
    •   9  
      Data MiningVideo GamesSupervised LearningData Collection
This article presents a new classification algorithm, called CLEF, which induces a -machine by constructing its own features based on the training data. The features can be viewed as defining subsets of the instance space, and they allow... more
    • by 
    •   8  
      Cognitive ScienceMachine LearningDecision TreeSupport vector machine
In this paper we present an approach to using decision tree learning as a basis for improving search performance for real-world constraint satisfaction problems. The problem used to evaluate the approach is a large real-world... more
    • by 
    •   5  
      Automotive IndustryProblem SolvingConstraint SatisfactionDecision Tree Learning
In this article we show that there is a strong connection between decision tree learning and local pattern mining. This connection allows us to solve the computationally hard problem of finding optimal decision trees in a wide range of... more
    • by 
    •   7  
      Information SystemsPattern MiningFrequent Itemset MiningDecision Tree
There had been an enormous increase in the crime in the recent past. Crimes are a common social problem affecting the quality of life and the economic growth of a society. With the increase of crimes, law enforcement agencies are... more
    • by 
    •   9  
      Computer ScienceInformation RetrievalInformation SecurityInformation Management
Discretization is a critical component of data mining whereby continuous attributes of a data set are converted into discrete ones by creating intervals either before or during learning. There are many good reasons for preprocessing... more
    • by 
    •   17  
      Machine LearningData MiningInductive LearningEntropy
We describe and evaluate an information-theoretic algorithm for datadriven induction of classification models based on a minimal subset of available features. The relationship between input (predictive) features and the target... more
    • by 
    •   15  
      Information TheoryData MiningFeature SelectionDatabase Management Systems
Classification decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud detection, etc. Highly parallel algorithms for constructing classification decision trees are desirable... more
    • by 
    •   13  
      Information SystemsData MiningParallel ProcessingDecision Tree
Accurate prediction of fault prone modules (a module is equivalent to a C function or a C+ + method) in software development process enables effective detection and identification of defects. Such prediction models are especially... more
    • by 
    •   19  
      Machine LearningLogistic RegressionDecision TreesSoftware Quality
Accurate prediction of fault prone modules (a module is equivalent to a C function or a C+ + method) in software development process enables effective detection and identification of defects. Such prediction models are especially... more
    • by 
    •   19  
      Machine LearningLogistic RegressionDecision TreesSoftware Quality
Most data mining algorithms assume static behavior of the incoming data. In the real world, the situation is different and most continuously collected data streams are generated by dynamic processes, which may change over time, in some... more
    • by 
    •   13  
      Information SystemsApplied MathematicsData MiningOnline Learning
In inductive databases, there is no conceptual difference between data and the models describing the data: both can be stored and queried using some query language. The approach that adheres most strictly to this philosophy is probably... more
    • by 
    •   3  
      Decision TreeAssociation RuleDecision Tree Learning
This paper proposes an approach to sentence-level paraphrase identification by text canon-icalization. The source sentence pairs are first converted into surface text that approxi-mates canonical forms. A decision tree learn-ing module... more
    • by 
    •   3  
      Supervised LearningDecision TreeDecision Tree Learning
The naive Bayesian classifier provides a simple and effective approach to classifier learning, but its attribute independence assumption is often violated in the real world. A number of approaches have sought to alleviate this problem. A... more
    • by 
    •   13  
      Cognitive ScienceMachine LearningStatistical SignificanceDecision Tree
In today's world, there is an improved advance in hardware technology which increases the capability to store and record personal data about consumers and individuals. Data mining extracts knowledge to support a variety of areas as... more
    • by 
    •   2  
      Decision TreeDecision Tree Learning
Knowledge Discovery and Data Mining (KDD) is a multidisciplinary field of study that focuses on methodologies for extracting useful knowledge from data. During the latest Covid-19 pandemic, there was a significant uptick in online-based... more
    • by 
    •   4  
      Computer ScienceMachine LearningDecision TreeDecision Tree Learning
While data miners can learn defect predictors from static code features, the performance improvement in such detectors is curiously static. One explanation for this ceiling effect is that static code features have limited information... more
    • by 
    •   6  
      Data MiningProduct metricsDefect PredictionDecision Tree Learning
This paper describes a method for classification of remote sensing images integrating the importance of texture with the eficiency of the artificial neural networks. The classification process consists of applying Gabor filters followed... more
    • by 
    •   10  
      GeographyRemote SensingDecision TreeSupervised Classification
Decision trees are probably the most popular and commonly used classification model. They are recursively built following a top-down approach (from general concepts to particular examples) by repeated splits of the training dataset. When... more
    • by 
    •   8  
      IDADecision TreeHierarchical ClusteringIntelligent Data Analysis