Five Free Open Source Data Mining Tools

2010-12-20  Source: original to this site  Category: Java  Popularity: 95

This article introduces, in turn, five free open source data mining tools available on the Internet.


Orange is a component-based data mining and machine learning software suite. It offers a friendly yet powerful, fast, and versatile visual programming front end for exploratory data analysis and visualization, together with Python bindings for scripting. It contains a complete set of components for data preprocessing, feature scoring and filtering, modeling, model evaluation, and data exploration. It is written in C++ and Python, and its graphical interface is built on the cross-platform Qt framework.


RapidMiner, formerly YALE (Yet Another Learning Environment), is an environment for machine learning and data mining experiments that is used both for research and for real-world data mining tasks. Experiments are composed of a large number of operators, which are recorded in detail in XML files and displayed in RapidMiner's graphical user interface. RapidMiner provides more than 500 operators covering the main machine learning procedures, and it also incorporates the learning schemes and attribute evaluators of the Weka learning environment. It can be used as a standalone tool for data analysis or as a data mining engine integrated into your own products.
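
To make the idea of operators recorded in XML more concrete, here is a minimal Python sketch. The XML layout, the operator names, and the `describe` helper are assumptions made for illustration only; they are not RapidMiner's actual process file format.

```python
import xml.etree.ElementTree as ET

# Hypothetical process description: operators nest inside one another.
# This layout is an illustrative assumption, not RapidMiner's real schema.
PROCESS_XML = """
<operator name="Root" class="Process">
  <operator name="Load" class="ReadCSV"/>
  <operator name="Validation" class="CrossValidation">
    <operator name="Learner" class="DecisionTree"/>
    <operator name="Tester" class="Performance"/>
  </operator>
</operator>
"""


def describe(node, depth=0):
    """Return one indented line per operator, in document order."""
    lines = ["  " * depth + f"{node.get('name')} ({node.get('class')})"]
    for child in node.findall("operator"):
        lines.extend(describe(child, depth + 1))
    return lines


tree_lines = describe(ET.fromstring(PROCESS_XML))
print("\n".join(tree_lines))
```

Walking the tree recursively mirrors how a graphical front end could display such a process: each nested element becomes one indented entry.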


Weka (Waikato Environment for Knowledge Analysis), written in Java, is a well-known machine learning software suite that supports several classic data mining tasks, notably data preprocessing, clustering, classification, regression, visualization, and feature selection. Its techniques assume that the data is available as a single flat file or relation, in which each data point is described by a fixed number of attributes. Weka uses Java's database connectivity to access SQL databases and can process the results returned by a database query. Its main user interface is the Explorer, but the same functionality is also available from the command line or through the component-based Knowledge Flow interface.
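
The single-relation assumption and the handling of database query results can be illustrated with a short Python sketch. It uses the standard-library `sqlite3` module with an in-memory database as a stand-in; Weka itself goes through Java's database access, and the table, column names, and toy values here are invented for illustration.

```python
import sqlite3

# In-memory database standing in for any SQL source (illustrative only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE weather (outlook TEXT, temperature REAL, play TEXT)")
conn.executemany(
    "INSERT INTO weather VALUES (?, ?, ?)",
    [("sunny", 29.0, "no"), ("overcast", 21.5, "yes"), ("rainy", 18.0, "yes")],
)

# A query result is a single flat relation: every row (data point)
# carries the same fixed set of attributes, given by the column names.
cursor = conn.execute(
    "SELECT outlook, temperature, play FROM weather ORDER BY temperature"
)
attributes = [col[0] for col in cursor.description]
instances = [dict(zip(attributes, row)) for row in cursor.fetchall()]
conn.close()

print(attributes)
for inst in instances:
    print(inst)
```

Each dictionary plays the role of one data point, and the shared key set plays the role of the fixed attribute list.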


jHepWork, designed for scientists, engineers, and students, is a free open source data analysis framework. It was created as an attempt to build a data analysis environment from open source libraries, offer a rich user interface, and compete with commercial software. It is mainly used for interactive two- and three-dimensional scientific plots, and it includes numerical libraries implemented in Java for mathematical functions, random numbers, and other data mining algorithms. jHepWork is based on the high-level programming language Jython, although Java code can also be used to call jHepWork's numerical and graphics libraries.


KNIME (Konstanz Information Miner) is a user-friendly, easy to understand, and comprehensive open source platform for data integration, data processing, data analysis, and data exploration. It lets users visually create data flows or pipelines, selectively execute some or all analysis steps, and later inspect the results, models, and interactive views. KNIME is written in Java and is based on Eclipse, using its plug-in mechanism to provide additional functionality. Through plug-ins, users can add modules for processing text, images, and time series, and can integrate various other open source projects, such as the R language, Weka, the Chemistry Development Kit, and LibSVM.
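
The idea of a data flow whose steps can be run selectively can be sketched in a few lines of Python. The `Pipeline` class, its `run` method, and the step names are hypothetical and unrelated to KNIME's actual API; the sketch only shows running part of a flow and keeping intermediate results for later inspection.

```python
class Pipeline:
    """A linear data flow: each step is a (name, function) pair."""

    def __init__(self, steps):
        self.steps = steps
        self.results = {}  # intermediate output of every executed step

    def run(self, data, upto=None):
        """Execute steps in order, stopping after the step named `upto`."""
        for name, func in self.steps:
            data = func(data)
            self.results[name] = data
            if name == upto:
                break
        return data


pipeline = Pipeline([
    ("drop_missing", lambda rows: [r for r in rows if r is not None]),
    ("square", lambda rows: [r * r for r in rows]),
    ("total", lambda rows: sum(rows)),
])

raw = [1, None, 2, 3]
partial = pipeline.run(raw, upto="square")  # run only the first two steps
full = pipeline.run(raw)                    # run the whole flow
print(partial, full)
```

Storing every intermediate output in `results` is one simple way to let a user go back and examine what each step produced.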
