|
Data Mining Using SAS Enterprise Miner |
|||||||||||||
![]() |
Data Mining Using SAS Enterprise Miner introduces the reader to a wide variety of data mining techniques in SAS® Enterprise Miner. This first-of-a-kind book explains the purpose of -- and reasoning behind -- every node that is a part of SAS® Enterprise Miner with regard to SEMMA design and SAS data mining analysis. Each chapter starts with a short introduction to the assortment of statistics that are generated from the various SAS® Enterprise Miner nodes, followed by detailed explanations of the configuration settings and the generated results that are located within each node. The end result of the author’s meticulous presentation is a well crafted study guide on the various methods that one employs to randomly sample, partition, transform, and filter the data within the process flow of SAS® Enterprise Miner. The book will explain the wide assortment of modeling designs that are available in addition to the process of assessing the various models under comparison in SAS® Enterprise Miner v4.3. | ||||||||||||
| view back cover | |||||||||||||
|
A Quick Peek of the Book |
|||||||||||||
| Extra Chapters of my Book | |||||||||||||
|
Contents of my Book on this Page |
|||||||||||||
| SAS Data Mining Links | |||||||||||||
| · SAS Data Mining Links | |||||||||||||
|
|||||||||||||
|
The following is the SAS/IML programming code that is in regards to my book, Data Mining Using SAS® Enterprise Miner. |
|||||||||||||
| Linear Regression Modeling: SAS/IML programming code that computes traditional regression estimates. The SAS/IML program will calculate the predicted values, residual values, parameter estimates and associated standard errors and t-test statistics. | |||||||||||||
| Logistic Regression Modeling: SAS/IML programming code that computes logistic regression estimates. The SAS/IML program will calculate the parameter estimates and the likelihood ratio goodness-of-fit statistics by fitting a binary-valued response variable to predict based on the maximum likelihood method. An iterative process is applied in computing the maximum likelihood parameter estimates to determine the final parameter estimates until convergence. | |||||||||||||
| K-Means Clustering: SAS programming code that computes the k-means clustering estimates. Each observation is assigned to the cluster with the smallest squared Euclidean distance based on two separate clusters that are created. | |||||||||||||
| Principal Components: SAS/IML programming code that computes the principal component estimates from the 2004 major league baseball hitters. The SAS/IML program will calculate the principal component scores based on the correlation matrix since the various hitting departments that are measured in different units. A scatter plot from the first two principal components will be generated in order for you to observe the variability, outliers and the various groupings that are formulated from the best hitters in the game of baseball. | |||||||||||||
| Matignon, Randall, An Overview of SAS Enterprise Miner: The paper is designed to make the reader get familiar with the working environment of SAS® Enterprise Miner v4.3. The paper will provide you with the general option settings that are available when you first open SAS® Enterprise Miner v4.3. | |||||||||||||
| Matignon, Randall, Data Mining Using SAS Enterprise Miner: The paper is in reference to my book that is a overview to the multitude of nodes that are available in SAS® Enterprise Miner v4.3. The paper will provide you with the purpose of each node, the option settings that are available and the results that are generated from each node. | |||||||||||||
| SAS Institute, Finding the Solution to Data Mining: This is an update to the subsequent paper on SAS® Enterprise Miner. | |||||||||||||
| SAS Institute, Finding the Solution to Data Mining: The paper is in reference to understanding the capability of data mining and the various nodes that can be used in Enterprise Miner v4.3 to perform data mining. | |||||||||||||
| Lajiness, Michael S., A Practical Introduction to the Power of Enterprise Miner: This is a well-written paper on SAS® Enterprise Miner. The paper provides a short description of the SEMMA process in designing a process flow diagram using SAS® Enterprise Miner for statistical modeling. | |||||||||||||
| Ripley, B.D. , Statistical Data Mining: The paper is written by one of the most knowledgeable person in the field of data mining. The paper explains both traditional clustering and SOM clustering along with some of the modeling designs that are used in Enterprise Miner v4.3. | |||||||||||||
| SAS Institute, Data Mining and the Case for Sampling: The paper explains the various sampling methods that are available in the Sampling node of SAS Enterprise Miner v4.3 This paper discusses the use of sampling as a statistically valid practice for processing large databases. The paper discusses the advantages and disadvantages of sampling for data mining, in addition, to explaining the importance of a random sample in achieving a quality sample. | |||||||||||||
| Allison, Paul, Multiple Imputation of Missing Data: The paper is in reference to the multiple imputation method that is an option that is available in the Replacement node of SAS® Enterprise Miner. The Replacement node is designed to impute or estimate missing values. Multiple imputation uses an appropriate model to predict the missing values of the variable by all other variables with non-missing values by iteratively fitting the model numerous times, then averaging the estimates. | |||||||||||||
| Bao, Xlinli, Mining Transaction/Order Data Using SAS Enterprise Miner: The paper is in regards to the Association node in SAS® Enterprise Miner. The paper described the process of analyzing items purchased from the ASSOCS data set SAS® data set that was used in my book. | |||||||||||||
| Neville, Padraic, Decision Trees for Predictive Modeling: The paper is in reference to the decision tree modeling that is used in the Tree node in SAS® Enterprise Miner. The article will explain the various decision tree modeling methods. In addition, the article will briefly explain the various ensemble modeling designs such as combining models, boosting, and bagging resampling. | |||||||||||||
| SAS Institute, The ARBORETUM Procedure: The paper is in reference to the SAS® data mining procedure that is used to perform decision tree modeling. The importance of this paper is that some of the option settings that are available in the the Tree node are explained in the article. | |||||||||||||
| SAS Institute, DMNeural Procedure: The paper is in reference to the SAS® dmneural procedure that is used to perform dmneural network modeling. The importance of this paper is that some of the option settings that are available in the the Princomp/Dmneural node are explained in the article. | |||||||||||||
|
Cox, James, Multidimensional Binary Search Trees Used for Associative Searching: The paper is in reference to the RD-tree partitioning technique that is used in the Memory-Based Reasoning node in SAS® Enterprise Miner. The partitioning technique is designed to determine the number of data points to use in calculating the fitted values in nearest neighbor modeling. The number of data points to combine is determined by the smoothing constant that must be provided in the predictive or classification modeling design. The technique performs binary splits to the data in which the final partitioning of the data results in a hypercube of data points that are used in calculating the fitted values. |
|||||||||||||
|
Breiman, Leo, Arcing Classifiers: The paper is in reference to boosting resampling that is used in the Ensemble node. The paper provides the reader with the formula that is used in SAS® Enterprise Miner and boosting resampling in which the weight estimates are calculated that are used to adjust the estimated probabilities from the classification model to generate the probability estimates to the boosting model. |
|||||||||||||
| Cerrito, Patrica B., Comparison of Enterprise Miner and SAS/Stat for Data Mining: The paper compares some of the procedure output listings from various statistical procedures that are available in SAS® for data mining with the results that are generated from some of the SAS® Enterprise Miner nodes. | |||||||||||||
| The following link is to my personal blog. The purpose of my blog is to give you an opportunity to respond to me by posting any questions that you might have with my book, that is "Data Mining Using SAS Enterprise Miner". Otherwise, please feel free to ask any questions that you might have with SAS® Enterprise Miner. | |||||||||||||
| Data Mining Using SAS Enterprise Miner Blog | |||||||||||||
|
John Wiley & Sons, Inc. 111 River Street |
|||||||||||||
|
© copyright www.sasenterpriseminer.com - SAS data mining analysis | data mining modeling. |