Data Mining Algorithms – Principal Component Analysis
Principal component analysis (PCA) is a technique used to emphasize the majority of the variation and bring out strong patterns in a dataset. It is often used to make data easy to explore and...
View ArticleData Mining Algorithms – Support Vector Machines
Support vector machines are both, unsupervised and supervised learning models for classification and regression analysis (supervised) and for anomaly detection (unsupervised). Given a set of training...
View ArticleData Mining Algorithms – Pluralsight Course
This is a bit different post in the series about the data mining and machine learning algorithms. This time I am honored and humbled to announce that my fourth Pluralsight course is alive. This is the...
View ArticleData Mining Algorithms – Naive Bayes
I am continuing with my data mining and machine learning algorithms series. Naive Bayes is a nice algorithm for classification and prediction. It calculates probabilities for each possible state of the...
View ArticleData Mining Algorithms – Decision Trees
Decision Trees is a directed technique. Your target variable is the one that holds information about a particular decision, divided into a few discrete and broad categories (yes / no; liked / partially...
View ArticleData Mining Algorithms – Neural Network
A neural network is a powerful data modeling tool that is able to capture and represent complex input/output relationships. The motivation for the development of neural network technology stemmed from...
View ArticleDevWeek 2016 BI in SQL Server 2016 Workshop Setup
I got some questions about virtual machine / notebook setup for my Business Intelligence in SQL Server 2016 DevWeek post-conference workshop. I am writing this blog because I want to spread this...
View ArticleData Mining Algorithms – Logistic Regression
It’s been awhile since I wrote the last blog on the data mining / machine learning algorithms. I described the Neural Network algorithm. In addition, it is a good time to write another post in order to...
View ArticleEmbrace R @ SQL Nexus 2017 & SQL Saturday #626
R is the hottest topic in SQL Server 2016. If you want to learn how to use it for advanced analytics, join my seminar at SQL Nexus conference on my 1st in Copenhagen. Although there is still nearly a...
View ArticleData Mining Algorithms – EM Clustering
With the K-Means algorithm, each object is assigned to exactly one cluster. It is assigned to this cluster with a probability equal to 1.0. It is assigned to all other clusters with a probability equal...
View Article