Skip to content

Tag Archives: R Data-science

In this article, we will see what is Error in Hypothesis Testing, different types of errors that occur in hypothesis testing, and how to calculate… Read More
Cluster analysis or clustering is a technique to find subgroups of data points within a data set. The data points belonging to the same subgroup… Read More
Feature engineering is the most important technique used in creating machine learning models. Feature Engineering is a basic term used to cover many operations that… Read More
Split testing is another name of A/B testing and it’s a common or general methodology. It’s used online when one wants to test a new… Read More
A boxplot is a graphical representation of groups of numerical data through their quartiles. Box plots are non-parametric that they display variation in samples of… Read More
Prerequisite: Multiple Linear Regression using R A well-fitting regression model produces predicted values close to the observed data values. The mean model, which uses the… Read More
REST(Representational state transfer) API is an architectural style that includes specific constraints for building APIs to ensure that they are consistent, efficient, and scalable. REST… Read More
REST(Representational state transfer) API is an architectural style that includes specific constraints for building APIs to ensure that they are consistent, efficient, and scalable. REST… Read More
Arthur Samuel, a pioneer in the field of artificial intelligence and computer gaming, coined the term “Machine Learning”. He defined machine learning as – “Field… Read More
Apriori algorithm is used for finding frequent itemsets in a dataset for association rule mining. It is called Apriori because it uses prior knowledge of… Read More
Packages in the R programming are a collection of R functions, compiled code, and sample data. They are stored under a directory called “library” in… Read More
A popular nonparametric(distribution-free) test to compare outcomes between two independent groups is the Mann Whitney U test. When comparing two independent samples, when the outcome… Read More
The prime aim of any machine learning model is to predict the outcome of real-time data. To check whether the developed model is efficient enough… Read More
Repeated K-fold is the most preferred cross-validation technique for both classification and regression machine learning models. Shuffling and random sampling of the data set multiple… Read More
There are many cases in data analysis where you’ll want to compare means for two populations or samples and which technique you should use depends… Read More