Machine learning can learn from the data that is processed and analyzed. In other words, the more data that is processed, the more it can learn.Machine learning has similarities to the human reasoning process. Unlike traditional analysis, the generated model cannot evolve as data is accumulated.
R, as a dialect of GNU-S, is a powerful statistical language that can be used to manipulate and analyze data. Additionally, R provides many machine learning packages and visualization functions, which enable users to analyze data on the fly. Most importantly, R is open source and free
This course has been designed keeping the current requirements in the world of Data Analysis. This knowledge will open up huge opportunities for students who would like to pursue careers in the field of DATA SCIENCE.
Pedagogy
The Course will be taught using a combination of Lecture and hands on with standard sample datasets. Participants will bring their own Laptops.
Prerequisite
The participants should have undergone a course on basic and inferential statistics. Knowledge of computer basics
Course Outline : |
|
Introduction to R |
- Downloading and installing R
- Downloading and installing R Studio
- Installing and loading packages
- Reading and writing data
- Using R to manipulate data
- Applying basic statistics
- Visualizing data
|
Regression with R |
- Linear
- Multiple
- Logistic Regression
- Regression Diagnostics
|
Classification |
- Logistic
- Decision Trees
- KNN
|
Unsupervised Learning |
|
Model evaluation |
- Confusion Matrix
- Kappa Statistics
- AIC Criteria
- ROC curve
|
Machine Learning and deep Learning
Course Outline : |
|
Introduction to R – (16 hrs) |
- Downloading and installing R
- Downloading and installing R Studio
- Installing and loading packages
- Reading and writing data
- Using R to manipulate data
- Applying basic statistics
- Visualizing data
|
Introduction to Analytics |
|
Introduction – Basics of Statistics –
Descriptive statistics,
Hypothesis testing,
Estimation – (16 hrs) |
- Data exploration
- Data Preparation
- Model Building
- Model Evaluation
|
Predictive Analytics–(12 hrs) |
- Linear and Multilinear prediction with case study
- Regression Trees
- Understanding Regression Trees – why?
- SDR
- Model Evaluation
- Time series - Understanding of time series and suitability with case study
|
Classification/Regression Analytics with Algorithms and case study – (16- hrs) |
- Logistic Regression
- KNN
- K Means
- Random Forest
- SVM
|
NLP – (6 hrs) |
- Introduction to text mining and NLP
- Concept of document and Corpus
- Document Matrix
- Plotting word cloud
- Bayesian rule
- Model building
- Model accuracy
- Model improvement
|
Neural Network – (8hrs) |
- Introduction to deep learning
- Concept of Artificial neural
- Building Feed Forward NeuralNetwork
- Impact of Hidden Layers
- Gradient Descent
- Weights Update
- Model building
- Model evaluation and improvement
|