R is a programming language and software environment for statistical computing and graphics. It was originally developed in the early 1990s by Ross Ihaka and Robert Gentleman at the University of Auckland in New Zealand. R Language is a popular language for data analysis and statistical computing due to its powerful data processing and visualization capabilities. It has a large and active community of users and developers who have contributed to a wide variety of packages for data manipulation, modeling, and visualization.
R is open source and free to use and runs on a variety of platforms including Windows, Mac OS X, and Linux. It is also highly extensible, with a rich ecosystem of packages and tools that make it easy to build custom solutions for a wide range of data analysis tasks. Some of the key features of R include:
- Built-in data structures for working with vectors, arrays, and matrices, lists, and data frames
- Powerful graphics and visualization capabilities, including support for a wide variety of chart types and plotting options
- Comprehensive support for statistical modeling and analysis, including linear and nonlinear regression, time series analysis, and more
- An active and supportive community of users and developers with a wide range of resources and tools available online
R Language Course Details
R language for statistical programming, various features of R, introduction to R Studio, statistical packages, familiarity with different data types and functions, learning to deploy them in various scenarios. Use SQL to apply ‘join’ function, components of R Studio like code editor, visualization and debugging tools and learn about R-bind.
Burraq IT Solutions is a leading training institute in Pakistan that provides various in-house and campus-based training programmes in the most in-demand and emerging skills of marketing, web development, designing, editing, and all other Professional skills.
R functions, code compilation and data in well-defined format called R Packages, R Package structure, package metadata and testing, CRAN (Comprehensive R Archive Network), vector creation and variables values assignment
Sorting Data Frame
R functionality, Rep function, generating repeats, sorting and generating factor levels, transpose and stack function
Matrices and Vectors
Introduction to matrix and vector in R, understanding various functions like Merge, Strsplit, Matrix manipulation, rowSums, rowMeans, colMeans, colSums, sequencing, repetition, indexing and other functions
Reading Data from External Files
Understanding subscripts in plots in R, how to obtain parts of vectors, using subscripts with arrays, as logical variables, with lists and understanding how to read data from external files
Generate plots in R, graphs, bar plots, line plots, histograms and components of a pie chart
Analysis of Variance (ANOVA)
Understanding analysis of variance (ANOVA) statistical technique, working with pie charts and histograms and deploying ANOVA with R, one-way ANOVA and two-way ANOVA
K-Means clustering for cluster and affinity analysis, cluster algorithm, cohesive subset of items, solving clustering issues, working with large datasets, association rule mining affinity analysis for data mining and analysis and learning co-occurrence relationships
Association Rule Mining
Introduction to Association Rule Mining, various concepts of Association Rule Mining, various methods to predict relations between variables in large datasets, algorithm and rules of Association Rule Mining and understanding single cardinality
Regression in R
Understanding what is simple linear regression, various equations of line, slope, Y-intercept regression line, deploying analysis using regression, the least square criterion, interpreting the results and standard error to estimate and measure of variation
Analyzing Relationship with Regression
Scatter plots, two-variable relationship, simple regression analysis and line of best fit
Deep understanding of the measure of variation, the concept of co-efficient of determination, F-test, the test statistic with an F-distribution, advanced regression in R and prediction linear regression
Logistic regression mean and logistic regression in R
Advanced Logistic Regression
Advanced logistic regression, understanding how to do prediction using logistic regression, ensuring if the model is accurate, understanding sensitivity and specificity, confusion matrix, what is ROC, a graphical plot illustrating binary classifier system and ROC curve in R for determining sensitivity/specificity trade-offs for a binary classifier
Receiver Operating Characteristic (ROC)
Detailed understanding of ROC, area under ROC curve, converting the variable, data set partitioning, understanding how to check for multicollinearity, how two or more variables are highly correlated, building of model, advanced data set partitioning, interpreting of the output, predicting the output, detailed confusion matrix and deploying the Hosmer-Lemeshow test for checking whether the observed event rates match the expected event rates
Database Connectivity with R
Connecting to various databases from the R environment, deploying the ODBC tables for reading the data and visualization of the performance of the algorithm using confusion matrix.
Logistic Regression Case Study
In this case study, you will get a detailed understanding of the advertisement spends of a company that will help to drive more sales. You will deploy logistic regression to forecast the future trends, detect patterns and uncover insights and more, all through the power of R programming. Due to this, the future advertisement spends can be decided and optimized for higher revenues.
Multiple Regression Case Study
You will understand how to compare the miles per gallon (MPG) of a car based on various parameters. You will deploy multiple regression and note down the MPG for the car make, model, speed, load conditions, etc. It includes the model building, model diagnostic and checking the ROC curve, among other things.
Receiver Operating Characteristic (ROC) Case Study
You will work with various data sets in R, deploy data exploration methodologies, build scalable models, predict the outcome with highest precision, diagnose the model that you have created with various real-world data, check the ROC curve and more.
Duration: 1 Month
Timing: 9AM-11AM, 11AM-1PM, 1PM-3PM, 3PM-5PM, 5PM-7PM, 7PM-9PM