Bachelors Level/First Year/Second Semester/Science bit/second semester/basic statistics/syllabus

Bachelors In Information Technology

Institute of Science and Technology, TU

Nature of the course: (Theory+Lab)

F.M: 60+20+20 P.M: 24+8+8

Credit Hrs: 3Hrs

Basic Statistics [STA154]
Course Objective
i.
To impart the knowledge of descriptive statistics, correlation, regression, concept of sampling and sampling distribution, theoretical as well as applied knowledge of probability and some probability distributions.
Course Description

The course familiarizes students with the basic concepts of statistics including introduction, diagrammatical and graphical representation, descriptive statistics, probability, random variables, sampling, and correlation and regression.

S1:Introduction[5]
1
Basic concept of statistics(definitions, concept of descriptive and inferential statistics), application of Statistics in different fields including information technology, limitations of statistics; scales of measurement(nominal, ordinal, interval and ratio), variables(Discrete, continuous and categorical), types of data(cross-sectional and longitudinal) and data source(primary and secondary), data preparation- editing, coding, and transcribing.
S2:Diagrammatical and Graphical Presentation of Data[3]
1
Bar diagrams; Pie diagrams; Pareto chart; Graph of frequency distribution (Histogram, frequency polygon, frequency curve, less than ogive and more than ogive, stem and leaf display) and their interpretations
S3:Descriptive Statistics[7]
1
Measures of central tendency: Definition of measures of central tendency, arithmetic mean and its mathematical properties, weighted mean, median, mode, empirical relations between mean , median and mode; choice of measure of central tendency, interpretations
2
Measures of dispersion: Need of measures of dispersion, absolute and relative measures, range, quartile deviation, mean deviation, standard deviation and their relative measures including coefficient of variation, choice of appropriate measure of dispersion, interpretations
3
Measures of skewness: Concept of skewness, types of skewness, Pearson’s coefficient of skewness, Bowley’s coefficient of skewness; Exploratory Data Analysis (EDA): Five number summary, box and whisker plots, outliers, use of five number summary and boxplots to assess the skewness of data distribution
4
Measures of kurtosis: Concept of kurtosis, types of kurtosis, measure of kurtosis based on percentiles; overall assessment of nature of data distribution
5
Moments: Introduction of moments, central moments and raw moments, relations between central moments and raw moments, measures of skewness and kurtosis based on moments
6
Problems and illustrative examples related to IT
S4:Introduction to Probability[7]
1
Concepts of probability, definitions of probability (mathematical, statistical and subjective approach), terminologies used in probability, laws of probability (additive and multiplicative), conditional probabilities, Bayes theorem: prior and posterior probabilities
2
Problems and illustrative examples related to IT
S5:Random Variables and Mathematical Expectation[3]
1
Concept of a random variable and its types, probability distribution of a random variable, mathematical expectation of a discrete random variable, standard deviation and variance of discrete random variable, addition and multiplication theorems of expectation and variance(without proof).
2
Problems and illustrative examples related to IT
S6:Probability Distributions[6]
1
Probability distribution function, Binomial distribution, Poisson distribution, Normal distribution and their characteristic features; applications of these distributions in IT related data problems
2
Problems and illustrative examples related to computer Science and IT
S7:Sampling and Sampling Distribution[7]
1
Definitions of population, sample survey vs. census survey, sampling error and non-sampling error, concept of parameter and statistic, types of sampling(concept of simple random, stratified, cluster and systematic sampling, concept of non-probability sampling), standard error of mean, standard error of proportion, sampling distribution of mean and proportion, need of inferential statistics, concept of central limit theorem, concept of estimation(point and interval), confidence interval estimation for mean & proportion, problem specific interpretations of confidence interval
2
Problems and illustrative examples related to IT
S8:Correlation and Linear Regression[7]
1
Bivariate data, bivariate frequency distribution, correlation between two variables, Karl Pearson’s coefficient of correlation(r), assumptions of Pearson’s correlation coefficient, properties of correlation coefficient, Spearman’s rank correlation including repeated ranks, interpretation of correlation coefficient, need of regression analysis, fitting of lines of regression by the least squares method, interpretation of regression coefficients, coefficient of determination(R2) and its interpretation, residual plots for assessing the goodness of fit of the model
2
Problems and illustrative examples related to IT
References
1.
Michael Baron (2013). Probability and Statistics for Computer Scientists. 2nd Ed., CRC Press, Taylor & Francis Group, A Chapman & Hall Book.
2.
Ronald E. Walpole, Raymond H. Myers, Sharon L. Myers, & Keying Ye(2012). Probability & Statistics for Engineers & Scientists. 9th Ed., Printice Hall.
3.
Douglas C. Montgomery & George C. Ranger (2003). Applied Statistics and Probability for Engineers. 3rd Ed., John Willey and Sons, Inc.
4.
Richard A. Johnson (2001). Probability and Statistics for Engineers. 6th Ed., Pearson Education, India
Labrotary Work
Computational Statistics (Using any statistical software such as Microsoft Excel, SPSS, STATA, etc. whichever convenient)
1.
Diagrammatical and graphical presentation of data
2.
Computation of measures of central tendency (ungrouped and grouped data), use of an appropriate measure and interpretation of results and computation of partition values
3.
Computation measures of dispersion (ungrouped and grouped data) and computation of coefficient of variation.
4.
Measures of skewness and kurtosis using method of moments, measures of skewness using box and whisker plot
5.
Scatter diagram, correlation coefficient (ungrouped data) and interpretation. Compute manually and check with computer output
6.
Fitting of simple linear regression model (results to be verified with computer output), residuals plot
7.
Conditional probability and Bayes theorem
8.
Problems related to Binomial, Poisson and Normal probability distributions
9.
Problems related sampling and sampling distribution of mean and proportion, confidence interval estimation for mean and proportion