Question 1

Data

Accepted Answer

Facts and figures from which conclusions can be drawn.

Question 2

What is Statistics?

Accepted Answer

Statistics is a way to get information from data.

Question 3

Data set

Accepted Answer

The data that are collected for a particular study

Question 4

Elements

Accepted Answer

Data set consists of Elements.  Ex: stocks, students, homes for sale, or other entries

Question 5

Variable

Accepted Answer

Any characteristic of an element. Ex: price of a stock, height of a student

Question 6

Measurement

Accepted Answer

A way to assign a value of a variable.

Question 7

Quantitative

Accepted Answer

The possible measurements  are numbers that represent quantities.

Question 8

Qualitative

Accepted Answer

The possible measurements are descriptive and not numbers.

Question 9

Cross-sectional data

Accepted Answer

Data collected at the same or approximately the same point in time

Question 10

Time series data

Accepted Answer

Data collected over different time periods

Question 11

Population

Accepted Answer

A set of all elements about which we wish to draw conclusions

Question 12

Census

Accepted Answer

An examination all elements of a population

Question 13

Sample

Accepted Answer

A subset of the elements of a population

Question 14

Descriptive Statistics

Accepted Answer

The science of describing the important aspects of a data set measurements.  DOES NOT allow us to draw any conclusions or make any interference about the data.

Question 15

Inferential Statistics or Statistical Inference

Accepted Answer

Set of methods, but it is used to draw conclusions or inferences about characteristics of populations based on data from a sample.  The process of making an estimate, prediction or decision about a population based on a sample.

Question 16

Statistical Inference

Accepted Answer

The science of drawing conclusion/inference about a population from a sample

Question 17

Bar Chart, Pie Chart, Pareto Chart, Histogram

Accepted Answer

A form of Descriptive Statistics using Graphical Techniques.  Allows statistics practitioners to present data in ways that make it easy for the reader to extract useful information.

Question 18

Mean, Median

Accepted Answer

Popular numerical techniques in descriptive statistics to describe the location of the data.

Question 19

Range, Variance, Standard Deviation

Accepted Answer

Numerical technique in descriptive statistics to measure the variability of the data.

Question 20

Business analytics

Accepted Answer

The use of traditional and newly developed statistical methods, advances in IS, and techniques from management science to explore and investigate past performance
Descriptive analytics,
Predictive analytics,
Prescriptive analytics

Question 21

Big data

Accepted Answer

Often needs quick analysis to support business decision making.

Question 22

Descriptive modeling

Accepted Answer

Which typically uses data aggregation to provide hindsight and insight into the past and strives to answer: “What has happened?”  
Predictive modeling

Question 23

Descriptive analytics

Accepted Answer

The use of traditional and newer graphics to represent easy-to-understand visual summaries of up-to-the-minute data
Dot plots,
Time series plots,
Bar chart,
Histograms,
Dashboards,
Numerical techniques

Question 24

Predictive analytics

Accepted Answer

Methods used to find anomalies, patterns, and associations in data sets to predict future outcomes
Linear regression,
Logistic regression,
Decision trees,
Neural networks,
Cluster analysis,
Factor analysis,
Association Rules

Question 25

Data mining

Accepted Answer

The use of predictive analytics, algorithms, and IS techniques to extract useful knowledge from huge amounts of data
K-Means algorithm,
Support Vector Machines,
Bayesian Belief Network,

Question 26

Prescriptive analytics

Accepted Answer

Looks at variables and constraints, along with predictions from predictive analytics, to recommend courses of action
Optimization sub-routine,
Liner programming,
Non-linear programming,
Dynamic programming,
Integer programming,
Simulation

Question 27

Nominal

Accepted Answer

A qualitative variable of description for which there is no meaningful ordering, or ranking, of the categories
Example: gender, car color
Only limited statistical techniques are applicable

Question 28

Ordinal

Accepted Answer

A qualitative variable for which there is a meaningful ordering, or ranking, of the categories
Example: teaching effectiveness, choice of preference
Only limited statistical techniques are applicable

Question 29

Interval Variables

Accepted Answer

Real numbers, i.e. heights, weights, prices, etc.
Also referred to as quantitative or numerical data.
Arithmetic operations can be performed on Interval Data, thus its meaningful to talk about 2*Height, or Price + $1, and so on.

Question 30

Qualitative Variables

Accepted Answer

Nominal and Ordinal.  The possible measurements are descriptive and not numbers.

Question 31

Graphical Descriptive Technique for Nominal/Ordinal (Qualitative) Data

Accepted Answer

Frequency,
Relative Frequency,
Percentage (%) Frequency,
Cumulative Relative Frequency (Ogive),
Bar Chart,
Pie-Chart,
Pareto Analysis,
Contingency Table

Question 32

Graphical Descriptive Technique for Interval (Quantitative) Data

Accepted Answer

Frequency Table,
Histogram,
Ogive,
Dot Plot,
Stem-and-Leaf Plot,
Scatterplot

Question 33

frequency distribution

Accepted Answer

We can summarize the data in a table that presents the categories and their counts called a frequency distribution.

Question 34

relative frequency distribution

Accepted Answer

Lists the categories and the proportion with which each occurs.

Question 35

Frequency

Accepted Answer

The number of items in each ‘class’ in the data

Question 36

Relative frequency

Accepted Answer

Summarizes the proportion of items in each class

Question 37

Bar chart

Accepted Answer

A vertical or horizontal rectangle represents the frequency for each category
Height can be frequency, relative frequency, or percent frequency

Question 38

Pie chart

Accepted Answer

A circle divided into slices where the size of each slice represents its relative frequency or percent frequency

Question 39

Pareto principle

Accepted Answer

In many economies, most of the wealth is held by a small minority of the population (80% - 20% principle)
Application:  a few classes of defects accounts for most quality problems in manufacturing.

Question 40

Development of Pareto Chart

Accepted Answer

Develop Bar chart representing the frequency of occurrence
Bars are arranged in decreasing height from left to right
Chart is augmented by plotting a cumulative percentage point for each bar (Pareto Line)

Question 41

Cross Classification Table

Accepted Answer

Lists the Frequency of each combination of values for two variables as a first step.
To describe the relationship between two nominal variables, we must remember that we are permitted only to determine the frequency of the values.

Question 42

Contingency Tables

Accepted Answer

Classifies data on two dimensions
Rows classify according to one dimension
Columns classify according to a second dimension

Question 43

Frequency Distribution

Accepted Answer

A frequency distribution is a list of data classes with the count of values that belong to each class
The frequency distribution is a table

Question 44

Histogram

Accepted Answer

The histogram is a picture of the frequency distribution

Question 45

K

Accepted Answer

K is the number of classes.
K = 1 + 3.3 Log10 (n)

Question 46

n

Accepted Answer

n is the number of elements within the sample.

Question 47

N

Accepted Answer

N is the number of elements in the entire population.

Question 48

Length or Width of a class

Accepted Answer

(Max - Min) / k

Question 49

Skewed to the right

Accepted Answer

The right tail of the histogram is longer than the left tail

Question 50

Skewed to the left

Accepted Answer

The left tail of the histogram is longer than the right tail

Question 51

Symmetrical

Accepted Answer

The right and left tails of the histogram appear to be mirror images of each other

Question 52

Cumulative Distributions

Accepted Answer

To do this, use the same number of classes, class lengths, and class boundaries used for the frequency distribution.
Rather than a count, we record the number of measurements that are less than the upper boundary of that class.
A running total

Question 53

Ogive

Accepted Answer

A graph of a cumulative distribution

Question 54

Frequency Polygons

Accepted Answer

Plot a point above each class midpoint at a height equal to the frequency of the class
Useful when comparing two or more distributions

Question 55

Dot Plots

Accepted Answer

A Dot placed on a real number line to quickly show potential
Useful for detecting outliers.

Question 56

Stem-and-Leaf Displays

Accepted Answer

Purpose is to see the overall pattern of the data, by grouping the data into classes
the variation from class to class,
the amount of data in each class,
the dist of the data within each class,
Best for small to moderately sized data distributions

Question 57

Scatter Plots

Accepted Answer

Used to study relationships between two variables
Each data has two-dimensions
Place one variable on the x-axis
Place a second variable on the y-axis
Place dot on pair coordinates

Question 58

Linear

Accepted Answer

A straight line relationship between the two variables

Question 59

Linear Positive

Accepted Answer

When one variable goes up, the other variable goes up

Question 60

Linear Negative

Accepted Answer

When one variable goes up, the other variable goes down

Question 61

No linear relationship

Accepted Answer

There is no coordinated linear movement between the two variables

Question 62

Data Warehouses

Accepted Answer

A process for centralized data management and retrieval and has as its ideal objective the creation and maintenance of a central repository for all of an organizations data.

Question 63

Response variable vs factors

Accepted Answer

When initiating a study, we first define our variable of interest, or response variable.  Other variables, typically called factors, that may be related to the response variable.

Question 64

Experimental Study

Accepted Answer

Means we are able to set or manipulate the values of the factors.

"Know" box contains:
Time elapsed:
Retries:

BSTAT 5301