click below
click below
Normal Size Small Size show me how
1- Research& Assess
Research and Assessment Methods
| Term | Definition |
|---|---|
| Mean | The sum of all the values in a data set divided by the total number of values in the set. Often represented by the symbol x . |
| Median | the midpoint value in an ordered distribution |
| Midrange | the value produced by adding the highest value and the lowest value and dividing by two. |
| Mode | the most frequent entry in a distribution. If a distribution has more than one mode, that distribution is said to be bi-modal. |
| Range | the difference between the highest and lowest value in the data set. The larger the number, the greater the range and thus the greater the dispersion. |
| Standard Deviation | a measure of how much the data in a certain collection are scattered around the mean. A low standard deviation means that the data are tightly clustered; a high standard deviation means that they are widely scattered |
| variable | a quantity or function that may assume any given value or set of values. Some prefer this alternate definition: a symbol for an unspecified member of a class of things or statements. |
| Types of variables include: | • Qualitative • Quantitative • Continuous • Dichotomous • Discrete |
| Qualitative | a descriptive term used when the above variable types fall on either the nominal or ordinal scale |
| Quantitative | a descriptive term used when the above variable types fall on either the interval or ratio scale |
| Continuous | a variable where the value can change to another at any time |
| Dichotomous | a variable where the value is fixed or static. |
| Discrete | a variable where the values are drawn from a finite set. |
| constant | is an unchanging value. The most recognized constant is the value for pi: 3.14 |
| SURVEYS | Surveys are a method of collecting information by asking a set of predetermined questions in a predetermined sequence via a controlled questionnaire to a representative sample (subset) of a given population (whole). |
| cross sectional survey | is used to evaluate a point in time |
| longitudinal survey | is used to evaluate a situation over time. |
| Types of surveys. | Mailed surveys Telephone surveys In-person surveys |
| sampling frame | the source and/or method used to draw the sample. |
| Types of sample designs (6) | Random, System, Stratified, Cluster, Convenience, Volunteer |
| Random | everyone has an equal chance of being selected |
| System | a list is developed, with every xth person selected |
| Stratified | a given population is divided into multiple strata (classes) from which sampling is conducted. |
| Cluster | a given population is divided into logical sections, with selections made from selected sections |
| Convenience | use the individuals and/or results that are readily available |
| Volunteer | individuals volunteer their participation |
| EMPIRICAL KNOWLEDGE | Empirical knowledge is a body of experience. It reflects what people learn through experience and common sense |
| 4 levels of measurement | Nominal Scale, Ordinal Scale, Interval Scale, Ratio Scale |
| Nominal Scale | is the assignment of numbers of symbols for the purpose of designating subclasses that represent unique characteristics. o Identifying (i.e. SSN, TIN, EIN, etc.) o Categorizing (i.e. gender, rank, color, affiliations, etc.) |
| Ordinal Scale | is the assignment of numbers or symbols for the purpose of identifying ordered relationships of a particular characteristic without specified intervals ( Height order of 1st Graders, Richest people in Phila County) |
| Interval Scale | is the assignment of numbers for the purpose of identifying ordered relationships of a particular characteristic with the ability to determine differences in amounts. o Temperature o Years of sunspot activity o Aircraft altitudes |
| Ratio Scale | is the assignment of numbers for the purpose of identifying ordered relations of some characteristic, with the ability to determine differences and an absolute zero point. |
| Frequency Distribution | is used to understand the raw data. Often in frequency distribution, the values are placed in some logical order (i.e. lowest to highest) so the researcher can quickly have a basic grasp of the “feel” of the data. |
| Frequency Distribution Approaches | Tabular formats, Class formats, Cross-Tabs, Histograms, Scatterplot, Frequency Polygons, Ogive |
| Variance | is the square of the standard deviation - the average squared deviations from the mean. The formula is the same as that for the standard deviation except the “s” variable is squared, and no square root function is performed. |
| Hypothesis Testing | is conducted to determine outcomes based on the scientific method. First, the statistician must declare the predicted (desired) outcome, then must also identify and describe all possible outcomes. |
| Research Hypothesis | (designated H1) is a statement that describes the interrelationships between different characteristics. It is what the researcher is seeking to prove through the analysis. |
| Null Hypothesis | (designated H0) is the opposite of the research hypothesis. It is what the researcher is seeking to prove wrong so that the research hypothesis can be assumed to be correct by implication |
| 2 concepts of distribution | Parametric- assumes the data are arranged in a normal distribution and are measured on the interval scale. Non-parametric- is used when the data are arranged in a skewed or other non-normal pattern. |
| Parametric terms | Normal Distribution (bell curve), Z-score, T-test, ANOVA, Correlation, Regression |
| Correlation | measures the strength of the relationship between variables or the degree to which two variables are correlated. It is used to demonstrate relationships between situations and/or actors, even disparate ones (think apples and oranges). The test is linear. |
| Z-score | is a measure of the distance, from the mean. It is used to determine if something would, or would not, happen. |
| T-Test | Allows us to compare the means of two groups and determine how likely the difference between the two means occurred by chance. *Researcher to know the # in each group, diff. between the means of each group, and the standard deviation for each group. |
| Types of T-Tests | Correlated t-test: concerned with the diff. between the ave. scores of a single sample of individuals who is assessed at 2 diff times(“before” vs. “after”) Independent t-test: Compares ave. of two samples that are selected independently of each other. |
| ANOVA | is an extension of the t-test. It stands for Analysis of Variance. ANOVA identifies the relationship between two variables. |
| Regression | is a statistical test of the effect one variable (condition/actor) has on another while holding all other conditions constant. This test is also linear |
| Non-parametric terms | Positive Skew, Negative skew, Chi-Square |
| POPULATION TERMS | Estimates, Projections, Forecasts, Migration, birth rate, death rate, migration rate |
| Estimates | calculated for current population levels |
| Forecasts | are subjective and apply only to selected projections |
| Projections | are calculated for future population levels |
| Migration | is the movement of people into and out of a given study area. |
| Birth Rate | is the total number of babies born per 1000 females in their childbearing years (typically 15-40). |
| Death Rate | is the total number of deaths per 1000 people in the total population |
| Migration Rate | 2 subcategories: o In-Migration measures the total number of people coming into study area. o Out-Migration measures the total number of people leaving study area. |
| Historical Extrapolation | is a basic method of projecting or estimating a population. This method assumes that past trends will continue into the future. As with any extrapolation, projecting too far into the future will result in erroneous conclusions |
| Linear projections | assume that the population change will continue at the same numeric increase over the life of the projection. The formula for linear projections is y=a+bx. The result is a straight line on the graph. |
| Geometric projections | assume the rate of change remains constant., resulting in a curved line. The formula is y=abx |
| Exponential projections | assume that the population change will change exponentially over the life of the projection. The formula is y=aebx. The result is a curved line on the graph. |
| Modified Exponential projections | assume that the pop will change exponentially over the life of projection, but assumption is subject to a cap on the change. The formula is y=c+abx. The result is again a curved line, but the upper or lower limit modifies the line into an azimuth. |
| Gombertz projections | are modified exponential projections on steroids. It is used to project a change in direction over time. Formula: Yc = ca ^(bx) |
| Polynomial projections | are calculated y=a+bx+cx2+dx3+ … αxn |
| The Ratio Method | compares the study area (i.e. a City) to a larger known entity (i.e. a State). The method is good for point in time comparisons as well as comparing relative changes over time. |
| Cohort Component (aka Cohort Survival) | population projections are commonly used to predict what the population will be for a given area in the future. The projection is based on past trends and data |
| ECONOMIC BASE ANALYSES | Economic base analyses divide regional industries into Basic (export) and Non-basic (local) sectors and assumes that the basic sector (exports!) drives the economy |
| limitations to economic base analysis | -Does not account for demographics -No spatial orientations -Before the “information age” (no e-commerce, telecommuting) -US Census Bureau, the DOL is unreliable below county level -Shift from the SIC system to the NAICS |
| Export base defining ways | Empirical Approach, Minimum Requirements Approach, Location Quotient |
| SHIFT-SHARE ANALYSIS | Shift-Share Analysis is a descriptive technique for analyzing sources of change in the regional economy by looking at national share, industry mix, and regional shift. |
| National Share (NS) | estimates the total employment in a given industry in the region if said industry in the region grows at the same rate as the nation. In a nutshell, it simply shows the extent to which the national economy grows or declines. |
| Industry Mix (IM) | Estimates relative change in employment in an industry based on the diff. in growth rates between industry nationally / national economy. Shows which industries in your region are growing or declining relative to the national economy. |
| Regional Shift (RS) | Estimates change in employment in an industry in region based on the diff. in growth rates between industry in the region and the same industry nationally. |
| Calculate Shift-Share | To calculate shift-share, for a given data type (we’ll use employment here), simply add the three variables together: Ei,r = NSi,r + IMi,r + RSi,r |
| INPUT-OUTPUT ECONOMIC ANALYSIS | Input-output analysis focuses on intermediate sales between an economy’s sectors, or the circular flow of the economy (see graphic below). It is based on more of an accounting methodology than a theory (unlike economic base analysis). |
| INPUT-OUTPUT TABLES (3) | Transactions table: shows cash flow between economic sectors Direct requirements table: produced by dividing each cell in the by the sum of that cell’s column Total requirements table: Re-iterates the reqs table ($/ each economic sector) |
| CENSUS 2000 | “The 2000 Decennial Censuses of Population & Housing.” was performed as follows. • USPS- Master Address File was created from the 1990 file • A questionnaire was mailed/ collected • Info was entered into a digital database, then tabulated and mapped. |
| Census statistical units | Metropolitan Statistical Area (MSA), Consolidated Metropolitan Statistical Area (CMSA), Primary Metropolitan Statistical Area (PMSA), Minor Civil Division (MCD), Census Tract, Block Groups, Tribal Designated Statistical Area |
| Consolidated Metropolitan Statistical Area (CMSA) | A regional MSA that is composed of two or more smaller, constituent MSAs, each of which is called a Primary Metropolitan Statistical Area (PMSA) |
| Metropolitan Statistical Area (MSA): | An area – consisting of one or more counties that contains an urbanized core of over 50,000 residents |
| Minor Civil Division (MCD): | A Census unit that is only used in 29 states, and that usually (always in Pennsylvania) corresponds to a municipality |
| Census County Divisions | are used in the 21 states that don’t have MCDs. |
| Census Tract: | A small Census unit with approximately 4,000 residents. Census tracts are only used in MSAs and in some other counties |
| Block Numbering Areas | are used where census tracts are not. |
| Block Groups | These are groups of blocks, which are the smallest Census units. |
| Tribal Designated Statistical Area | A census unit drawn by tribes that do not have a recognized land area. |
| POPULATION ESTIMATES | Extrapolation techniques, Cohort-Component Population Projections, The Housing Unit Method, The Ratio Correlation Method, Ratio (or Step-Down) Method, Administrative Records Method, Comparative Method |
| Information Systems | RELATIONAL DATABASES (RDBMS) MANAGEMENT INFORMATION SYSTEMS (MIS) SPATIAL DATABASES (SDB) GEOGRAPHIC INFORMATION SYSTEMS (GIS) |
| Impact Analyses | -NET PRESENT VALUE -BENEFIT-COST ANALYSIS -COST EFFECTIVENESS ANALYSIS -FISCAL IMPACT ANALYSIS |
| NET PRESENT VALUE | The net present value formula is used to show the net monetary value of a project, discounted to present value *Internal Rate of Return: This analysis technique uses a variation of the net present value formula |
| BENEFIT-COST ANALYSIS | Benefit-Cost Analysis is used both to determine the net monetary value of a project and to weigh the net monetary values of alternative, competing projects |
| BENEFIT-COST ANALYSIS Alternatives | The Total Cost of a Project Over its Lifetime and The Annualized Cost of a Project, Planning Balance Sheets, Goals Achievement Matrix |
| COST EFFECTIVENESS ANALYSIS | Cost effectiveness analysis (or CEA) is usually used to compare two competing projects that will provide roughly the same benefits. *However, it is more complex than merely comparing the total or annualized costs of the projects over their lifetimes |
| Equivalent Uniform Annual Cost Analysis (EUAC) | EUAC is a more complex form of comparing the annualized costs of two or more competing projects with roughly identical benefits |
| Equivalent Uniform Annual Benefit Analysis (EUAB) | EUAB compares the benefits of two or more competing projects with roughly the same costs |
| FISCAL IMPACT ANALYSIS | Also called “cost-revenue analysis.” In general, fiscal impact analysis is used to estimate the costs and revenues that a proposed development will bring to an area’s governments and schools. |