ARM Lecture 5

November 22, 2018 | Author: fasal malik | Category: Stratified Sampling, Sampling (Statistics), Sample Size Determination, Statistics, Confidence Interval

Share Embed Donate

Report this link

Short Description

Download ARM Lecture 5...

Description

Advanced Research Methods (ARM) Company Logo

Sampling Design

Lecture - 5

What is difference between data and statistics?



Recall… 

Statistics is a tool for converting data into information:: Statistics information

Data

Information

But where then does data come from? How is it gathered? How do we ensure its accurate? Is the data reliable? Is it representative of the population from which it was drawn?

Sampling Sampling is that part of statistical practice which is concerned with the selection of individual observations intended to yield some knowledge about a population of concern, especially for the purposes of statistical inference.



… Sampling is the process of selecting a small number of elements from a larger defined target group of elements such that the information gathered from the small group will allow judgments to be made about the larger groups

What is a sample?

A sample is a portion of the elements of a population. A sample is chosen to make inferences about the population by examining or measuring the elements in the sample.

Reasons for Sampling Researchers rarely survey the population for two reasons(Adér, Mellenbergh, & Hand, 2008): (1)The cost is too high and (2)The population is dynamic, i.e., the component of population could change over time. E.g. patients in a hospital

Advantages of sampling: (1) The cost is lower, (2) Data collection is faster, and (3) It is possible to ensure homogeneity and to improve the accuracy and quality of the data because the data set is smaller.

Basics of Sampling Theory Population Element Defined target population Sampling unit Sampling frame

Selection of Elements Population



Population Element



Sampling



Survey





Census

Definitions 

Population: The target population is the collection of elements or objects that possess the information sought by the researcher and about which inferences are to be made. The target population should be defined in terms of elements, sampling units, extent, and time.  An element is the object about which or from which the information is desired, e.g., the respondent.  A sampling A sampling unit is an element, or a unit containing the element, that is available for selection at some stage of the sampling process. E.g. organization organization  Extent refers to the geographical boundaries.  Time is the time period under consideration.

Sampling frame Sampling frame (synonyms: "sample frame", "survey frame") is the actual set of units from which a sample has been drawn: in the case of a simple random sample, all units from the sampling frame have an equal chance to be drawn and to occur in the sample.  In the ideal case, the sampling frame should coincide with the population of interest. 

Example 

Consider, a survey aimed at establishing the number of potential customers for Easypaisa in the population of Islamabad City. The research team has drawn 1000 numbers at random from a telephone directory for the city, made 200 calls each day from Monday to Friday from 8am to 5pm and asked some questions. 

In this example, population example, population of interest is all inhabitants of the city; the sampling frame includes only those dwellers who satisfy all the following conditions:  has a telephone;  the telephone number is included in the directory;  likely to be at home from f rom 8am to 5pm from Monday to Friday;  not a person who refuses to answer all telephone surveys.

Sampling Plans… A sampling plan is just a method or procedure A sampling for specifying how a sample will be taken from a population.



What is a Good Sample? Accurate: absence of bias





Precise estimate: sampling error

Is sample unbiased?

Types of Errors Sampling and  Non-Sampling Errors… 

Sampling Error 

Sampling error is any type of bias that is attributable to mistakes in either drawing a sample or determining the sample size

Sampling errors are caused by sampling design. It includes: (1) Selection error: error: Incorrect selection probabilities are used. (2) Estimation error: error: Biased parameter estimate because of the elements in these samples.



E.g. Two samples of size 10 of 1,000 households. If we happened to get the highest income level data points in our first sample and all the lowest income levels in the second, this delta is due to sampling error.





Increasing the sample size will reduce this type of error.

Non-sampling errors 

Nonsampling errors are more serious and are due to mistakes made in the acquisition of data or due to the sample observations being selected improperly. Non-sampling errors are caused by the mistakes in data processing. It includes: (1) Overcoverage Overcoverage:: Inclusion of data from outside of the population. (2) Undercoverage Undercoverage:: Sampling frame does not include elements in the population. (3) Measurement error: error: The respondent misunderstand misunderstand the question. error: Mistakes in data coding. (4) Processing error: (5) Non-response Non-response::

Increasing the sample size will size will not reduce this type of error.  Acquisition errors arise from the recording of incorrect responses, due to: 

   

— incorrect measurements being taken because of faulty equipment, — mistakes made during transcription from primary sources, — inaccurate recording of data due to misinterpretation of terms, or — inaccurate responses to questions concerning sensitive issues.

Sampling Methods

Probability sampling

Nonprobability sampling

Steps in Sampling Design What is the relevant population?  What are the parameters of interest?  What is the sampling frame? frame?  What is the type of sample?  What size sample is needed?  How much will it cost? 

Steps

Define the Population Determine the Sampling Frame Select Sampling Technique(s) Determine the Sample Size Execute the Sampling Process

Classification of Sampling Techniques Sampling Techniques

Nonprobability Sampling Techniques

Convenience Sampling

Judgmental Sampling

Simple Random Sampling

Systematic Sampling

Probability Sampling Techniques

Quota Sampling

Stratified Sampling

Snowball Sampling

Cluster Sampling

Other Sampling Techniques

Company Logo

Non-Probability Sampling Designs

Nonprobability Nonprobability Sampling Methods Convenience sampling relies upon convenience and access Judgment sampling relies upon belief that participants fit characteristics Quota sampling emphasizes representation of specific characteristics Snowball sampling relies upon respondent referrals of others with like characteristics

Nonprobability Sampling Reasons to use  Procedure satisfactorily meets the sampling objectives  Lower Cost  Limited Time  Not as much human error as selecting a completely random sample  Total list population not available

Nonprobability Sampling Convenience Sampling  Purposive Sampling 

 

Judgment Sampling Quota Sampling

Snowball Sampling



Convenience Sampling Convenience sampling attempts to obtain a sample of convenient elements. Often, respondents respondents are selected because they happen to be in the right place at the right time.  

 

use of students, and members of social organizations mall intercept interviews without qualifying the respondents department stores using charge account lists “people on the street” interviews

Judgmental Sampling Judgmental sampling is a form of convenience sampling in which the population elements are selected based on the judgment of the researcher.  



test markets purchase engineers selected in industrial marketing research expert witnesses used in court

Quota Sampling Quota sampling may be viewed as two-stage restricted judgmental sampling.  The first stage consists of developing control categories, or quotas, of population elements.  In the second stage, sample elements are selected based on convenience or judgment. Population composition Control Characteristic Sex Male Female

Sample composition

Percentage

Percentage

Number

48 52 ____ 100

48 52 ____ 100

480 520 ____ 1000

Snowball Sampling In snowball sampling, sampling, an initial group of respondents is selected, usually at random. After being interviewed, these respondents are asked to identify others who belong to the target population of interest.  Subsequent respondents are selected based on the referrals. 

Company Logo

Probability Sampling Designs

Probability Sampling Designs Simple random sampling  Systematic sampling  Stratified sampling 

 

Proportionate Disproportionate

Cluster sampling  Double sampling 

Simple Random Sampling 





Each element in the population has a known and equal probability of selection. Each possible sample of a given size (n) has a known and equal probability of being the sample actually selected. This implies that every element is selected independently of every other element.

Systematic Sampling The sample is chosen by selecting a random starting point and then picking every ith element in succession from the sampling frame.  The sampling interval, i, is determined by dividing the population size N by the sample size n and rounding to the nearest integer.  When the ordering of the elements is related to the characteristic of interest, systematic sampling increases the representativeness of the sample.  For example, there are 100,000 elements in the population and a sample of 1,000 is desired. In this case the sampling interval, i, is is 100. A random number between 1 and 100 100 is selected. If, for example, this number is 23, the sample consists of elements 23, 23 , 123, 223, 323, 423, 523, and so on. 

Stratified Sampling A two-step process in which the population is partitioned into subpopulations, or strata.  The strata should be mutually exclusive and collectively exhaustive in that every population element should be assigned to one and only one stratum and no population elements should be omitted.  Next, elements are selected from each stratum by a random procedure, usually SRS.  A major objective of stratified sampling is to increase precision without increasing cost. 

Stratified Sampling 







The elements within a stratum should be as homogeneous as possible, but the elements in different strata should be as heterogeneous as possible. Finally, the variables should decrease the cost of the stratification process by being easy to measure and apply. In proportionate stratified sampling, the size of the sample drawn from each stratum is proportionate to the relative size of that stratum in the total population. In disproportionate stratified sampling, the size of the sample from each stratum is proportionate to the relative size of that stratum and to the standard deviation of the distribution of the characteristic of interest among all the elements in that stratum.

Cluster Sampling 









The target population is first divided into mutually exclusive and collectively exhaustive subpopulations, or clusters. Then a random sample of clusters is selected, based on a probability sampling technique such as SRS. For each selected cluster, either all the elements are included in the sample (one-stage) or a sample of elements is drawn probabilistically (two-stage). Elements within a cluster should be as heterogeneous as possible, but clusters themselves should be as homogeneous as possible. Ideally, Ideally , each cluster should be a small-scale representation of the population. In probability proportionate to size sampling, sampling, the clusters are sampled with probability proportional to size. In the second stage, the probability of selecting a sampling unit in a selected cluster varies inversely with the size of the cluster.

Types of Cluster Sampling Fig. 11.3

One-Stage Sampling

Cluster Sampling

Two-Stage Sampling

Simple Cluster Sampling

Multistage Sampling

Probability Proportionate to Size Sampling

Sample vs. Census

Type of Study

1. Budget

Sample Sizes Used in Marketing Research Studies Table 11.2

Type of Study

Problem identificat

Factors to Consider in Sample Design

Research objectives

Degree of accuracy

Resources

Time frame

Knowledge of target population

Research scope

Statistical analysis needs

Determining Sample Size How many completed questionnaires do we need to have a representative sample?  Generally the larger the better, but that takes more time and money.  Answer depends on: 

  

How different or dispersed the population is. Desired level of confidence. Desired degree of accuracy. a ccuracy.

Common Methods for Determining Sample Size 

Common Methods:    

Budget/time available Executive decision Statistical methods Historical data/guidelines

Factors Affecting Sample Size for Probability Designs Variability of the population characteristic under investigation  Level of confidence desired in the estimate  Degree of precision desired in estimating the population characteristic 

n = [N Z ]/[Ne + σ Z ] 2

2

2

2

2

Where  e is the specified error  σ is the SD of the population  N is the population  Z is the table value of Z-Table. For a 95% Confidence Interval, value of Z is 1.96

Probability Sampling and Sample Sizes

For a simple sample size calculator, click here: http://www.surveysystem.com/sscalc.htm

Company Logo

Research Design

Measurement 

Selecting observable empirical events



Using numbers or symbols to represent aspects of the events

Applying a mapping rule to connect the observation to the symbol



What is Measured? 

Objects: Objects:  

Things of ordinary experience Some things not concrete

Properties: Properties: characteristics of objects



Characteristics of Data Classification  Order  Distance (interval between numbers)  Origin of number series 

Data Types Order Interval Origin Nominal

none none none

Ordinal

yes

unequal

none

Interval yes equal or unequal Ratio yes equal zero

none

Sources of Measurement Differences Respondent  Situational factors  Measurer or researcher  Data collection instrument 

Validity 

Content Validity



Criterion-Related Validity  



Predictive Concurrent

Construct Validity

Reliability 

Stability 

Test-retest

Equivalence 



Parallel forms

Internal Consistency   

Split-half KR20 Cronbach’s alpha

Practicality 

Economy



Convenience



Interpretability

Company Logo

MEASUREMENT SCALES

What is Scaling? Scaling is assigning numbers to indicants of the properties of objects



Types of Response Scales Rating Scales  Ranking Scales  Categorization 

Types of Rating Scales Simple category  Multiple choice, single response  Multiple choice, multiple response  Likert scale  Semantic differential 

• Numerical • Multiple rating • Fixed sum • Stapel • Graphic rating

Rating Scale Errors to Avoid 

Leniency  

Negative Leniency Positive Leniency

Central Tendency  Halo Effect 

Types of Ranking Scales 

Paired-comparison



Forced Ranking



Comparative

Dimensions of a Scale 

Unidimensional



Multidimensional

Scale Design Techniques Arbitrary scaling  Consensus scaling  Item Analysis scaling  Cumulative scaling  Factor scaling 

Sarndal, Swenson, and Wretman (1992), Model Assisted Survey Sampling, Springer-Verlag  Fritz Scheuren (2005). "What is a Margin of Error?", Chapter 10, in "What is a Survey?", American Statistical Association, Washington 

Company Logo

Thank you for your kind attention Go forth and research…. ….but be careful out there.

ARM Lecture 5

Short Description

Description

Comments

We need your help!