EXAMPLE 1: THIS IS HOW I WOULD DO IT IN SAS. If code makes you nervous, use the simpler method in EXAMPLE 2, below.
Note: What you're describing sounds like a proportional sample, not a
cluster sample, so that's what I've shown here. Hope that meets your
needs.
/******** sort by strata *****/
proc sort data=MED_pts_155k ; by GRoup A_B_C clients ; run ;
/******** create sample design ***/
proc surveyselect noprint
data= MED_pts_155k
method=srs
seed = 7
n = 6000
out = sample_design ;
strata GRoup A_B_C /
alloc=prop NOSAMPLE
allocmin = 2 ; /*** min of 2 per stratum. ****/
run ;
/******** pull sample **********/
proc surveyselect noprint
data= MED_pts_155k
method=sys
seed = 7
n = sample_design
out = MY_SAMPLE ;
strata GRoup A_B_C ;
run ;
The "alloc = prop" option gives you proportional sampling: each stratum contributes to the sample in proportion to its share of the full file. The "nosample" option in SAS allows you to generate a separate file outlining the sample design without selecting any records yet. You then use the design in a second stage where you actually pull the sample. If this is too much bother you can leave off the "nosample" option, and go straight to pulling your sample as we did in the simpler example below.
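If you'd like to look at the design before pulling anything, a quick print is enough. This is only a sketch, assuming the first step above has already run and created sample_design. As far as I know that file has one row per stratum, with the allocated count in a column named SampleSize, which is the column the "n = sample_design" option reads in the second step.

```sas
/******** inspect sample design ***/
proc print data=sample_design noobs ;
run ;
/******** total of the per-stratum sample sizes ***/
proc means data=sample_design sum ;
var SampleSize ;
run ;
```

The total should come out at or very near the 6000 you asked for; the "allocmin" floor can shift a few records toward the smallest strata.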
Note that in the second step above we've chosen to switch to 'method = SYS', instead of simple random sample (SRS). SRS would work too, but since you may have different types of responses by client, you might want to sample in a representative way across the range of clients. To do that you sort within each stratum by client (which is why 'clients' is included in the sort step at the top) and intentionally sample in even increments across the range of clients; this is called a "systematic" sample (SYS). For example, taking 40 records from a stratum of 1,000 means picking roughly every 25th record after a random start.
EXAMPLE 2: SIMPLER
You could also do it all in one simple step if you want less code, and don't need to see the sample design broken down in a separate file.
/******** sort by strata *****/
proc sort data=MED_pts_155k ; by GRoup A_B_C ; run ;
/******** pull sample **********/
proc surveyselect noprint
data= MED_pts_155k
method= SRS
seed = 7
n = 6000
out = MY_SAMPLE ;
strata GRoup A_B_C /
alloc=prop
allocmin = 2 ;
run ;
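Whichever example you use, it's worth checking that the sample really came out proportional. One simple way, sketched here using the same data set and variable names as above, is to run the same frequency table on the full file and on the sample and compare them.

```sas
/******** stratum shares in the full file ***/
proc freq data=MED_pts_155k ;
tables GRoup*A_B_C / list ;
run ;
/******** stratum shares in the sample ***/
proc freq data=MY_SAMPLE ;
tables GRoup*A_B_C / list ;
run ;
```

The Percent column should be close in the two listings. Small differences are expected from rounding and from the minimum of 2 per stratum.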
In both examples we're assuming you have two stratification variables:
'GRoup' and a second variable 'A_B_C' which contains values of a, b,
or c. Hope that helps. Cluster sampling is possible in SAS as well, but as noted above, I've illustrated a proportional sample here since that seems to be what you need. Cluster sampling would take a little more space to describe.