Translate This Page Print This Page Email a Friend Newsletter Sign-Up
Text Size : Sm Med Lg

Data & Statistics
Community Health Survey: Public Use Data

As part of the Health Department's ongoing commitment to make data from the New York City Community Health Survey (CHS) available for analysis, grant writing, policymaking and program development, we provide downloadable datasets for use by researchers, students, and the public health community.

Datasets are available on this page for each year of the CHS. For more information on the CHS, visit:

Prior to downloading the datasets below, please review the following information:

  1. Due to the complex sampling design of the CHS, data must be analyzed in a software program capable of handling complex survey data (such as SUDAAN or STATA). Annotated sample code for analysis using SAS and SUDAAN is provided for each year. More information about design statements and nesting variables are located in those programs.


  2. Specific items on the CHS change from year to year as the Health Department develops initiatives or new public health issues emerge. Please review our brief question matrix (pdf) to determine which year(s) contains your topics of interest.


  3. Although a topic may be included every year, question wording or response categories may vary from one year to the next. These changes may be minor or substantial, and comparing estimates across years may not always be appropriate. The CHS variable crosswalk (xls) can be used as a guide to assess comparability, but all users are strongly encouraged to review the questionnaires and compare data across years of the CHS with caution.  Please send an email to epidatarequest@health.nyc.gov if you need guidance about comparing questions across years.  

  4. The CHS was designed to provide estimates on a variety of health measures at the United Hospital Fund (UHF) neighborhood level. However, some conditions may be rare or the sample sizes for some populations quite small, making CHS estimates potentially unreliable. Organizations have various guidelines for when an estimate should be considered unreliable. Our suggested guidelines for CHS data reliability (PDF) incorporate relative standard error (RSE), confidence interval width, and sample size.


  5. For any published analysis using downloadable CHS data, please reference the url of this webpage, the CHS year(s) analyzed, and the date on which the dataset was downloaded (suggested citation below). On occasion, updates are made to some variables. Would you like to be notified when CHS datasets are released or updated? Please send an email to epidatarequest@health.nyc.gov.


  6. Analyses of smoking data for 2003 should use the special smoking dataset (CHS 2003 smoking) provided below.


  7. CHS began including a cell-phone-only sample in 2009.  For most measures, the inclusion of a cell-phone-only sample has had only a nominal effect (see Epi Research Report - Results from the 2008 Cell Phone Pilot Study).  However, researchers should use caution when comparing multiple years of CHS data and note any difference in the populations surveyed.


  8. Most variables in the CHS have very few missing values (less than 2% on average). Responses of “don’t know” and “refused” are coded as missing (.d and .r, respectively). In select cases, responses of “don’t know” are coded as a non-missing response category because these responses were intended to be a valid response category; they comprise more than 10% of responses; or to maintain historical consistency. By default, SAS and SUDAAN exclude missing values from analysis. If you are interested in using methods such as imputation to address missing data, please send an email to survey@health.nyc.gov (include "imputation" in the subject heading).

  9. If an analysis requires the combination of multiple years of data, a multi-year weight is needed.  Multi-year weights are only available for data years 2002-2008, and 2009-2012.  At this time, survey data from 2009-2012 cannot be combined with earlier years. Please send an email to epidatarequest@health.nyc.gov for more information on multi-year survey weights.


  10. In 2011, the Health Department updated its weighting methodology of the Community Health Survey, consistent with other large state and national surveys. New weighting methods incorporate Census 2010 data and additional demographic characteristics to best represent the population of adult New Yorkers. After analyzing possible effects of these changes, the Health Department found that the updated methodology has minimal or no effect on CHS health estimates and does not impact the interpretation of trends in prevalence (percentages) over time. Full details can be found in the methodology update report.     
CHS Public Use Datasets

2012
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing data (SAS + SUDAAN)
Format program
Format statement program


2011
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program
Format statement program

2010
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program
Format statement program

2009
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program 

2008
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program 

2007
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program  

2006
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program  

2005
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program  

2004
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program
Format statement program   

2003
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program
Format statement program   

2003 for Smoking Analyses
Questionnaire (PDF) - Fall 2003 Survey (PDF) [Note: 2003 full survey questionnnaire available above]
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program 
Format statement program  

2002
Questionnaire (PDF)
Codebook (PDF)
Dataset (SAS)
Call in program and sample code for analyzing the data (SAS+SUDAAN)
Format program
Format statement program 
 

Suggested Citation

New York City Department of Health and Mental Hygiene. Community Health Survey [YEAR(S)]; public use dataset accessed on [DATE].