In 2010-2011 the NYC Department of Health and Mental Hygiene conducted the Physical Activity and Transit Survey (PAT). As part of the Health Department's ongoing commitment to make data available for analysis, policymaking and program development, we have made these datasets available for download for use by researchers, students, and the public health community. The PAT Survey consisted of three parts: (1) a telephone survey of physical activity and health (2) a weeklong accelerometer device component of a sub-sample of participants and (3) a weeklong GPS device component. Survey and accelerometer data are now available for download and public use.
Three datasets are available for download:
For more information on the PAT, visit:
Prior to downloading the datasets below, please review the following information:
Due to the complex sampling design of the PAT, data should be analyzed in a software program capable of calculating variance for complex survey data. These include SAS, SPSS, SUDAAN, and STATA. Annotated sample code for analysis using SAS and SUDAAN is provided for all three data sets. More information about design statements and nesting variables are located in those programs.
The PAT was designed to provide estimates on borough level. United Hospital Fund (UHF) neighborhood indicators are included in the data set, however the sample sizes for these areas may be small and the estimates may be unreliable. For more information, see our suggested guidelines for data reliability (PDF) which incorporate relative standard error (RSE), confidence interval width, and sample size.
For any published analysis using downloadable PAT data, please reference the url of this webpage, the CHS year(s) analyzed, and the date on which the dataset was downloaded (suggested citation below).
The PAT survey was conducted using both landline and cellular telephone sample frames; a variable is included in the dataset to indicate these two sample types.
Most variables in the PAT have very few missing values (less than 2% on average). Responses of 'don't know' and 'refused' are coded as missing (.d and .r, respectively, outliers are coded with a .v). In select cases, responses of 'don't know' are coded as a non-missing response category because these responses were intended to be a valid response category; they comprise more than 10% of responses; or to maintain historical consistency. By default, SAS and SUDAAN exclude missing values from analysis. If you are interested in using methods such as imputation to address missing data, please send an email to email@example.com (include "imputation" in the subject heading).
The questionnaire contains self-reported information on chronic conditions, mental health, neighborhood perceptions and physical activity.
The accelerometer overview file contains weeklong data for all participants with at least 10 hours over wear time on 4+ days, per NHANES protocol from 2005-2006; the minutes file contains information on each minute of the week (including non-wear time) for each of these 679 accelerometer participants.
New York City Department of Health and Mental Hygiene. Physical Activity and Transit Survey 2010-2011 ; public use dataset accessed on [DATE].
Physical Activity Guidelines Advisory Committee. Physical Activity Guidelines Advisory Committee Report, 2008. Washington, DC: Department of Health and Human Services, 2008. http://www.health.gov/paguidelines
World Health Organization. Global Physical Activity Questionnaire and Analysis Guide: http://www.who.int/entity/steps/resources/GPAQ_Analysis_Guide.pdf
T roiano RP, Berrigan D, Dodd KW, Masse LC, Tilert T, McDowell M. Physical Activity in the United States Measured by Accelerometer. Medicine & Science in Sports & Exercise. 2007.
National Cancer Institute: Applied Research, Cancer Control and Population Sciences, SAS Programs for Analyzing NHANES 2003-2004 Accelerometer Data. http://appliedresearch.cancer.gov/nhanes_pam