The Christakis LabHarvard University

Our facebook data sets, described in K. Lewis, J. Kaufman, M. Gonzalez, A. Wimmer, and N.A. Christakis, "Tastes, Ties, and Time: A New (Cultural, Multiplex, and Longitudinal) Social Network Dataset Using Facebook.com," Social Networks  30(4): 330-342 (October 2008)  MS#090 will be available from the IQSS website in 2010. Additional information will be located at http://dvn.iq.harvard.edu/dvn/dv/t3.

 

Our HuDiNe dataset regarding our phenotypic disease network is described in C.A. Hidalgo, N. Blumm, A.L. Barabasi, and N.A. Christakis, "A Dynamic Network Approach for the Study of Human Phenotypes," PLoS Computational Biology  5(4): e1000353. doi:10.1371/journal.pcbi.1000353 (April 2009)  MS#098 is available here.

 

Our leadership insularity algorithm code (in Python) is described in S. Arbesman and N.A. Christakis, "Leadership Insularity: A New Measure of Connectivity Between Central Nodes in Networks," Connections  forthcoming (2010)  MS#112 is available here.

 

For our computerized version of the file that links social security numbers to the place and year of issue, as described by Block, Matanoski, Seltser, "A Method for Estimating Year of Birth Using Social Security Number," American Journal of Epidemiology  118(3): 377-395 (1983), and as used in our paper, L. Jin, F. Elwert, J. Freese, and N.A. Christakis, "Preliminary Evidence Regarding the Hypothesis that the Sex Ratio at Sexual Maturity May Affect Longevity in Men," Demography  47(3): 579-586 (2010)  MS#111 is available here.

 

The FHS-Net is available to qualified researchers. Given the origin of these data in clinical records and given the rules of the Framingham Heart Study, not all the data were releasable, however. In June, 2009, the study's administrators, with our assistance, posted a version of these data in a secure online repository. FHS implemented a variety of changes to the data in order to help protect subject confidentiality before posting. Specifically: (1) all date information was changed to a monthly resolution rather than daily; (2) only 9,000 cases rather than 12,000 could be posted (e.g., all non-genetically related relative ties, such as adopted siblings, step-children, etc., were removed); (3) individuals who did not consent to the release of "sensitive information" were excluded; and (4) the available covariates (e.g., geographic coordinates) were restricted. We have re-run some of our analyses on this restricted dataset, and many (but not all) of our results survive these restrictions. This dataset is distributed via the SHARE database at dbGAP.

 

Back to Top