The Ohio State University

Department of Statistics

Observational Data Reading Group



Time: Alternating Fridays 3:00-3:55pm
Location: 212 Cockins Hall

Spring Semester 2016

Topic: Network Modeling and Causal Inference

Schedule:

Date Topic
January 15 Group discussion of: McLaughlin, Handcock and Johnston (2015) Inference for the visibility distribution for respondent-driven sampling. Joint Statistical Meetings Proceedings, Social Statistics Section, Alexandria, VA: American Statistical Association. 2259-2267.
January 29 Group discussion of: Zhang, Levina, and Zhu. Estimating network edge probabilities by neighborhood smoothing. arXiv:1509.08588
February 12 Group discussion of: C. Gao, Z. Ma, A. Zhang and H.Zhou. Achieving Optimal Misclassification Proportion in Stochastic Block Model. arXiv:1505.03772
February 26 Group discussion of: Krivitsky and Kolaczyk (2015) On the question of effective sample size in network modeling: an asymptotic inquiry. Statistical Science 30(2): 184-198.
March 11 Karly Jacobsen on modeling epidemics on networks.
March 25 Presentation by Mehmet Caner, in part based on: Caner and Zhang (2014) Adaptive elastic net for generalized methods of moments. Journal of Business and Economics Statistics. 32:30-47.
April 8 Group discussion of: Angrist, JD, Imbens, GW, and Rubin, DB. (1996). Identification of Causal Effects Using Instrumental Variables. Journal of the American Statistical Association, 91(434), 444–455. http://doi.org/10.2307/2291629
April 22 Group discussion of: O'Malley, A. J., Elwert, F., Rosenquist, J. N., Zaslavsky, A. M. and Christakis, N. A. (2014), Estimating peer effects in longitudinal dyadic data using instrumental variables. Biometrics, 70: 506–515. doi: 10.1111/biom.12172, led by Ziyue Chen.

Autumn Semester 2015

Topic: Networks

Schedule is listed here.

Autumn Semester 2014

Topic: Network Sampling


Schedule:
Date Topic
September 11 Ran Wei will present an introduction to networks.
You can look at the Kolaczyk book, after appropriate logins, here: http://link.springer.com.proxy.ohiolink.edu:9099/book/10.1007%2F978-0-387-88146-1 , or if that link doesn't work, you can try here: https://olc1.ohiolink.edu/search~S0?/aKolaczyk/akolaczyk/1%2C4%2C15%2CB/frameset&FF=akolaczyk+eric+d&6%2C%2C6
Chapters 1-4 are the "introductory" material.
September 25 Discussion leaders: Yanan Jia and Hui Yang.
Topic: Kolaczyk Chapter 5
October 9 Anna Mohr on Statistical Models for Network Data (but we ran out of time for Comparing Networks of Different Size)
October 23 Yu Wang on Nesreen K. Ahmed, Nick Duffield, Jennifer Neville, and Ramana Kompella. 2014. Graph sample and hold: a framework for big-graph analytics. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '14). ACM, New York, NY, USA, 1446-1455. DOI=10.1145/2623330.2623757 http://doi.acm.org/10.1145/2623330.2623757
November 6 Discussion on Future Directions for Sampling in Networks
November 20 Group Discussion on: Cosma Rohilla Shalizi and Alessandro Rinaldo (2013) CONSISTENCY UNDER SAMPLING OF EXPONENTIAL RANDOM GRAPH MODELS. The Annals of Statistics. 41(2): 508-535. DOI: 10.1214/12-AOS1044
http://arxiv.org/pdf/1111.3054v4.pdf
Summary Slides.
December 4 Ran Wei on Respondent-Driven Sampling and Adaptive Web Sampling. Suggested reading:
  1. Respondent-Driven Sampling
    1. Probability Based Estimation Theory for Respondent Driven Sampling (Volz and Heckathorn, 2008)
    2. Improved Inference for Respondent-Driven Sampling Data With Application to HIV Prevalence Estimation (Gile, 2012)
    3. Network Model-Assisted Inference from Respondent-Driven Sampling Data (Gile and Hancock, 2012)
  2. Adaptive Web Sampling:
    1. Adaptive Network and Spatial Sampling (Thompson, 2011)


Spring Semester 2014


Schedule:
Date Topic
January 14 Field Trip: IPR seminar by Bo Lu 12:30-1:30 in 038 Townshend Hall
January 28 RESCHEDULED DUE TO WEATHER
February 11 Jingjing Yan on: Choosing a "Best" Measure for the Combination of Studies in Meta-analysis for Binary Events.
February 25 Kevin Donges on: A Simulation Study of the Effect of Study Duration on Modeling Environmental Risk of Cancer
March 18 Dave Kline on: "Missing Data in Meta-Analysis: Multiple Imputation Approaches"
April 1 Hui Yang on: "Effects of Bounding Interviews in the National Crime Victimization Survey (NCVS)"
April 15 Zach Thomas on "Bayesian Physical-Statistical Modeling for Causal Inference in Climate"



Autumn Semester 2013


Schedule:
Date Topic
September 5 Jingjing Yan will present on "Imputing Data for General Purpose Estimation".
September 19 Yanan Jia will present on "Bilinear Mixed Effects Models for Affiliation Networks".
October 3 Elizabeth Petraglia on ... TBA
October 17
October 31 Danielle Sullivan will present on "Hot Deck Imputation for Nonignorable Missing Data"
November 14 Robert Ashmead will present on "Causal Inference Using Propensity Score Methods with Complex Survey Data".



Spring Semester 2013


Schedule:
Date Topic
Series on reducing bias when combining data from multiple data sources
January 25 Elly Kaizar will present an overview of the series topic
February 8 Elizabeth Petraglia on Lohr, S. L. and Brick, J. M. (2012), Blending domain estimates from two victimization surveys with possible bias. Can J Statistics, 40: 679–696. doi: 10.1002/cjs.11153
February 22 Andrew Olsen on Molitor, N., et al. 2009. Using Bayesian Graphical Models To Model Biases In Observational Studies To Combine Multiple Sources Of Data: Application To Low Birth Weight And Water Disinfection By-Products. JRSS-A. 172:615-637.
March 8 JingJing Yan on Welton, N. J., Ades, A. E., Carlin, J. B., Altman, D. G. and Sterne, J. A. C. (2009), Models for potentially biased evidence in meta-analysis using empirically based priors. Journal of the Royal Statistical Society: Series A (Statistics in Society), 172: 119.136. doi: 10.1111/j.1467-985X.2008.00548.x
Series on rensitivity analysis in causal inference
March 22 Bo Lu will present an overview of the series topic A background article that you might find helpful is: Rosenbaum, PR (2005) Sensitivity Analysis in Observational Studies. Encyclopedia of Statistics in Behavioral Science , Volume 4, pp 1809-1818. (available at the library).
April 5 Meng Li on Dual and Simultaneous Sensitivity Analysis for Matched Pairs Joseph L. Gastwirth, Abba M. Krieger and Paul R. Rosenbaum Biometrika, Vol. 85, No. 4 (Dec., 1998), pp. 907-920
April 19 Claire Zhu on design sensitivity, based on Chapter 14 of: P.R. Rosenbaum, Design of Observational Studies, Springer Series in Statistics, entitled "The Power of a Sensitivity Analysis and Its Limit". You can read this online via the OSU Library's electronic copy.

Autumn Quarter 2012


Schedule:
Date Topic
Series on Replicate Weights in Large Scale Complex Sample Surveys
August 31 Elizabeth Stasny on the American Community Survey and replicate weighting.
September 14 Andrew Bean on Variance estimation for complex surveys using replication techniques by KF Rust and Jnk Rao Stat Methods Med Res 1996 5: 283 (available for free in the SEL basement stacks)
September 28 Andrew Olsen on Disclosure risk and replication-based variance estimation by Wilson W. Lu and Randy R. Sitter, Statistica Sinica 18(2008), 1669-1687.
Series on Statistical Issues in Environmental Health Studies: Multiple Exposure Pathways, Confounding, and Lagged Dose-Response Relationships
October 12 Kate Calder will present an overview of the series topic
October 26 Sam Bussman on: Peng, R.D., Dominici, F., and Louis, T.A. (2006). "Model choice in time series studies of air pollution and mortality." Journal of the Royal Statistical Society, Series A, 169, 179-203.
November 9 Kevin Donges on: Welty, L.J., Peng, R.D., Zeger, S.L., and Dominici, F. (2009). "Bayesian distributed lag models: estimating effects of particulate matter air pollution on daily mortality." Biometrics, 65 282-291.
November 23 Thanksgiving Holiday




Spring Quarter 2012


Schedule:
Date Topic
April 6 Omer Ozturk on "Combining multi-ranker information in judgment post strati?ed and ranked set samples when sets are partially ordered".
April 20 Tian Chen on Judgement Poststratification.
May 4 Elizabeth Stasny
May 18 Dave Kline on Systematically missing confounders in individual participant data meta-analysis of observational cohort studies. Statistics in Medicine. Volume 28, Issue 8, pages 1218-1237, 15 April 2009.
June 1 Muriel Fang on Estimating panel data models in the presence of endogeneity and selection


Winter Quarter 2012

Tentative Schedule:
Date Topic
Jan 6 Elly Kaizar, on "Propensity Score Analysis with Matching Weights", a working paper by Liang Li
Jan 20 Robert Ashmead, on "Adjusted Kaplan-Meier estimator and log-rank test with inverse probability of treatment weighting for survival data" by Jun Xie and Chaofeng Liu. Published in Statistics in Medicine 2005; 24:3089-3110.
Feb 3 Elizabeth Petraglia, on "Nonresponse Error, Measurement Error, And Mode Of Data Collection: Tradeoffs in a Multi-mode Survey of Sensitive and Non-sensitive Items" by Joseph W. Sakshaug, Ting Yan, and Roger Tourangeau. Published in Public Opinion Quarterly 2010; 75(5): 909-961.
Feb 17 Hui Zheng, on Heteroscedastic Regression in Hierarchical Age-Period-Cohort Models
Mar 2 Elizabeth Stasny, on the AAPOR Report on Online Panels, published in the Public Opinion Quarterly (2010) 74(4): 711-781.


Autumn Quarter 2011

Tentative Schedule:
Date Topic
Oct 10 Chris Browning and Kate Calder on network analysis. The presentation will include discussion of their current work, and will also be based in part on the paper: Mark S. Handcock and Krista J. Gile. Modeling social networks from sampled data. Ann. Appl. Stat. Volume 4, Number 1 (2010), 5-25.
Oct 24 Rebecca Andridge on quality indicators in surveys and R-scores. The presentation will include discussion of her current work, and will also be based in part on these two papers, which should be "easy" to read:
Nov 7 Bo Lu on dual frame survey sampling. Bo has suggested that you take a look at: C. J. Skinner and J. N. K. Rao (Mar., 1996) Estimation in Dual Frame Surveys With Complex Designs Journal of the American Statistical Association Vol. 91, No. 433 (pp. 349-356). He says it's a bit on the technical side, so it's not important to understand everything in the paper.
Nov 21 Elly Kaizar on Record Linkage. She will be presenting the ideas in: Results from simulated data sets: probabilistic record linkage outperforms deterministic record linkage Tromp, Miranda; Ravelli, Anita C.; Bonsel, Gouke J.; et al. Journal of Clinical Epidemiology 64 (2011) 565-572.
Dec 5 Hong Zhu on dependence estimation for bivariate survival data through Kendall's tau. In addition to her own research, her talk will be based on the ideas in: Lakhal, Lajmi; Rivest, Louis-Paul; and Beaudoin, David (2009) "IPCW Estimator for Kendall's Tau under Bivariate Censoring," The International Journal of Biostatistics: Vol. 5: Iss. 1, Article 8.


Spring Quarter 2011

Date Topic
April 8 Erinn: TBA
April 22 Johanna: S. Van Buuren, et al. (2006) Fully conditional specification in multivariate imputation. Journal of Statistical Computation and Simulation. 76(12):1049-1064
May 6
May 20 Peter: TBA
June 3 Kate: Jennifer Hill. (2011) Bayesian Nonparametric Modeling for Causal Inference. Journal of Computational and Graphical Statistics. 20(1):217-240