Data Science

IAP- Managing Confidential Research Data

Apr 25, 12:00pm to 3:00pm



This class is on research design and research methods related to confidential information. In this class we’ll discuss how to recognize sensitive information; prepare for IRB review; reduce risks in data collection; evaluate information threats and vulnerabilities; organize and store sensitive data;  understand data use agreements; and create data management plans.  If you’re a researcher, whether a late career grad student, faculty, or professional research staff, this class is for you.


Brown Bag: Issues in Curating the Open Web at Scale - with Gary Price

Sep 20, 12:00pm to 1:00pm



Much of the web remains invisible: resources are undescribed, unindexed or simply buried --  as many people rarely look past the first page of Google searches or are unavailable from traditional library resources.

At the same time many traditional library databases pay little attention to quality content from credible sources accessible on the open web.

How do we build collections of quality open-web resources (i.e. documents, specialty databases, and multimedia) and make them accessible to individuals and user groups when and where they need it?

Brown Bag: Towards an Open Science Publishing Platform - with Vitek Tracz

Feb 23, 12:00pm to 1:00pm



The traditional way of publishing new findings in journals is becoming increasingly outdated and no longer serves the needs of much of science.  Vitek will discuss a new approach being developed by F1000, an Open Science Platform, that combines immediate publication (like a preprint) with formal, invited, and transparent post-publication peer review.

Transparency and Inference for Big Data -- Third Census-MIT Big Data Workshop

Dec 07, 12:00pm to Dec 08, 5:00pm


E25-401 MIT, Cambridge, MA

Survey costs are increasing and response rates are decreasing. These pressures are forcing official statistical agencies to re-examine the way they collect data. Big Data are potential drivers of innovation that may reduce survey costs and respondent burden, but that also pose threats to inference and transparency. We need to understand how using big data, in conjunction with survey data collection, can address the issues posed by rising costs and nonresponse while producing transparent inference.


Location Confidentiality and Official Surveys -- Second Census-MIT Big Data Workshop

Nov 30, 12:00pm to Dec 01, 5:00pm


E25-202 MIT, Cambridge, MA

Based on mobile devices alone, commercial entities have the potential to collect extensive, fine grained, continuous, and identifiable records of a persons location and movement history, accompanied with a partial record of other mobile devices (potentially linked to people) encountered over that history. This information is increasingly used for commercial purposes, such as targeted advertising, and for scientific research.