Obama’s “Precision Medicine” Initiative and Privacy

Jay Stanley, Senior Policy Analyst, ACLU Speech, Privacy, and Technology Project

September 18, 2015

Yesterday we got a lot of new detail on the likely shape of President Obama’s proposed “Precision Medicine Initiative” (or PMI). This project envisions creating a research database of genetic data on one million volunteers, an idea that promises potentially huge medical benefits, but also raises significant privacy questions.

The initiative was first announced in the president’s State of the Union address last January, and then launched with fanfare a few weeks later in a White House gathering of genetics experts, patients, academics, and government officials. To their credit, White House officials have recognized the privacy challenges, and have sought input from privacy experts from the beginning (the ACLU was invited to the program launch, and to several meetings since to discuss the privacy issues involved). In July, the White House produced a document that outlined some basic “privacy and trust” principles to which the program is supposed to adhere.

But many of the crucial details have still been lacking, making it hard to know how to evaluate what has been essentially just a vision. The privacy document is quite good and covers all the bases, but seems to defer any hard choices between using data in every possible way for research, and protecting privacy. For example, in meetings I attended top health officials waxed enthusiastic about the potential advantages of crowdsourcing genetic databases – leaving them wide open so that a thousand people can explore them, which they said in their experience inevitably yields discoveries that don’t come to light if data access is granted to only a small cadre of certified professional “researchers.” Certainly that comports with everything we know about the advantages of open-source and crowdsourced data. At the same time, the document envisions strict rules and limits around how people’s data will be used and shared. It’s not clear to me how those two visions will co-exist.

It’s been hard to make judgments without more details, which is why I’ve been eagerly awaiting the recommendations of an NIH advisory group that was tasked with developing recommendations for some of the nuts and bolts of how this initiative would actually work. Yesterday, in a public meeting broadcast live via conference call, the advisory group formally presented its recommendations to NIH Director Francis Collins. At the end of the meeting, Collins announced that he was accepting the recommendations, clearing the way for implementation according to the report’s blueprint.

These are very complex issues and I haven’t had time to properly digest them, but here are some significant features of the program outlined in the report (a slide show was also presented at the meeting):

Volunteers whose information is entered into the database (which the program likes to call “participants”) are envisioned as being drawn mainly from health providers such as Kaiser Permanente. Any other individual will also be able to directly volunteer to be included.
The data obtained from each participant will include not just their genome, but also Electronic Health Record data—essentially their complete medical records, including such things as narrative documents, EKG and EEG waveform data, scan imagery, and “mobile health” data from wearable sensors. It will also include a bio-sample of their actual tissue—most likely a blood sample—and the results of a baseline physical exam. Basically, all available medical data that could prove useful. One reason that the system will seek to draw from established health providers like Kaiser is that they already have all this information on their patients in one place.
The million-person Cohort is envisioned as being longitudinal—it will feature an ongoing relationship with participants, including continuous information collection.
Information and findings will also be fed back to participants—both aggregate scientific findings, and also findings of individual relevance.
The database would be open for exploration by any researchers—anyone from academic professionals to high school students.
Any new data that results from (for example) running a new algorithm on the Cohort would have to be shared back with the project and available to others. This is good; this database will belong to the public and its fruits should likewise belong to the public.
The report details a governance system that includes significant input from program participants. Also good.

Perhaps most significantly for privacy, the report recommends that the program “should create and use de-identified data for research whenever feasible to do so.” At the same time, it also wants participants to be “re-contactable.” In its key paragraphs on privacy, the report recognizes the complexities involved:

A national cohort that includes a highly interactive approach to communicating with and soliciting input from study participants will necessarily have to operate in two data management modes, while respecting participant preferences and terms of consent. The “fully identified” mode of operations will be needed for messaging, study appointment reminders, phone interactions, etc….

Aggregate data assembled for analysis will need to be de-identified by removal of standard classes of personal identifiers such as those specified by HIPAA Limited Data Set and Safe Harbor provisions. These are imperfect privacy standards, however, and the clinical and research-generated data are expected to be rich in features that make each individual’s contribution unique. Uniqueness is not synonymous with re-identification (which requires, in addition, a naming source), but the proliferation of data mining methods and potential naming sources (voter lists, public registries, social media postings, ancestry web sites, etc.) means that technology alone will be insufficient to address issues of data privacy for the PMI cohort. Expert testimony presented at [a program workshop] brought forth the view that de-identification should not be thought of as a guarantee of anonymity, but rather simply “another disincentive to attempting re-identification of individuals.” Acceptable use policies with substantial enforceable sanctions will need to be developed or adapted from other similar research efforts to complement the technical approaches to deidentification of data.

In short, it may be possible to re-identify participants from medical records in the database, but those who attempt to do so will be subject to unspecified “penalties.”

Ultimately, the report thus punts on the hardest details for now with a recommendation that the program “engage data privacy experts to create an effective combination of technology and policy to minimize risks to re-identification of de-identified data.” On yesterday’s call, as in prior meetings, I have certainly been favorably impressed with the thoughtfulness and thoroughness with which White House and NIH staff have approached the policy issues raised by this project, including the privacy implications as well as a number of other knotty issues it raises. That said, strictly from a privacy point of view, there remain some significant questions for those contemplating volunteering for this program. It does not look as though this will be an airtight, privacy-protective system where subjects’ data will be technologically guaranteed private. And of course as with any large data store in today’s world the cybersecurity questions are considerable. A fair amount of trust will have to be placed by participants in those who run this program.

Of course, many people will be inspired to volunteer for this program out of a desire to help researchers fight diseases—diseases that have already affected them or people they love, or out of an abstract desire to contribute to humanity. Those are motivations we can all honor. Scientists say there’s real potential for this kind of database to revolutionize many areas of medicine. The exploitation of medical data for good is not like using big data to try to spot terrorists, a misguided effort where the privacy downsides are vastly eclipsed by the (unlikely) benefits. In a chart included in the report yesterday, the authors estimate that with a population of a million people, there will be 6,400 cases of Parkinson’s within 5 years, for example, 18,000 cases of Lupus, 32,600 cases of breast cancer, and similar numbers for many other conditions. That will allow a lot of exploration of genetic and environmental causes of disease. Such possibilities are something that we privacy advocates do not fail to take into consideration when judging uses of data.

And not everyone feels they need airtight privacy, even for their medical records and the sensitive information they so often contain. Some people are already making their genomes public.

But it’s also important for people to have a clear understanding of what the privacy risks might be, both so that those risks can be ameliorated where possible, and also so that individuals can make a fully informed decision about whether they want to participate. We want volunteers to go in with their eyes wide open. The proposal outlined yesterday, and the project overall as it unfolds, will have to be studied and analyzed closely by privacy advocates.

Learn More About the Issues on This Page

Related Content

Press Release

Apr 2025

Privacy & Technology

+2 Issues

Human Rights First Joins ACLU and NYCLU in Amicus Brief to Protect First Amendment Rights and Interests of NGOs Advocating for U.S. Sanctions

Today, Human Rights First, the American Civil Liberties Union (ACLU), and the New York Civil Liberties Union (NYCLU) filed an amicus brief with the U.S. District Court for the Eastern District of New York, in support of Democracy for the Arab World Now’s (DAWN) efforts to block an individual sanctioned for violence in the Israeli occupied West Bank from accessing information about DAWN’s advocacy for sanctions against him. The brief argues that various protections, including the First Amendment and reporter’s privilege, bar the court from granting the discovery requested in this case. The brief also emphasizes how such discovery requests, if granted, would put civil society groups at serious risk of irreparable harm and chill their vital advocacy work on human rights and corruption issues. In August 2024, Isaac Levi Pilant was sanctioned by the U.S. government under the West Bank sanctions program, for attacking and forcefully expelling Palestinians from a West Bank settlement. At the time, human rights groups, media outlets, and witnesses had documented Pilant’s alleged role in violent attacks against Palestinians, and DAWN had publicly recommended that the U.S. government impose sanctions on him and others for such violence. The sanctions against Pilant were lifted in January 2025, after President Trump effectively terminated the West Bank sanctions program. Pilant then filed an application against DAWN and its executive director, Sarah Leah Whitson, pursuant to a U.S. law that provides a mechanism for foreign litigants to obtain discovery from people and entities in the United States.The application seeks a court order for information related to DAWN’s investigation of Pilant and its sanctions advocacy efforts. Pilant says he seeks the information for use in a possible future defamation case in Israel against an Israeli human rights organization. The brief explains how the U.S. government has established frameworks and processes to encourage nongovernmental organizations (NGOs) to share sensitive information that can assist it in more effectively implementing various human rights and corruption sanctions and visa restriction programs. Undermining the protections for NGOs to securely and confidentially share this information would not only impact the ability of the U.S. government to use such tools to hold human rights abusers and corrupt actors accountable, but it would also put NGOs, victims of abuse, and others in civil society in jeopardy by opening them up to retaliation and harassment from people they accuse of human rights violations. “Human rights and corruption sanctions are impactful tools of accountability because they threaten the reputations and financial interests of abusers. Forcing NGOs to share information about their sanctions advocacy would put them at grave risk of violence and retaliation from repressive governments and powerful private individuals,” said Amanda Strayer, Senior Counsel for Accountability at Human Rights First. “U.S. courts should not become a forum for sanctioned actors to harass and seek retribution against civil society groups that advocate for measures to hold them accountable.” The brief also argues that Pilant’s broad discovery request implicates information protected under the First Amendment and the reporter’s privilege, which provide grounds to reject his request under the Section 1782 statute. Supreme Court precedent requires the Court to give weight to the serious First Amendment and policy considerations before granting such a request. In this case, these considerations should result in the Court denying Pilant’s discovery request. “It is the nature of human rights reporting that it often draws the ire of accused human rights violators. But the law is clear that such individuals cannot coopt U.S. courts in an attempt to harass and endanger human rights organizations and the victims of abuses whose stories they safeguard. That’s why this is an easy case, and we hope the court has no trouble concluding that the First Amendment protects DAWN’s rights to free speech and association, and bars enforcement of the meritless request for intrusive discovery,” said Nathan Freed Wessler, Deputy Director of the ACLU Speech, Privacy, and Technology Project. “NGOs can play a critical role in providing accountability for human rights abuses, and the Constitution protects them from being forced to reveal certain confidential aspects of that work,” said Bobby Hodgson, assistant legal director at the New York Civil Liberties Union. “DAWN is being targeted by a foreign litigant implicated in serious human rights violations in an effort to weaponize our court system to silence critics. We urge the court to reject these requests and recognize that the discovery process does not create an end run around the First Amendment.”

Court Case: In Re: Application of Isaac Levi Pilant, for an Order Pursuant to 28 U.S.C. § 1782 to Conduct Discovery for Use in a Foreign Proceeding

Affiliate: New York
Human Rights First Joins Aclu And Nyclu In Amicus Brief To Protect First Amendment Rights And Interests Of Ngos Advocating For U.s. Sanctions. Explore Press Release.
Podcast

May 2025

Free Speech

+2 Issues

Know Your Digital Privacy Rights with Esha Bhandari and Daniel Kahn Gillmor

By: ACLU
Know Your Digital Privacy Rights With Esha Bhandari And Daniel Kahn Gillmor. Explore Podcast.
Press Release

Apr 2025

Privacy & Technology

ACLU Sues Social Security Administration and Department of Veterans Affairs for Information about DOGE Data Access

WASHINGTON – The American Civil Liberties Union filed a lawsuit today to enforce a Freedom of Information Act (FOIA) request sent to the Department of Veterans Affairs (VA) and the Social Security Administration (SSA) seeking urgent transparency about the so-called Department of Government Efficiency’s (DOGE) secretive efforts to access and analyze Americans’ sensitive personal information. In its FOIA request, originally filed in February with 40+ federal agencies, the ACLU asked for any records that reveal whether DOGE or its representatives have sought or obtained access to databases containing personally identifiable information, financial records, health care data, or other sensitive government-held records of Americans. The request also sought information on DOGE’s use of artificial intelligence (AI) to analyze government data, which raises alarms about the potential for mass surveillance and politically motivated misuse of that deeply personal information. “The federal government cannot dodge accountability by ignoring our lawful demands for transparency,” said Nathan Freed Wessler, deputy director of the ACLU’s Speech, Privacy, and Technology Project. “The American people have an urgent need to know if their private financial, medical, and personal records are being illegally accessed, analyzed, or weaponized by Trump's unaccountable team of unvetted outsiders. This is doubly true for our seniors and veterans, who are at particular risk if their data has been accessed illegally.” Given the urgency of the request, the ACLU requested expedited processing, which was granted by many agencies, including the Department of Defense, the Department of Education, and the Department of Health and Human Services. The SSA declined the request for expedited processing and has failed to respond to the ACLU’s appeal, and the VA failed to act on the request altogether. “Granting DOGE access to VA data systems would not only violate federal law but it would undermine the very core of the VA mission: to care for veterans, their families, caregivers and survivors,” said Michelle Fraling, Skadden Fellow with the ACLU’s Center for Liberty. “Given the millions of veterans and family members who depend on VA benefits and services, it is imperative that we have full transparency into DOGE’s relationship with VA and any access to veteran records.” In March, a federal judge barred DOGE representatives from accessing sensitive data at the Social Security Administration. Reporting from the Washington Post, however, has suggested that DOGE personnel have gone to great lengths to try to circumvent the court order. This lawsuit in part seeks access to records outlining DOGE’s access to private, sensitive information about Social Security Administration beneficiaries. “If DOGE is forcing its way into our private data, it is forcing itself into our private lives,” said Lauren Yu, Williams J. Brennan Fellow with the ACLU’s Speech, Privacy, and Technology Project. “Congress mandated strict privacy safeguards for a reason, and Americans deserve to know who has access to their social security numbers, their bank account information, and their health records. Government actors cannot continue to shroud themselves in secrecy while prying into our most sensitive records.” The suit was filed in the U.S. District Court for the District of Columbia. You can view the lawsuit here and read more about the FOIA requests here.

Court Case: U.S. DOGE Service Access to Sensitive Agency Records Systems Multiagency FOIA
Aclu Sues Social Security Administration And Department Of Veterans Affairs For Information About Doge Data Access. Explore Press Release.
Podcast

Mar 2025

Free Speech

+2 Issues

Free Mahmoud Khalil with Ben Wizner and Baher Azmy
Free Mahmoud Khalil With Ben Wizner And Baher Azmy. Explore Podcast.

DEMOCRACY

JUSTICE

LIBERTY

Obama’s “Precision Medicine” Initiative and Privacy

Stay informed

Learn More About the Issues on This Page

Related Content

Human Rights First Joins ACLU and NYCLU in Amicus Brief to Protect First Amendment Rights and Interests of NGOs Advocating for U.S. Sanctions

Know Your Digital Privacy Rights with Esha Bhandari and Daniel Kahn Gillmor

ACLU Sues Social Security Administration and Department of Veterans Affairs for Information about DOGE Data Access

Free Mahmoud Khalil with Ben Wizner and Baher Azmy