Creating a data resource: what will it take to build a medical information commons?

Deverka, Patricia A.; Majumder, Mary A.; Villanueva, Angela G.; Anderson, Margaret; Bakker, Annette C.; Bardill, Jessica; Boerwinkle, Eric; Bubela, Tania; Evans, Barbara J.; Garrison, Nanibaa’ A.; Gibbs, Richard A.; Gentleman, Robert; Glazer, David; Goldstein, Melissa M.; Greely, Hank; Harris, Crane; Knoppers, Bartha M.; Koenig, Barbara A.; Kohane, Isaac S.; La Rosa, Salvatore; Mattison, John; O’Donnell, Christopher J.; Rai, Arti K.; Rehm, Heidi L.; Rodriguez, Laura L.; Shelton, Robert; Simoncelli, Tania; Terry, Sharon F.; Watson, Michael S.; Wilbanks, John; Cook-Deegan, Robert; McGuire, Amy L.

doi:10.1186/s13073-017-0476-3

Correspondence
Open access
Published: 22 September 2017

Creating a data resource: what will it take to build a medical information commons?

Patricia A. Deverka¹,
Mary A. Majumder²,
Angela G. Villanueva²,
Margaret Anderson³,
Annette C. Bakker⁴,
Jessica Bardill⁵,
Eric Boerwinkle⁶,
Tania Bubela⁷,
Barbara J. Evans⁸,
Nanibaa’ A. Garrison⁹,
Richard A. Gibbs¹⁰,
Robert Gentleman¹¹,
David Glazer¹²,
Melissa M. Goldstein¹³,
Hank Greely¹⁴,
Crane Harris¹⁵,
Bartha M. Knoppers¹⁶,
Barbara A. Koenig¹⁷,
Isaac S. Kohane¹⁸,
Salvatore La Rosa⁴,
John Mattison¹⁹,
Christopher J. O’Donnell²⁰,
Arti K. Rai²¹,
Heidi L. Rehm²²,
Laura L. Rodriguez²³,
Robert Shelton²⁴,
Tania Simoncelli²⁵,
Sharon F. Terry²⁶,
Michael S. Watson²⁷,
John Wilbanks²⁸,
Robert Cook-Deegan²⁹ &
…
Amy L. McGuire²

Genome Medicine volume 9, Article number: 84 (2017) Cite this article

4659 Accesses
29 Citations
6 Altmetric
Metrics details

Abstract

National and international public–private partnerships, consortia, and government initiatives are underway to collect and share genomic, personal, and healthcare data on a massive scale. Ideally, these efforts will contribute to the creation of a medical information commons (MIC), a comprehensive data resource that is widely available for both research and clinical uses. Stakeholder participation is essential in clarifying goals, deepening understanding of areas of complexity, and addressing long-standing policy concerns such as privacy and security and data ownership. This article describes eight core principles proposed by a diverse group of expert stakeholders to guide the formation of a successful, sustainable MIC. These principles promote formation of an ethically sound, inclusive, participant-centric MIC and provide a framework for advancing the policy response to data-sharing opportunities and challenges.

Background

In 2011, a National Academies report advocated for the creation of an “Information Commons,” an individual-centric, multilayered, widely accessible informational resource to support integration and use of data from biomedical research and clinical care for precision medicine [1]. This report built on the spirit of open science embodied in the publicly funded Human Genome Project and its Bermuda Principles, which were modeled on data-sharing practices in nematode biology and called for daily sharing of DNA sequence data well before publication [2]. Since then, data initiatives facilitating clinical and genomic data sharing have proliferated across multiple sectors and internationally. The term “commons” in this context evokes the groundbreaking work of Nobel laureate Elinor Ostrom, who developed a theoretical framework to address governance challenges in building and sustaining shared natural resources [3]. She and others then extended the concept to “knowledge commons,” applying to resources that are not necessarily depleted by use [4]. This framework has informed sophisticated commentary on the challenges of data sharing in the healthcare context [5].

One lesson from Ostrom’s work is that a successful, sustainable commons requires stakeholder participation. With health data, stakeholder engagement is required to address long-standing policy concerns about data access, privacy and security protections, data ownership, and issues of data quality, interoperability, and network sustainability. Without a nuanced understanding of these complexities and a policy framework to address them, it will be difficult to obtain stakeholder buy-in and to realize the promise of advancing precision medicine and promoting a learning healthcare system.

In an effort to enact Ostrom’s lessons and to begin to address these critical issues, we have joined together as representatives of a range of stakeholders, including healthcare systems, clinical laboratories, technology companies, academia, government, nongovernmental organizations, and patient and community advocacy groups. Drawing on our first-hand experiences, we have created an initial list of eight core principles (1–8) for building a successful, sustainable medical information commons (MIC), defined as “a networked environment in which diverse health, medical, and genomic data on large populations become broadly available for research use and clinical applications”.

Core principles

1. The MIC should be a healthy “ecosystem” of data initiatives connected through a standard approach to policy, interoperability, and collaborative work

The MIC does not refer to a particular network architecture, but rather to an ecosystem that encompasses multiple actors and a comprehensive, high-level, stakeholder-informed framework to guide decision making about data control and access. Some of the initiatives contributing to the MIC may have their own policies and governance structures for sharing data. Thus, the larger MIC may encompass multiple smaller commons, as defined by Elinor Ostrom. Rather than attempting to create uniformity, the preferred approach is to develop core principles (including the eight set forth here) and policies that ensure the needs and concerns of key stakeholders are considered and addressed across different models. Building the MIC ecosystem will require not only the creation of appropriate stakeholder-specific incentives for sharing data, but also new approaches in how large-scale data-sharing policies are developed and implemented by academia, industry, and government. The Global Alliance for Genomics and Health and the Patient-Centered Outcomes Research Network are current examples of experiments in multistakeholder policy development.

2. The MIC must bring together diverse sources of data from individuals with different states of health

An optimal MIC would include diverse sources of data from a broad range of individuals. Data (phenome and demographics) would be obtained from electronic health records, genomes, and personal health and environmental sources, including wearables, and would be voluntarily shared by individuals and continuously updated throughout their lives. Including data from a broad range of individuals is essential to enable analysis of disease risk, identification of factors contributing to disease resistance/resilience, and testing of pharmacogenomic targets. Aggregation could be accomplished through multiple strategies, including: linking data from current or planned large-scale population-based initiatives (like the Precision Medicine Initiative’s All of Us Research Program) and research efforts by health systems (such as Kaiser-Permanente and Partners HealthCare); linking disease-specific repositories; directing data from existing cohorts into a centralized data repository; and compiling searchable metadata into a directory or index that enables authorized inquiries to be directed to the location where responsive data are held.

Furthermore, targeting recruitment of individuals in various states of health would provide a rich data source. Researchers and clinicians would have access to information to understand the full spectrum of disease etiologies and define the natural history of disease, accelerate the development of more accurate diagnoses and treatments, and reduce the difficulty, time, and expense of identifying and recruiting relevant cohorts for clinical studies. However, interoperability and adopting data-sharing policies that maximize the utility of the data across heterogeneous sources are major challenges for the MIC. The variation between the many electronic health record formats created by different companies, customized for different health systems, and tailored for different purposes from medical value to ease of billing, magnify these challenges.

3. A participant-centric model is critical for the sustainability of the MIC

The existing opaque system of exchanging data built solely around data holders and users must transition to a system that involves participants. Involvement may require that the system is built with the participant at the center, or at least require a system that more meaningfully empowers the individuals whose data are being shared to make decisions about access and use [6, 7]. Empowerment requires not just giving participants choices but helping them understand their choices in a very technical area. Policy development should focus on options that simplify the system as much as possible, including the possibility of inserting trusted intermediaries to navigate the complex politics involved in data exchange [8]. Tools such as Blockchain, FHIR, Sync for Science and Sync for Genes, Private Access, and Blue Button can be refined and/or created to facilitate the ability of individuals to contribute and control data in the MIC. Use of these tools promotes autonomous and informed decision making without requiring that individuals own their data in a legal sense or exercise exclusive unstructured control over access to their data (see principle 8).

If data-sharing initiatives subscribing to the participant-centric model are to thrive within the MIC, strategies supporting an individual’s choice to contribute data directly to the MIC must be developed. These strategies should be simple, convenient, dynamic, community-based, and grounded in an ethical foundation that builds and sustains trust by being trustworthy. Existing US law generally gives individuals the right to access their healthcare data and requires covered entities to share it with others (including in electronic form), but more work is needed to help individuals exercise this right and to implement authorized data access and transfer [6, 7]. As the MIC evolves, community-driven and community-based efforts will likely increase in number, and it will be important that these data repositories are connected to and interoperable with traditional institutionally governed initiatives.

The participant-centric model must also integrate the rights, legal obligations, and interests of other stakeholders, including researchers, clinicians, and institutions. To promote cooperation among these various stakeholders, incentives to share data (or not) must be analyzed and, where appropriate, augmented (for example, rewarding researchers in the areas of promotion, publication credit, and tenure for sharing data).

4. It is important to reach out to and engage under-represented populations and to investigate the feasibility and acceptability of a public health approach

Relying on individuals to voluntarily contribute their data through existing healthcare delivery and communication channels will not be enough to engage a truly diverse group of participants. The situation is further complicated by the historical mistrust of research based on past injustices experienced by underserved groups, such as Native Americans and African Americans, which may result in under-representation of minority populations in the MIC. Under-representation will weaken the scientific generalizability and clinical utility of the resulting resources by failing to address some of the most genetically diverse communities, thus missing important scientific insights and further exacerbating health disparities.

A public health approach featuring automatic enrollment coupled with opt-out mechanisms and robust protections against data misuse might help ensure a representative cohort and address concerns about selection bias. Such an approach, however, raises concerns about unjustified and unwanted infringement on individual liberties. Systematic outreach and alternative engagement methods and strategies to invite communities to join the commons are needed to achieve equitable representation in the MIC. Some communities have possible channels for engagement through trusted intermediaries and community representatives, such as tribal governments and the Genetic Resource Center of the National Congress of American Indians, or via “community engagement studios” [9, 10].

5. Building trust is an iterative process and requires investment of efforts beyond informed consent

Building trust is essential for a successful MIC. Fostering and maintaining trust involves active engagement, takes time, and requires keen tuning to people’s needs, as trust can easily be damaged and, once lost, it is difficult to restore. Potential ways to build and sustain trust with participants include meaningful and authentic engagement through trusted intermediaries (including advocacy groups and foundations) or giving participants a meaningful voice in governance and/or decisions regarding their data.

Simply relying on traditional requirements for informed consent is insufficient, especially when obtaining consent is treated as a transaction rather than an iterative, ongoing process of communication. Participant choices can change over time as circumstances change. Such evolution supports the adoption of dynamic, process-oriented engagement and consent. Other trust-building factors include transparent communication of data distribution and uses, clear data-sharing and distribution rules, and meaningful sanctions for misuse. Demonstration of the financial viability of data repositories would minimize participants’ data security concerns related to disruption of funding or potential bankruptcy.

6. Regulatory policies that rely on a sharp distinction between the “kingdom of research” and the “kingdom of clinical care” must be reconsidered

The current regulatory system clearly distinguishes research data from clinical data. Researchers and clinicians are governed by different legal rules and ethical norms when collecting, storing, and using health data, depending on whether those data were collected as part of a research study or as part of clinical care. This distinction is far less meaningful for participants. As long as they are provided protections against informational harms, participants are mainly concerned that the data promote progress toward disease prevention and treatment and improvements for themselves or others in the future. The MIC ideally draws on data from both research and clinical care settings in order to contribute to a learning healthcare system, as well as incorporating new sources of “real world” data—such as lifestyle and environmental exposure data.

Regulatory frameworks governing the MIC should provide consistent rights and protections to all participants and data contributors, regardless of the circumstances leading to their involvement in the MIC, and should accommodate both clinical and scientific uses of shared data. For example, the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule guarantees individuals a right to access and receive copies of their health record data, yet this right remains poorly understood, difficult to exercise, and only covers the portion of a person’s data held by traditional healthcare providers and other HIPAA-regulated entities [6, 7]. Accordingly, in the USA, data exclusively residing in research files (rather than health records) are not subject to HIPAA’s individual access right unless the research laboratory is part of a larger HIPAA-regulated covered entity [11]. This disparate treatment of research and clinical data may confuse participants who want to contribute their data to the MIC and does not make sense in the context of translational genomic research, where sequence data are stored in research files but have clinical implications.

7. Changes in technology and in the scale and scope of data sharing demand reconsideration of current policy frameworks related to privacy and security

Rich, integrated, multifactorial datasets linking people’s genomic, clinical, and environmental/lifestyle data are inherently susceptible to re-identification, even when de-identified according to current standards. This concern will only intensify as better infrastructure solutions enable disparate data and tissue resources and large datasets, in general, to become linked [12]. In keeping with a commitment to transparency, educating participants and the public about the potential for re-identification—without exaggerating the level of risk or losing sight of the benefits of data sharing—is important. Few things could destroy participants’ trust in the MIC more quickly and completely than unforeseen data breaches and re-identification. While re-identification risks cannot be eliminated, their likelihood can be reduced by: (1) requiring more complete accountings of disclosures and downstream uses of data in de-identified forms, (2) developing laws and regulations that distinguish benign uses of re-identification from nefarious ones, and (3) implementing stronger sanctions and enforcement mechanisms for data misuse [13]. Finally, there are indications that individuals differ in their degree of concern about privacy, and in how they view trade-offs between risk and benefit in the context of data sharing to facilitate research and improvements in public health [14, 15]. These differences may justify the use of data-sharing models that give participants more control over the level of risk they are willing to incur [15,16,17,18].

8. Distinguishing data ownership from data access and control is critical. Notions of unitary, exclusive property rights to data run counter to building the MIC

Individuals may believe they alone “own” their healthcare and related personal data; however, exclusive ownership that is typically, and often inaccurately, associated with land and physical objects is especially misplaced in the context of the MIC, where multiple copies of data exist in multiple places. The legal status of this information is not entirely clear. In the USA, courts and legislatures have rejected individuals’ exclusive ownership claims to both biospecimens [19,20,21] and data [6]. Nonetheless, individuals do have recognized rights and interests related to health and genomic data. For example, individuals have a right under the HIPAA Privacy Rule to access and retain copies of their health data. However, since they do not exclusively own those data, they cannot prevent care delivery institutions from retaining copies of the data as they are legally obligated to do so in order to have a proper record of each patient’s care. In addition, the concept of exclusive ownership is in direct tension with the notion of a commons and is antithetical to the goals of the MIC. Governance structures in the MIC should focus on these non-mutually exclusive rights and interests, as well as legal and moral concepts such as trusted, custodial, and fiduciary relationships.

Conclusions

Progress in biomedical research and movement toward a learning health system that can fully take advantage of precision medicine will depend on building a robust MIC. The challenges are many and substantial. We propose eight principles that, if built into data-sharing infrastructure and practices, can improve prospects for developing a trusted MIC. While there is a moral obligation to use the data and a duty toward the people who are contributing the data, the moral imperative alone is insufficient to make data sharing successful and sustainable. There must be standard approaches to policy and governance of data initiatives in the MIC ecosystem (principle 1) that bring together data from diverse individuals (principle 2). It is essential that participants reside at the center of the MIC (principle 3), under-represented populations are engaged (principle 4), and there is investment in efforts beyond informed consent to build and sustain trust (principle 5). Finally, legal, regulatory, and technical barriers and enablers for data sharing must also be considered and updated (principles 6–8).

These eight core principles provide a framework for advancing the policy response to data-sharing opportunities and challenges. If these principles are followed, the resulting MIC can promote broader data use (for both clinical applications and the advancement of research interests), be more inclusive and result in more diverse participation, and accrue more benefits and avoid informational harms to participants. Adoption of these principles by stakeholders will increase the likelihood that the MIC ecosystem will fulfill the promise of precision medicine.

Abbreviations

HIPAA:: Health Insurance Portability and Accountability Act
MIC:: Medical information commons

References

The NationalAcademies Press. Toward precision medicine: building a knowledge network for biomedical research and a new taxonomy of disease. 2011. https://www.nap.edu/catalog/13284/toward-precision-medicine-building-a-knowledge-network-for-biomedical-research. Accessed 8 September 2017.
Google Scholar
Cook-Deegan R, Ankeny RA, Jones KM. Sharing data to build a medical information commons: from Bermuda to the Global Alliance. Annu Rev Genomics Hum Genet. 2017;18:389–415.
Article CAS PubMed Google Scholar
Ostrom E. Analyzing long-enduring, self-organized, and self-governed CPRs. In: Governing the commons: the evolution of institutions for collective action. 1st ed. Cambridge; New York: Cambridge University Press; 1990. p. 58–102.
Chapter Google Scholar
Hess C, Ostrom E, editors. Understanding knowledge as a commons: from theory to practice. Cambridge: MIT Press; 2007.
Google Scholar
Schofield PN, Bubela T, Weaver T, Portilla L, Brown SD, Hancock JM, et al. Post-publication sharing of data and tools. Nature. 2009;461:171–3.
Article CAS PubMed Google Scholar
Evans BJ. Barbarians at the gate: consumer-driven health data commons and the transformation of citizen science. SSRN Scholarly Paper. Rochester: Social Science Research Network; 2016. https://papers.ssrn.com/abstract = 2750347. Accessed 8 September 2017.
Google Scholar
Evans BJ. Power to the people: data citizens in the age of precision medicine. Vanderbilt Entertainment and Technology Law. 2016;19:243–65.
Google Scholar
Erlich Y, Williams JB, Glazer D, Yocum K, Farahany N, Olson M, et al. Redefining genomic privacy: trust and empowerment. PLoS Biol. 2014;12:e1001983.
Article PubMed PubMed Central Google Scholar
National Congress of American Indians. American Indian and Alaska Native Genetics Resource Center. http://genetics.ncai.org/. Accessed 8 Sept 2017.
Joosten YA, Israel TL, Williams NA, Boone LR, Schlundt DG, Mouton CP, et al. Community engagement studios: a structured approach to obtaining meaningful input from stakeholders to inform research. Acad Med. 2015;90:1646–50.
Article PubMed PubMed Central Google Scholar
Evans BJ, Dorschner MO, Burke W, Jarvik GP. Regulatory changes raise troubling questions for genomic testing. Genet Med. 2014;16:799–803.
Article PubMed PubMed Central Google Scholar
Mattison J. Secondary use of protected health information. In: Information privacy in the evolving healthcare environment. HIMSS; 2013. p. 149–70.
Google Scholar
Stead WW. Recommendations on de-identification of protected health information under HIPAA. 2017. https://www.ncvhs.hhs.gov/wp-content/uploads/2013/12/2017-Ltr-Privacy-DeIdentification-Feb-23-Final-w-sig.pdf. Accessed 8 Sept 2017
McGuire AL, Oliver JM, Slashinski MJ, Graves JL, Wang T, Kelly PA, et al. To share or not to share: a randomized trial of consent for data sharing in genome research. Genet Med. 2011;13:948–55.
Article CAS PubMed PubMed Central Google Scholar
Burstein MD, Robinson JO, Hilsenbeck SG, McGuire AL, Lau CC. Pediatric data sharing in genomic research: attitudes and preferences of parents. Pediatrics. 2014;133:690–7.
Article PubMed PubMed Central Google Scholar
Shelton RH. Electronic consent channels: preserving patient privacy without handcuffing researchers. Sci Transl Med. 2011;3:69cm4.
Article PubMed Google Scholar
Kaye J, Whitley EA, Lund D, Morrison M, Teare H, Melham K. Dynamic consent: a patient interface for twenty-first century research networks. Eur J Hum Genet. 2015;23:141–6.
Article PubMed Google Scholar
Horn EJ, Edwards K, Terry SF. Engaging research participants and building trust. Genet Test Mol Biomarkers. 2011;15:839–40.
Article PubMed Google Scholar
Moore v. Regents of the University of California, 51 Cal. 3d 120 (1990); 271 Cal. Rptr. 146; 793 P.2d 479 [https://www.courtlistener.com/opinion/2608931/moore-v-regents-of-university-of-california/]
Greenberg v. Miami Children’s Hospital Res. Inst., Inc., 264 F. Supp. 2d 1064 (S.D. Fl. 2003) [https://www.courtlistener.com/opinion/2507167/greenberg-v-miami-childrens-hospital-res-inst-inc/]
Washington University v. Catalona, 437 F. Supp. 2d 985 (E.D.Mo. 2006) [https://www.courtlistener.com/opinion/2492481/washington-university-v-catalona/]

Download references

Acknowledgements

We thank the following individuals for their contribution to the materials used to engage the expert stakeholders: Erika Versalovic, Juli Bollinger, Wendy Lee, Maureen Maurer, Alexis Abboud, and Caroline Abboud.

Funding

The work for this project is supported by the National Institutes of Health, National Human Genome Research Institute grant R01 HG008918. We also acknowledge the following funding sources K01 HG008818 (NG), P20 HG007243 (BK), U01 HG006507 and U01 HG007307-02S2 (BE), U54 HG007963 (IK), CC2-GA-2000-52 (ST), U01 HG006500 (AM), and the Robert Wood Johnson Foundation (BE).

Author information

Authors and Affiliations

American Institutes for Research, Health Care Group, 100 Europa Drive, Suite 315, Chapel Hill, NC, 27517, USA
Patricia A. Deverka
Center for Medical Ethics and Health Policy, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
Mary A. Majumder, Angela G. Villanueva & Amy L. McGuire
Deloitte, 1919 N Lynn, Arlington, VA, 22209, USA
Margaret Anderson
Children’s Tumor Foundation, 120 Wall Street, 16th Floor, New York, NY, 10005, USA
Annette C. Bakker & Salvatore La Rosa
Department of English, Concordia University, 1455 Boulevard de Maisonneuve O, Montreal, QC, H3G 1M8, Canada
Jessica Bardill
Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston and Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
Eric Boerwinkle
Faculty of Health Sciences, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada
Tania Bubela
Law Center and Department of Electrical and Computer Engineering, University of Houston, 4604 Calhoun Road, Houston, TX, 77004, USA
Barbara J. Evans
Treuman Katz Center for Pediatric Bioethics, Seattle Children’s Research Institutes, 1900 Ninth Avenue, Room 677, Seattle, WA, 98101, USA
Nanibaa’ A. Garrison
Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA
Richard A. Gibbs
23andMe, 899 West Evelyn Avenue, Mountain View, CA, 94041, USA
Robert Gentleman
Verily Life Sciences LLC, 269 E. Grand Avenue, South San Francisco, CA, 94080, USA
David Glazer
Milken Institute School of Public Health, The Georgetown Washington University, 950 New Hampshire Avenue, NW, Second Floor, Washington, DC, 20052, USA
Melissa M. Goldstein
Center for Law and the Biosciences, Stanford University, Neukom N361, Stanford, CA, 94305, USA
Hank Greely
Illumina, Inc., 5200 Research Pl, San Diego, CA, 92122, USA
Crane Harris
Centre of Genomics and Policy, McGill University, Montreal, 740 Avenue Drive Penfield, Suite 5200, Montreal, Quebec, H3A 0G1, Canada
Bartha M. Knoppers
Institute for Health and Aging, and Department of Anthropology, History, and Social Medicine, University of California San Francisco, San Francisco, CA, 94143, USA
Barbara A. Koenig
Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, MA, 02115, USA
Isaac S. Kohane
Kaiser Permanente, University of California San Diego and Singularity University, 4965 Maynard Street, San Diego, CA, 92122, USA
John Mattison
Center for Population Genomics, Boston Veterans Administration Healthcare, 73 Mount Wayte Avenue, Framingham, MA, 01702, USA
Christopher J. O’Donnell
Duke University School of Law; Duke Innovation and Entrepreneurship Initiative, 210 Science Drive, Box 90360, Durham, NC, 27708, USA
Arti K. Rai
Harvard Medical School, Brigham and Women’s Hospital and The Broad Institute of MIT and Harvard, 65 Landsdowne Street, Cambridge, MA, 02139, USA
Heidi L. Rehm
Division of Policy, Communications, and Education, National Human Genome Research Institute 31 Center Drive, Room 4B09, Bethesda, MD, 20892, USA
Laura L. Rodriguez
Private Access, Inc, 19800 MacArthur Boulevard, Suite 300, Irvine, CA, 92612, USA
Robert Shelton
Broad Institute of Harvard and MIT, 415 Main Street, Cambridge, MA, 02142, USA
Tania Simoncelli
Genetic Alliance, 4301 Connecticut Avenue, NW, Suite 404, Washington, DC, 20008, USA
Sharon F. Terry
American College of Medical Genetics and Genomics, 7101 Wisconsin Avenue, Suite 1101, Bethesda, MD, 20814, USA
Michael S. Watson
Sage Bionetworks, 1100 Fairview Avenue N., Mailstop M1-C108, Seattle, WA, 98109, USA
John Wilbanks
School for the Future of Innovation in Society and Consortium for Science, Policy and Outcomes, Arizona State University, and Senior Fellow, FasterCures, a Center of the Milken Institute, 1834 Connecticut Avenue, NW, Washington, DC, 20009, USA
Robert Cook-Deegan

Authors

Patricia A. Deverka
View author publications
You can also search for this author in PubMed Google Scholar
Mary A. Majumder
View author publications
You can also search for this author in PubMed Google Scholar
Angela G. Villanueva
View author publications
You can also search for this author in PubMed Google Scholar
Margaret Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Annette C. Bakker
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Bardill
View author publications
You can also search for this author in PubMed Google Scholar
Eric Boerwinkle
View author publications
You can also search for this author in PubMed Google Scholar
Tania Bubela
View author publications
You can also search for this author in PubMed Google Scholar
Barbara J. Evans
View author publications
You can also search for this author in PubMed Google Scholar
Nanibaa’ A. Garrison
View author publications
You can also search for this author in PubMed Google Scholar
Richard A. Gibbs
View author publications
You can also search for this author in PubMed Google Scholar
Robert Gentleman
View author publications
You can also search for this author in PubMed Google Scholar
David Glazer
View author publications
You can also search for this author in PubMed Google Scholar
Melissa M. Goldstein
View author publications
You can also search for this author in PubMed Google Scholar
Hank Greely
View author publications
You can also search for this author in PubMed Google Scholar
Crane Harris
View author publications
You can also search for this author in PubMed Google Scholar
Bartha M. Knoppers
View author publications
You can also search for this author in PubMed Google Scholar
Barbara A. Koenig
View author publications
You can also search for this author in PubMed Google Scholar
Isaac S. Kohane
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore La Rosa
View author publications
You can also search for this author in PubMed Google Scholar
John Mattison
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. O’Donnell
View author publications
You can also search for this author in PubMed Google Scholar
Arti K. Rai
View author publications
You can also search for this author in PubMed Google Scholar
Heidi L. Rehm
View author publications
You can also search for this author in PubMed Google Scholar
Laura L. Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Robert Shelton
View author publications
You can also search for this author in PubMed Google Scholar
Tania Simoncelli
View author publications
You can also search for this author in PubMed Google Scholar
Sharon F. Terry
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Watson
View author publications
You can also search for this author in PubMed Google Scholar
John Wilbanks
View author publications
You can also search for this author in PubMed Google Scholar
Robert Cook-Deegan
View author publications
You can also search for this author in PubMed Google Scholar
Amy L. McGuire
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

PD led the synthesis of commentary from the expert stakeholders and drafted the manuscript. All authors contributed to the content and revisions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Amy L. McGuire.

Ethics declarations

Competing interests

Baylor College of Medicine (BCM) and Miraca Holdings, Inc. have formed a joint venture with shared ownership and governance of the Baylor Genetics Laboratories. All co-authors at BCM have no competing interests. MA was employed at FasterCures during the development of this manuscript and is currently employed by Deloitte. RG is employed by and a shareholder in 23andMe, plus is on the Scientific Advisory Board for the Bioconductor Project and Elixir (EU). DG is employed by Verily. CH is employed by Illumina, Inc. HLR is on the Scientific Advisory Board of Genome Medical, Inc. RS is the CEO of Private Access, Inc. The remaining authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Deverka, P.A., Majumder, M.A., Villanueva, A.G. et al. Creating a data resource: what will it take to build a medical information commons?. Genome Med 9, 84 (2017). https://0-doi-org.brum.beds.ac.uk/10.1186/s13073-017-0476-3

Download citation

Published: 22 September 2017
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s13073-017-0476-3

Creating a data resource: what will it take to build a medical information commons?

Abstract

Background

Core principles

1. The MIC should be a healthy “ecosystem” of data initiatives connected through a standard approach to policy, interoperability, and collaborative work

2. The MIC must bring together diverse sources of data from individuals with different states of health

3. A participant-centric model is critical for the sustainability of the MIC

4. It is important to reach out to and engage under-represented populations and to investigate the feasibility and acceptability of a public health approach

5. Building trust is an iterative process and requires investment of efforts beyond informed consent

6. Regulatory policies that rely on a sharp distinction between the “kingdom of research” and the “kingdom of clinical care” must be reconsidered

7. Changes in technology and in the scale and scope of data sharing demand reconsideration of current policy frameworks related to privacy and security

8. Distinguishing data ownership from data access and control is critical. Notions of unitary, exclusive property rights to data run counter to building the MIC

Conclusions

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Genome Medicine

Contact us

Creating a data resource: what will it take to build a medical information commons?

Abstract

Background

Core principles

1. The MIC should be a healthy “ecosystem” of data initiatives connected through a standard approach to policy, interoperability, and collaborative work

2. The MIC must bring together diverse sources of data from individuals with different states of health

3. A participant-centric model is critical for the sustainability of the MIC

4. It is important to reach out to and engage under-represented populations and to investigate the feasibility and acceptability of a public health approach

5. Building trust is an iterative process and requires investment of efforts beyond informed consent

6. Regulatory policies that rely on a sharp distinction between the “kingdom of research” and the “kingdom of clinical care” must be reconsidered

7. Changes in technology and in the scale and scope of data sharing demand reconsideration of current policy frameworks related to privacy and security

8. Distinguishing data ownership from data access and control is critical. Notions of unitary, exclusive property rights to data run counter to building the MIC

Conclusions

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Genome Medicine

Contact us