Data integrity
Data integrity refers to the accuracy and consistency of data over its lifecycle, which is crucial for industries like pharmaceuticals and scientific research that rely on precise and unaltered results. Ensuring data integrity involves the careful collection, classification, and storage of information, and it begins with the organizational culture surrounding data management. Key roles in this process include data producers, providers, and users, each responsible for maintaining the integrity of the data they handle.
The use of metadata is essential in supporting data integrity, as it offers contextual information about the data, helping to establish an audit trail that records all interactions with the data throughout its lifecycle. However, challenges such as transitioning from manual to digital data collection, lack of oversight, and potential for errors can threaten data integrity. Regulatory agencies like the US FDA and the UK's MHRA provide guidelines to promote best practices in data management, emphasizing the importance of training and compliance to uphold integrity. By understanding the dynamics of data integrity, organizations can foster an environment that prioritizes accurate and reliable information.
On this Page
Subject Terms
Data integrity
Data integrity is the maintenance of and assurance of accuracy of data over its lifecycle. Developing methods to collect, classify, and store information is part of ensuring integrity and trusting the data. It is a concept that is vital in industries such as pharmaceuticals and scientific research, as they rely on accurate and unmanipulated results in various testing. Producers, managers, and users play important roles in governing data by fostering an environment that makes data accessible and free of falsification and poor management.
Overview
Data integrity is maintained through stages to ensure accuracy and structure. It starts with an organization's work culture, from project conception to personnel to archiving. Project management leadership is responsible for configuring methods in which data is collected and what data is essential. Management establishes the means of data collection and storage, either manually through print sources or digitally through databases. Leadership also assigns roles to individuals responsible for the data, from data producers (those who generate the data, such as scientists, students, and researchers) to data providers (those who are responsible for data accessibility) to data users (those who access the data for their work). Management also is in charge of training these individuals to support data integrity.
Metadata describes other data included in the data, such as information added to digital data. This can include an assigned user identification that is attached to a value to trace where the data originated, a measuring unit associated with the numbers calculated in research, or a time stamp indicating when the data was last modified or retrieved. The data owner determines what metadata is required and can later use it to check integrity. Metadata can be used to establish an audit trail, or an electronic record that tracks actions involving the data through its lifecycle. An audit trail can pinpoint data creation, modification, and deletion and trace times of accessibility, from its last use to its previous user.
Challenges to data integrity include changing data collection over time and lack of oversight and security. A switch from manual input to digital or an upgrade in technology can alter the values of previous data models. Leaving data unchecked over time also can hinder integrity as errors, falsifications, or result manipulations can live on. Not all collection methods are foolproof. Manual processes can pose the greatest risk to integrity, as it can lack security enforcement and an audit trail that a digital process can provide. Digital data collection can suffer from easier access to distortion and lack of human oversight.
Agencies such as the US Food and Drug Administration (FDA) and the United Kingdom's Medicines and Healthcare Products Regulatory Agency (MHRA) issue recommendations to firms to promote data integrity in testing and research. Their guidance outlines the handling of data, quality control, audit trail review, and personnel training in compliance with current good manufacturing practices (CGMPs). These recommendations can be enforced through inspections, warning letters to management if the firm is noncompliant, or other regulatory actions set up by the agency.
Bibliography
Beck, Bernd, et al. "The Impact of Data Integrity on Decision Making in Early Lead Discovery." Journal of Computer-Aided Molecular Design, vol. 29, no. 9, Sep. 2015, pp. 911–921.
Duggineni, Sasidar. "Data Integrity and Risk." Open Journal and Risk, vol. 12, no. 2, June 2023, doi.org/10.4236/ojop.2023.122003. Accessed 18 Nov. 2024.
McDowall, R. D. "Understanding the Layers of a Laboratory Data Integrity Model." Spectroscopy, vol. 31, no. 4, Apr. 2016, pp. 14–23.
McDowall, R. D. "What Is Data Integrity Training?" Spectroscopy, vol. 30, no. 11, Nov. 2015, pp. 34–41.
National Academy of Science, et al. Ensuring the Integrity, Accessibility, and Stewardship of Research Data in the Digital Age. National Academies Press, 2009.
Newton, Mark E. "Data Integrity in a Nutshell." Scientific Computing, Dec. 2013, pp. 16–18.
"Responsible Conduct in Data Management." Office of Research Integrity, https://ori.hhs.gov/education/products/n‗illinois‗u/datamanagement/dmotopic.html. Accessed 25 Mar. 2017.
US Department of Health and Human Services, et al. "Data Integrity and Compliance with CGMP: Guidance for Industry." Food and Drug Administration, Apr. 2016, www.fda.gov/downloads/drugs/guidances/ucm495891.pdf. Accessed 25 Mar. 2017.
Yu, Yong, et al. "Enhanced Privacy of a Remote Data Integrity-Checking Protocol for Secure Cloud Storage." International Journal of Information Security, vol. 14, no. 4, Aug. 2015, pp. 307–318.