Skip to Main Content

Data Management for Wits: Data Life Cycle

The following is general advice,data varies hugely between types of research and projects.

Data Life Cycle

Research Data Life cycle

Research data management (or RDM) is a term that describes the organization, storage,preservation, and sharing of data collected and used in a research project.It involves the everyday management of research data during the lifetime of a research project (for example, using consistent file naming conventions).It also involves decisions about how data will be preserved and shared after the project is completed (for example, depositing the data in a repository for long-term archiving and access).

”Data is whatever you use to conclude results"

Research Data Life cycle adatpted from DataOne by Wits Data services with thanks to Susie Allard

The data life cycle provides a high level overview of the stages involved in successful management and preservation of data for use and reuse. Multiple versions of a data life cycle exist with differences attributable to variation in practices across domains or communities.


plan collect assure describe preserve discover integrate analyze

Research Data Life cycle has eight components:

Plan:Project management and action plan regarding all organisation and processing of data for the purposes of the Research project .
Collect:Systematically gather information through observations, experimentation, explorations: theoretical and artistic. It can also be a version of discovery.
Assure: Perform systematic quality processes during collection, entry, and analysis such that the data can be audited and claims made by evidence proved. It can also be an iteration of collection.
Analysis: Examine data to understand and form conclusions about the nature and relationship of its components or practically to see what effects or use can be made of the information contain therein.
Describes:Metadata : describes,annotates and provides context so data can be easily discovered, understood, and reproduced , this includes documentation and methods. Metadata in legal terms is knowhow. It can constitution an additional layer of data or a first stage of analysis via structuring and organisation
Integrate: Combine data from disparate sources to form one linked set of data that can be readily analysed, or organise and add to data such that it can speak to conclusion from other data. In systematic review meta analysis.
Discover:Locate and obtain relevant data for use in research, or discovery including academic literature and existing databases.
Preserve:Actions taken (use of sustainable formats, deposit in trusted repository) to ensure data is accessible for a set period.

Some research activities might use only part of the life cycle; for instance, a project involving meta-analysis might focus on the Discover, Integrate, and Analyze steps, while a project focused on primary data collection and analysis might bypass the Discover and Integrate steps. In addition, other projects might not follow the linear path depicted here, or multiple revolutions of the cycle might be necessary. Further, some scientists or teams (e.g. those engaged in modeling and synthesis) may create new data in the process of discovering, integrating, analyzing, and synthesizing existing data.

Key Concepts

Data collection is the systematic recording of information;

Data analysis involves working to uncover patterns and trends in datasets;

Data interpretation involves explaining those patterns and trends. Scientists interpret data based on their background knowledge and experience; thus, different scientists can interpret the same data in different ways. By publishing their data and the techniques they used to analyze and interpret those data, scientists give the community the opportunity to both review the data and use them in future research.

From: Visionlearning

How to name a file

How to name a file

  1. It should explain, completely and comprehensively, what is in the document: Focus G: male: corruption, jobs, family pressure.
  2. Give your project a name, and have a rule for naming sub files and types: like Drafts: general introduction supervisor comments: fix flow and citations, data: translated transcription incomplete Zulu.
  3. If you are repeatedly saving a document or a dataset change its name.  if you are working on it, the content will change. Rule of thumb 5 to 6.
  4. Get version control right, we recommend GitHub, Open science framework or any stats software.

And if it called draftfinalfinal.doc with comments or bits of your research project are all over the place in different folders, well join the club, then please go back and rename your files properlyBut to properly backup data it needs to be organized