Open Data criteria

For the allocation of the open data incentive as an additional indicator of the LOM (Leistungsorientierte Mittel, performance-oriented funding at the Charité) or, respectively, the analogous funding for the open sharing of research data at the MDC, below criteria regarding data and publications need to be fulfilled. Additional Charité-specific criteria for the respective researcher’s eligibility to receive LOM funding are not considered here. The criteria for the open data incentive as of 2020 (regarding this also see below) are as follows:

  • Research data have been made freely accessible by researchers of the Charité or MDC, respectively
  • The data can be raw, primary, or secondary data (e.g. from analyses of freely available datasets, meta-analyses, or health technology assessments); the data would thus allow the analytical replication (retracing of analysis steps) for at least a part of the study’s results; reporting of statistical values (means, standard deviations, p-values etc.) is not sufficient
  • Data have been shared in the context of an article publication; thus, stand-alone datasets without reference to an article are not considered
  • The publication contained an explicit reference to the dataset(s); a reference to e.g. supplementary materials without further explanation is not sufficient
  • The data are indeed available and can be accessed at the time of checking
  • Data have been shared in a machine-readable format; for tables e.g. CSV, Excel or Word files, but not PDFs or image formats

The open data definition applied does not include:

  1. Analysis scripts, computer programs, and other methods, materials, and protocols, even if their development was the goal of the research project and/or their presentation was the focus of the publication; if for the development or validation data have been collected and shared, these can, however, fall under the open data definition
  2. Data contained within the article text itself, as long as these are not embedded tables, which can be accessed as digital objects for themselves as well
  3. Image, audiovisual, and other data which primarily serve illustrative purposes
  4. Data supporting case reports, unless these were shared in repositories (archives) of the respective discipline
  5. Data, which is only accessible on request or when fulfilling certain requirements, independent of the type of data, the access mechanism chosen, and the justification of these restrictions
  6. Due to misunderstandings we observed, we also ask not to confound open data with open access (i.e., the free availability of article publications)

The application of aforementioned criteria always yields some borderline cases. If you are of the opinion that your or your department’s publication has mistakenly not been classified as an open data publication, please send a short explanatory note to quest@bihealth.de, and we will check this and contact you. In addition, the semi-automated search for open data only takes place within the English-language body of literature. If you should have shared data supporting articles in other languages, please inform us about it.

The inclusion and exclusion criteria are valid for the LOM allocation in 2020 (publication period 2016-2018). The criteria require continuous adjustment, and will be developed further in coming years. As part of this development, further criteria for the reusability of data in line with the FAIR criteria are to be applied and/or data shared under restricted access are to be incentivized, as long as the access restriction is justified and there is a transparent access mechanism.