Go to main content

Introducing Open Data Format

Introducing Open Data Format: A Platform-Independent, Non-Proprietary, Metadata-Enriched, Multilingual Data Format and its Implementation in R and Stata.

A Platform-Independent, Non-Proprietary, Metadata-Enriched, Multilingual Data Format and its Implementation in R and Stata

Publication details

Authors:
Xiaoyao Han, Tom Hartl, Knut Wenzig
Publication Date:
27.11.2024
Number:
10/2024
DOI:
10.5281/zenodo.14215268
Proposal for Citation:
Han, X., Hartl, T. & Wenzig, K. (2024). Introducing Open Data Format: A Platform-Independent, Non-Proprietary, Metadata-Enriched, Multilingual Data Format and its Implementation in R and Stata. KonsortSWD Working Paper 10/2024. Konsortium für die Sozial-, Verhaltens-, Bildungs- und Wirtschaftswissenschaften (KonsortSWD). https://doi.org/10.5281/zenodo.14215268

ABSTRACT
This paper introduces the Open Data Format (ODF), a new, non-proprietary, multilingual, metadata enriched, and zip-compressed data format that meets the FAIR Guiding Principles for scientific data management and stewardship. The data format is specified as a CSV file with the raw data and an XML file containing the metadata both compressed into a zip file with the .zip extension. Data files can be enriched with multilingual metadata following the forthcoming DDI Codebook 2.6 standard. The paper also introduces software packages for R (opendataformat) and Stata (opendf) that provide import and export filters and enable data users to work with ODF data files in the respective environment.

Keywords: ODF, Open Data Format, Metadata, DDI Codebook, Multilingual, opendataformat, opendf, R-Package, Stata Package