DATA FORMATS
The UKDA works with different data formats for different purposes. There are optimal data formats that are used for long-term preservation and formats that are used for dissemination, which reflect the most popular formats requested by users.
Data files should be stored and offered to UKDA in relatively 'well-known' software formats where possible. Ideally, researchers should also try to ensure that back-ups of master copies of data are always in formats that are suitable for long-term digital preservation. This typically means using open formats (like XML) or as open as possible, as opposed to proprietary ones (e.g. Stata or N-Vivo). Please consult the pages on software and data conversion.
UKDA is able to accept data collections via the following:
- CD-ROM/DVD
- memory stick
- secure FTP
Depositors are advised to consider carefully the risks of using portable media, and are advised to encrypt data files where appropriate. See the UKDA data security pages for further information on protection and encryption, and how to make effective back-ups to prevent data loss.
UKDA does not normally accept large amounts of documentation when sent only in hard copy (paper) format, and reserves the right to ask the depositor to make documentation machine readable (either as text or image files).
The table below gives full technical details of preferred and acceptable formats, but suitable examples for upload include:
- quantitative data: SPSS, Stata, Excel, or tab-delimited ASCII text format (with suitable labels, or a data dictionary)
- qualitative data (e.g. interview transcripts): Rich Text Format (RTF) (may be generated from Microsoft Word) or ASCII text format
- documentation: RTF, Excel, Adobe Portable Document Format (PDF or PFF/A) or ASCII text format
| Type of data | Preferred format for deposit | Other acceptable formats for deposit |
|
Quantitative tabular data with extensive metadata
e.g. a survey dataset with variable labels, code labels, and defined missing values, in addition to the matrix of data |
|
|
|
Quantitative tabular data with minimal metadata
i.e. a matrix of data with or without column headings/variable names, but no other metadata or labelling |
|
|
|
GIS and CAD data
e.g. vector and raster |
|
|
| Qualitative data textual |
|
|
| Digital image data |
|
|
| Digital audio data |
|
|
| Digital video data |
|
Contributors should contact UKDA for advice before file upload. |
| Documentation |
|
In addition to those formats named in the 'preferred' column:
|
For any queries regarding data formats contact acquisitions@esds.ac.uk















