Repositories enable the storage, publication and findability of digital resources.
Via user interfaces, you can upload your research data together with contextual information and determine the use of your data with appropriate licences. Interested third parties can find and view your (open) data through common search engines on the internet and download and reuse it if defined so. Metadata, opens an external URL in a new window and persistent identifiers, opens an external URL in a new window ensure that datasets are findable and citable and increase the visibility of researchers and their associated research institutions.
The TU Wien does not yet operate its own data repository.
However, since the need for a TU-owned service for the publication and long-term storage of research data is considered high, the gradual establishment of suitable services is planned. In addition to the TU-GitLab instance, opens in new window that has already been realized, an institutional data repository is expected to be available to members of the TU Wien in autumn 2021. The implementation takes place within the FAIR Data Austria, opens an external URL in a new window project.
ReposiTUm, opens an external URL in a new window is a document server and can be used by you if your research data is available as text files. ReposiTUm is a service run by the TU Wien Bibliothek and primarily intended for making scientific publications from the TU Wien available open access and as full texts.
Types of publications/documents stored at reposiTUm:
Journal articles (pre-print, post-print, publisher’s version), book chapters, conference papers, conference lectures/poster presentations, books, scientific reports (working papers, reports), publications published by TU Wien (conference proceedings, journals), theses (doctoral theses, diploma theses, master’s theses)
You can find more information in the ReposiTUm FAQs, opens an external URL in a new window.
The global Register of Research Data Repositories re3data, opens an external URL in a new window can help you find a research data repository for specific disciplines. A similar platform is the DataCite Repository Finder, opens an external URL in a new window, which only lists disciplinary repositories which follow the FAIR-Principles.
If you cannot find an appropriate disciplinary repository for your needs, you can simply use a cross-disciplinary repository. Some of these general repositories, such as Zenodo, opens an external URL in a new window or Software Heritage, opens an external URL in a new window (for software in source code format), are financed by public funds or donations and are free of charge for the user. The operators of other repositories, such as Dryad, opens an external URL in a new window or Figshare, opens an external URL in a new window, charge for use. Preservation and publication costs should therefore be determined at an early stage and, if necessary, taken into account in the financial planning of your project.
The Zenodo Fact Sheet, opens a file in a new window offers brief information about Zenodo and the three steps to preservation and publication.
When selecting a repository, it may be helpful to consider the following factors. Good to know: For directories like re3data.org it is possible to set search filters according to these criteria.
- Does the repository issue persistent identifiers?
- What cost models does the repository offer? The price can, for example, depend on the amount of data.
- Does the repository offer clear conditions that comply with legal requirements (e.g. for data protection)?
- Does the repository offer the licence models I need (e.g. Creative Commons, special licences for software)?
- What access models does the repository offer? This question depends on how you want to make your data available - should it be accessible to the public (open access), for a limited group of users or just stored?
- Can I (temporarily) restrict access to my data, e.g. due to embargo periods?
- Is the repository certified as a "trusted data archive"? Such certifications are, for example, the CoreTrustSeal, the Nestor Seal or ISO 16363 certified depositories. Good to know: These seals also support the FAIR principles
- Does the repository provide access and download statistics?
- Are there any hints as to how the stored data should be cited?
- What is the longevity of the service provider (may depend on economic and political factors)?
- In which jurisdiction is the repository located? Note: There are different legal situations in Europe and the USA.
In order to avoid unplanned costs or increased time expenditure when uploading your data sets, you can consider a number of factors in advance:
- make sure that your selected repository supports your file formats and data types
- it may be useful to bundle all relevant data and documentation files in a zip file prior to uploading
- have the contextual metadata ready, e.g. the responsible person(s), the Principal Investigator, all project participants, information on the funding programme, licenses (make sure that you clarify before the upload whether you are authorized to assign a license)
- have any supplementary material ready to hand
- provide your data documentation, that is all information necessary for the traceability, interpretation and re-use of your data; if the repository of your choice does not have the appropriate metadata fields, we recommend uploading a clearly structured text file.
- if your records are associated with software, provide version information and a link to the location where the software is stored (e.g., Github)
- please note that multilingualism increases the findability of your data: at least for the abstract a description in English is necessary
- The visibility, efficiency and compliance of your research activities will improve.
- Your research will be more transparent and more comprehensible.
- Your data will be safely preserved and can be easily retrieved.
- Your data will be citable.
- You can thus increase your impact factor by quoting persistent internet addresses (e.g. DOIs).
- Collaborations and interdisciplinary exchange will be facilitated.
- Your data, which can be costly and labour-intensive to generate, can be used by others and thus repeatedly serve the research community in a resource-conserving manner.
- In return, you benefit from the publicly accessible data of others.
- You will comply with the policies and guidelines of funding bodies.
- Some journals only accept articles if you publish the underlying data.