Data organisation

Data organisation refers to the way data is arranged so that it can be retrieved. Good data organisation makes your data easier to find, both for yourself and for other data users. Clear folder structures also make your data easier to manage, which matters, for example, for simple and reliable access control over sensitive data.

FAQ

Why should I introduce data management requirements at the start of my project?

Clear data storage structures ensure that data is easy to retrieve. This is especially important when several people work on and with the data. The easiest way to improve retrievability is to establish binding rules and structures at the very beginning of your project and to adhere to them.

The goal of data management is to maintain an overview of the existing data, including all backup copies and versions, at all times. This minimizes both data loss and the risk of working with outdated files.

A clear data structure is also essential for the preservation and reuse of the data. By following a few rules for data organisation right from the start, you can avoid the tedious task of sorting your data after project completion.

How should I structure my data storage?

Most likely, your Research Unit will already have a well-established storage structure in place, which can simply be adopted.

The following recommendations generally apply to data storage systems:

Store all data in folders and subfolders, sorted by structure and content.
Work with a maximum of three subfolder levels.
Name the folders so that the content is clearly recognizable.
Use the same folder structures in all projects, if possible.
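
The three-level recommendation above can be checked automatically. The following is a minimal sketch, not an official tool: the function name `folders_too_deep` is hypothetical, and it assumes that folders directly inside the project root count as level 1.

```python
from pathlib import Path

# Limit taken from the recommendation above; how levels are counted
# (folders directly inside the project root are level 1) is an assumption.
MAX_SUBFOLDER_LEVELS = 3


def folders_too_deep(project_root, max_levels=MAX_SUBFOLDER_LEVELS):
    """Return all folders nested more than max_levels below project_root."""
    root = Path(project_root)
    too_deep = []
    for path in root.rglob("*"):
        if path.is_dir():
            # Number of folder levels between the root and this folder.
            depth = len(path.relative_to(root).parts)
            if depth > max_levels:
                too_deep.append(path)
    return sorted(too_deep)
```

Calling `folders_too_deep("my_project")` (with a placeholder path) returns an empty list if the structure stays within the limit.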

It is advisable to work with predefined standard systems within working groups or projects. In addition to a clear description of the content, these should also take into account the development and modification of documents and data sets during the project.

What is important when naming files?

It is advisable to keep the following guidelines in mind when naming files:

Number your data sets consecutively.
Choose short, meaningful names. The names should consist only of letters, numbers, underscores and hyphens. Avoid spaces, slashes, umlauts and special characters. Only use abbreviations that are listed in your data naming rules.
If a file name needs several elements to be unique, begin with the most general element and become more specific from there. In any case, make sure that the file name does not get too long. Separate the individual elements with underscores.
If you do not use automatic version control, you should manually assign a version number and the date in YYYYMMDD format to altered data and documents.
Mark final versions of edited files with the word "final". Handle this marking with care to avoid constructs such as "...final_final_final...".
If several people edit a file, identify the editor by initials or a name abbreviation.

Example: 01_Labdata2017_V2_20181121_AW
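
File names that follow these guidelines can also be generated by a small helper. The following is a minimal sketch under stated assumptions: the function name `build_file_name`, its parameters, and the two-digit running number are illustrative choices modelled on the example above, not a prescribed scheme.

```python
import re
from datetime import date

# Only letters, numbers, underscores and hyphens are allowed (see guidelines).
ALLOWED_CHARACTERS = re.compile(r"[A-Za-z0-9_-]+")


def build_file_name(number, description, version, changed_on, editor):
    """Join the name elements with underscores, from general to specific."""
    parts = [
        f"{number:02d}",                 # running number (two digits assumed)
        description,                     # short, meaningful description
        f"V{version}",                   # manually assigned version number
        changed_on.strftime("%Y%m%d"),   # date in YYYYMMDD format
        editor,                          # initials of the editor
    ]
    name = "_".join(parts)
    if not ALLOWED_CHARACTERS.fullmatch(name):
        raise ValueError(f"File name contains characters to avoid: {name!r}")
    return name


print(build_file_name(1, "Labdata2017", 2, date(2018, 11, 21), "AW"))
# prints 01_Labdata2017_V2_20181121_AW
```

Descriptions that contain spaces, umlauts or other special characters raise a `ValueError` instead of producing a name that breaks the rules.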

What do I need tags for?

You can use tags to assign keywords to files. Tags help you to find and organize files, for example by searching for tag names across folders. A file can have an unlimited number of tags.

It is advisable to keep the following guidelines in mind when naming tags:

Keep the names short: use one or two words only.
Be consistent with names, including upper/lower case, singular/plural and symbols.
You can express hierarchy levels in the tags, for example raw data (higher level) + X-ray images (lower level).
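
Consistency of tag names can be checked with a short script. The following is a minimal sketch with hypothetical function names; it only detects tags that differ in upper/lower case or spacing and tags with more than two words, and it does not detect singular/plural variants.

```python
from collections import defaultdict


def find_inconsistent_tags(tags):
    """Group tags that differ only in upper/lower case or spacing."""
    variants = defaultdict(set)
    for tag in tags:
        # Collapse whitespace and ignore case to compare spellings.
        key = " ".join(tag.split()).casefold()
        variants[key].add(tag)
    return {key: sorted(found) for key, found in variants.items() if len(found) > 1}


def find_long_tags(tags, max_words=2):
    """Return tags with more than max_words words."""
    return sorted({tag for tag in tags if len(tag.split()) > max_words})
```

For the tags "raw data", "Raw Data" and "X-ray images", `find_inconsistent_tags` reports the two spellings of "raw data" as one group.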

Depending on the software, there are different ways to assign tags. For Windows files, for example, you can add tags in the "Tags" field of the "Save As" dialog box or in the details pane of Windows Explorer. If you select several files first, you can assign tags to all of them at once.
