Text and data mining (TDM)
The term "text and data mining" (TDM) refers to processes of automated extraction of information from large quantities of texts or data (corpora). Information can be derived from unstructured or weakly structured text data (text mining) or from strucured data (data mining).
However, many content providers enable access via special interfaces (APIs). If you want to make use of these APIs, please contact us.
Comprehensive lists of free text and data sources
- Content Mining: Free Corpora for mining (University of Southern California Libraries)
- Text mining & text analysis > Open sources (The University of Queensland Library)
More details will shortly be available in the coLAB, opens an external URL in a new window virtual learning space.