Text and Data Mining with TDM Studio

The EUR now has a license for TDM Studio (ProQuest), a platform for text and data mining research on all ProQuest content to which the EUR subscribes. It is a licensed alternative to web scraping, which is usually not allowed.

The platform provides access to more than 16,000 resources. The content includes historical newspapers (such as the New York Times, the Wall Street Journal and the Times of India), scholarly journals, trade journals, books, reports, magazines, blogs, podcasts and websites.

The platform consists of two parts. The data visualisation area requires no knowledge of programming languages. The analysis environment (the TDM workbench) is accessible with R and Python. To prevent copyright infringement, the platform does not allow retrieval of the (full) text of the resources. Users can, however, store, export and share all analytics results and derived content.

Users need to register here with their full EUR email address to get access to TDM Studio and the data visualisation area. If you want access to the TDM workbench, please send an email to edsc@eur.nl and briefly explain what you want to do, as the number of user accounts is limited. Note that workbench access is intended for experienced R/Python users.

More information

For more information, contact edsc@eur.nl.
