Home > People & Projects > Big UK Domain Data for the Arts and Humanities

Project Details

not specified
Project Name: 
Big UK Domain Data for the Arts and Humanities
Principal Investigator / Director: 
Jane Winters (IHR)
Oxford participants: 
Other Participants: 
not specified
  • Division: Social Sciences
  • Unit: Oxford Internet Institute
  • Sub-Unit: not specified
Start Date: 
End Date: 
Partner organizations (inside or outside Oxford): 
British Library Institute of Historical Research, University of London Aarhus University
Subject Area: 
Project Description: 

Web archives are an increasingly important resource for arts and humanities researchers, yet we have neither the expertise nor the tools to use them effectively. Both the data itself and the process of collection are poorly understood, and it is possible only to draw the broadest of conclusions from current analytical analysis. The Big UK Domain Data for the Arts and Humanities project will work with the dataset derived from the UK domain crawl from 1996 to 2013 (that is, when legal deposit legislation was extended to cover digital materials), totalling approximately 65 terabytes and constituting many billions of words. For the arts and humanities, this is very big data indeed.

ICT Methods: 
CollaborationResource sharing collaboration
Data analysisText AnalysisContent analysis
- -Text mining
Last updated: 
15/12/2015 12:29:06
Updated by: