Home > Data analysis

Data analysis

Extraction of information, knowledge or meaning from a digital resource, using techniques such as searching and querying or feature measurement.

See the tabs in the box at the bottom for a list of People and Projects using this category of method at Oxford. The category comprises the subheadings below; click on the expand button next to each one to see a list of detailed methods, with definitions taken from arts-humanities.net. The links on each individual method take you to the national list of related projects there.

Image feature measurement : Image feature measurement is a term to describe techniques used to acquire, measure, and analyse the parameters of digital images, such as size, shape, relative locations, textures, grey tones and colours. These parameters are also known as ‘perception at
Image segmentation : Segmentation refers to the process of partitioning a digital image into multiple segments, also known as superpixels. The goal of segmentation is to simplify and/or change the representation of an image into something that is more meaningful and easie
Sound analysis : Refers to the extraction of information and meaning from sound signals for classification, storage, retrieval and synthesis. Different types of sound, for example voice and music, can be analysed in different ways.
Design analysis : Design analysis is a powerful software technology for simulating physical behaviour on the computer. Instead of building a prototype and developing elaborate testing regimens to analyse the physical behaviour of a product, engineers can elicit this inform
Motion analysis : Motion analysis provides systematic, time-dependent and quantitative data on any movement captured using digital video, as recorded in moving image collections. It is related to motion capture, which is the process of recording movement and translating th
Overlaying : Refers to the techniques used to produce a geometric intersection between two sets of data to highlight features of interest. Overlaying is often used when studying or displaying maps. Specifically, the term ‘overlaying’ refers to the use of vector data.
Spatial data analysis : This method comprises techniques used to analyse spatial (geographic) data. Such techniques include Thiessen polygon analysis, the X-tent principle, cost/friction analysis and network analysis, among others.
Content-based image retrieval : Content-based image retrieval (CBIR) refers to techniques used to search for digital images by features of their content, which is particularly helpful when studying large databases. It is often preferable to perform searches relying on metadata, which ca
Content-based sound retrieval : Refers to techniques used to search for sound files by features of their content, using specialist software, which is particularly helpful when studying large databases. It is often preferable to perform searches relying on metadata, which can be expensiv
Data mining : Data mining is the process of using computing power to extract hidden patterns from data, analysing the results from different perspectives and summarising it into a useful format, such as a graph or table. This process is often facilitated by the use of
Record linkages anaylsis : The term ‘record linkage’ refers to techniques used to link records from different sources, by finding entries that refer to the same entity (e.g. person) in two or more files. These entries can be combined to form individual micro records.
Searching and querying : In this context, ‘Searching and Querying’ refers to the extraction of information from data by means of query languages. This process is very different from queries performed using a web search engine, which are often unstructured and ambiguous.
Topic Detection and Tracking : Topic Detection and Tracking (TDT) refers to systems that monitor topically related material and sources, for example news stories, by algorithmic means and track these as they change over time. This data can be in a variety of different types of media fo

Statistical analysis methods include descriptive statistics, inferential/predictive statistics and kriging.

Collating : Collation is the process of comparing different versions of a text to discover the location and type of textual variants. Collation is fundamental to a variety of scholarly pursuits, for example in the Arts and Humanities field it can be used for the accu
Collocating : Refers to the techniques used to detect patterns of words that appear together in a text more often than would be expected by chance. A collocation is a group or pair of words that are always used together, and can illustrate restrictions on which verbs o
Content analysis : Content analysis is a research technique focused on the content and internal features of media. It is used to determine the presence of certain words, concepts, themes, phrases, characters, or sentences within texts or sets of texts and to quantify this p
Indexing : Indexing refers to techniques used to generate indexes of words in a text, in order that the reader can find information quickly and easily. Although indexing is still usually performed by hand, specialised computer software is often used to facilitat
Parsing : Parsing is an important method used in both computer science and linguistics. The term is synonymous with ‘syntactic analysis’, and refers to the process of taking a sequence (e.g. of characters), determining its structure, and checking whether it is lega
Stemmatics : Refers to techniques used to reconstruct the transmission of a text on the basis of relations between the various surviving manuscripts.
Stylometrics : Refers to the techniques used for the quantitative examination of textual and linguistic styles, which is often used as a method for authorship attribution studies. It can also be applied to music and fine art.
Text mining : Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers generally to the process of deriving high-quality information from text. ‘High quality’ in text mining usually refers to some combination of r

Refers to techniques used to summarise and present data visually, in a form that enables people to understand and analyse the information. Formats can include images, maps, timelines, graphs and tables. Visualisation often uses computer graphics software,

Projects (117)
Poxy: Oxyrhynchus Online Classical Languages Dirk Obbink
Preparing DARIAH Oxford e-Research Centre Lou Burnard
Preserving and Enhancing Access to Historic Oxfordshire Ashmolean Museum
Project Bamboo Oxford e-Research Centre John Pybus
Prosopography of the Byzantine World History Faculty Avril Cameron
Recovering the material and visual cultures of the southern Sudan: A museological resource Pitt Rivers Museum Haas Ezzet
Rethinking Pitt-Rivers: Analysing the Activities of a Nineteenth-Century Collector Pitt Rivers Museum Haas Ezzet
Scene details in Ancient Egyptian monuments: Oxford Expedition's electronic database and publications project (c.2960 - 2040 BC). Oriental Studies Faculty Yvonne Harpur
SECT: Sustaining the EEBO-TCP Corpus in Transition Oxford Internet Institute Eric Meyer
Shields and Objects, Anthropology and Museums Pitt Rivers Museum Haas Ezzet
Sir John Evans Centenary Project Ashmolean Museum
South Cadbury Environs Project Archaeology Institute Gary Lock
Sphakia Survey Classical Archaeology Lucia Nixon
The First World War Poetry Digital Archive IT Services Stuart Lee
The Forster Collection Pitt Rivers Museum Pitt Rivers Museum Paul Groves
The Gascon Rolls 1317-1468 History Faculty Malcolm Vale
The John Johnson Collection: an Archive of Printed Ephemera Bodleian Libraries Research & Learning Services David Tomkins
The Other Within: Analysing the English collections at the Pitt Rivers Museum Pitt Rivers Museum Haas Ezzet
The reuniting of Osip Mandelstam's texts and archives in digital form Medieval & Modern Languages Faculty Jennifer Baines
The Shakespeare Quartos Archive Bodleian Digital Library Systems & Services Judith Siefring

Pages

People (101)
Philip Carter Senior Research and Publication Editor, Oxford Dictionary of National Biography History Faculty
Pieter Francois Associate Professor in Cultural Evolution Social Sciences
Richard Ashdowne Editor, Dictionary of Medieval Latin from British Sources Classical Languages
Richard Sharpe Professor History Faculty
Robert Parker Professor Ancient History
Rohini Jayatilaka English Faculty Senior Research Fellow and Project Manager English Faculty
Sebastian Rahtz Chief Data Architect IT Services
Sergio Grau Puerto Technical Project Officer (Media) Phonetics Laboratory
Sharron Pleydell-Pearce Senior Divisional Research Facilitator Divisional Administration
Simon Horobin Professor of English English Faculty
Sneha Krishnan Brasenose College, Radcliffe Square, Oxford Social Sciences Subject Area - LRS
Sukanta Chaudhuri Visiting Fellow All Souls College
Sushila Burgess DAMS Developer, BDLSS Bodleian Libraries Research & Learning Services
Suzanne Romaine Merton Professor of English Language English Faculty
Tara Stubbs Associate Professor, Oxford University Department for Continuing Education (OUDCE); Non-Tutorial Fellow, Kellogg College Continuing Education - HQ
Terhi Nurmikko-Fuller PDRA - Linked Data Oxford e-Research Centre
Ulrich Gehn Researcher Classical Archaeology
Vincent Razanajao Editor of the Topographical Bibliography and Keeper of the Archive Griffith Institute
Ylva Berglund Prytz Research Technology Specialist IT Services
Zena Kamash Lecturer at Magdalen College and Research Assistant to the Professor of European Archaeology (Institute of Archaeology) Archaeology Institute

Pages