Any views expressed within media held on this service are those of the contributors, should not be taken as approved or endorsed by the University, and do not necessarily reflect the views of the University in respect of any particular issue.

Towards Large-scale Cultural Analytics in the Arts and Humanities

Towards Large-scale Cultural Analytics in the Arts and Humanities

An AHRC funded project, exploring how to make use of large-scale cultural events data for research

Case Study Part Two: The Data Complexities

frequency of events tagged as ‘Kids’ and ‘LGBT’ in each category

We learned a lot from getting up close with the cultural events data from Data Thistle that we described in the previous blog post. One of the complexities of the data, which has an effect on the results of analyses undertaken with these data, is the difference between categories and tags.

Events are assigned only one category from a static list of 15 categories.

Events can also be assigned multiple tags and these are a crowd-sourced list of descriptors generated by the event creators.

For example, an event categorised as ‘Music’ may also have the tag ‘Folk’ to provide more nuanced information about the genre of music.

The above graph shows the frequency of events tagged as ‘Kids’ and ‘LGBT’ in each category in our dataset.

In the case of ‘Kids’ events, most events tagged as ‘Kids’ are also categorised as ‘Kids’, however, events tagged as ‘Kids’ also appear in most of the other categories as well. This is the same for events tagged as ‘LGBT’.

As can be seen here, tagging and categorisation are not always in agreement. Understanding the difference between querying the data by category, tag or both will affect the findings that can be gleaned from this data by future users.

 

(Image courtesy of Rosa Filgueira, ToLCAAH)

Leave a reply

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

css.php

Report this page

To report inappropriate content on this page, please use the form below. Upon receiving your report, we will be in touch as per the Take Down Policy of the service.

Please note that personal data collected through this form is used and stored for the purposes of processing this report and communication with you.

If you are unable to report a concern about content via this form please contact the Service Owner.

Please enter an email address you wish to be contacted on. Please describe the unacceptable content in sufficient detail to allow us to locate it, and why you consider it to be unacceptable.
By submitting this report, you accept that it is accurate and that fraudulent or nuisance complaints may result in action by the University.

  Cancel