Matthew Sharpe

2papers

2 Papers

IRSep 25, 2020
A review of metadata fields associated with podcast RSS feeds

Matthew Sharpe

Podcasts are traditionally shared through RSS feeds. As well as pointing to the audio files, RSS gives a creator a way of providing metadata about the podcast shows and episodes. We investigate how certain metadata fields associated with podcasts are currently being used and comment on their applicability to recommendations. Specifically, we find that many creators are not using the itunes:type field in the expected fashion, and that using this field for recommendations might not lead to an optimal user experience. We perform similar explorations for the season number and the category associated with a podcast, and also find that the fields aren't being used in the expected fashion. Finally, we examine the notion that a single podcast show is the same as a single RSS feed. This also turns out to not be strictly true in all cases. In short, the metadata associated with many podcasts isn't always reflective of the show and should be used with caution.

CVAug 7, 2019
The Northumberland Dolphin Dataset: A Multimedia Individual Cetacean Dataset for Fine-Grained Categorisation

Cameron Trotter, Georgia Atkinson, Matthew Sharpe et al.

Methods for cetacean research include photo-identification (photo-id) and passive acoustic monitoring (PAM) which generate thousands of images per expedition that are currently hand categorised by researchers into the individual dolphins sighted. With the vast amount of data obtained it is crucially important to develop a system that is able to categorise this quickly. The Northumberland Dolphin Dataset (NDD) is an on-going novel dataset project made up of above and below water images of, and spectrograms of whistles from, white-beaked dolphins. These are produced by photo-id and PAM data collection methods applied off the coast of Northumberland, UK. This dataset will aid in building cetacean identification models, reducing the number of human-hours required to categorise images. Example use cases and areas identified for speed up are examined.