Student perspectives: Discovering the true positions of objects from pairwise similarities

A post by Annie Gray, PhD student on the Compass programme.

Introduction

Initially, my Compass mini-project aimed to explore what we can discover about objects given a matrix of similarities between them. More specifically, how to appropriately measure the distance between objects if we represent each as a point in \mathbb{R}^p (the embedding), and what this can tell us about the objects themselves. This led to discovering that the geodesic distances in the embedding relate to the Euclidean distance between the original positions of the objects, meaning we can recover the original positions of the objects. This work has applications in fields that work with relational data for example: genetics, Natural Language Processing and cyber-security.

This work resulted in a paper [3] written with my supervisors (Nick Whiteley and Patrick Rubin-Delanchy), which has been accepted at NeurIPS this year. The following gives an overview of the paper and how the ideas can be used in practice.

(more…)

Welcome Cohort 3

A huge welcome to Compass to our 3rd Cohort of CDT students.  Look out for their updates during this year about their research and experiences.

Edward Milsom, Ben Griffiths, Emerald Dilworth, Hannah Sansford, Daniel Milner, Harry Tata, Dominic Broadbent, Tennessee Hickling, Josh Givens, Ettore Fincato

Compass students Sept21

Compass Away Day

At the end of July 2021, Compass students and staff travelled together to the Brecon Beacons National Park in Wales for a day full of adventure, which was carefully planned by Call of the Wild.

 

 

 

 

 

Activities on the day started with some fun team tasks called the ‘Mini Olympics’. Some of the tasks tested logical thinking, the ability to do a task under time pressure, or simply work as a team to complete a certain objective but ultimately to have fun and a laugh.

 

The tasks were a great opportunity to work together and get to know each other better. Some of them have been more difficult to complete than our students and staff initially expected, but very enjoyable.

 

 

After lunch Compass students and staff started a 3-hour Canyoning adventure, guided by the very well trained Call of the Wild team.

The best way of describing this canyoning activity is white water rafting but without the raft. With qualified guides, our students and staff descended a stunning steep sided gorge by various ways and means. This involved sliding down rapids, swimming down rapids, floating down fast flowing chutes and waves, walking behind some breathtaking waterfalls and of course jumping off some jaw dropping waterfalls.

 

 

After this thrilling adventure, Compass students and staff travelled to the Vale Resort where they enjoyed dinner together and leisure time until the day after when it was time to come back to Bristol.

It was wonderful to be able to spend time together after the long months of working from home.

 

 

 

 

 

 

 

Student Perspectives: Contemporary Ideas in Statistical Philosophy

A post by Alessio Zakaria, PhD student on the Compass programme.

Introduction

Probability theory is a branch of mathematics centred around the abstract manipulation and quantification of uncertainty and variability. It forms a basic unit of the theory and practice of statistics, enabling us to tame the complex nature of observable phenomena into meaningful information. It is through this reliance that the debate over the true (or more correct) underlying nature of probability theory has profound effects on how statisticians do their work. The current opposing sides of the debate in question are the Frequentists and the Bayesians. Frequentists believe that probability is intrinsically linked to the numeric regularity with which events occur, i.e. their frequency. Bayesians, however, believe that probability is an expression of someones degree of belief or confidence in a certain claim. In everyday parlance we use both of these concepts interchangeably: I estimate one in five of people have Covid; I was 50% confident that the football was coming home. It should be noted that the latter of the two is not a repeatable event per se. We cannot roll back time to check what the repeatable sequence would result in.

(more…)

Student perspectives: How can we do data science without all of our data?

A post by Daniel Williams, Compass PhD student.

Imagine that you are employed by Chicago’s city council, and are tasked with estimating where the mean locations of reported crimes are in the city. The data that you are given only goes up to the city’s borders, even though crime does not suddenly stop beyond this artificial boundary. As a data scientist, how would you estimate these centres within the city? Your measurements are obscured past a very complex border, so regular methods such as maximum likelihood would not be appropriate.

Chicago Homicides
Figure 1: Homicides in the city of Chicago in 2008. Left: locations of each homicide. Right: a density estimate of the same crimes, highlighting where the ‘hotspots’ are.

This is an example of a more general problem in statistics named truncated probability density estimation. How do we estimate the parameters of a statistical model when data are not fully observed, and are cut off by some artificial boundary? (more…)

Skip to toolbar