Paper: Exploring the UK business landscape using unsupervised learning

This was my MSc thesis project, which I presented at the Data for Policy conference in 2017.

Policy interventions have to be timely and tailored to specific sectors of the economic ecosystem to maximise their potential impact. We propose a system based on open data that offers policy makers two capabilities. First, it enables them to explore the digital and tech company space with high granularity through keywords, specific technologies or company names, and to identify relevant organisations and those most similar to them. Second, it provides an overview of the ecosystem by creating thematic topics that characterise the activities of these companies. We demonstrate the effectiveness of this system in three activity areas not currently captured by the Standard Industrial Classification (SIC) codes.
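
To make the first capability more concrete, here is a minimal sketch of keyword search and "most similar company" lookup over short text descriptions. It is an illustration only: the company names and descriptions are invented, the helper names (`search`, `most_similar`) are hypothetical, and TF-IDF with cosine similarity is a simple stand-in chosen for the example, not necessarily the method used in the paper.

```python
import math
import re
from collections import Counter

# Hypothetical toy data; the real system is built on open data about UK companies.
COMPANIES = {
    "Alpha Robotics": "industrial robotics and machine learning for factories",
    "Beta Health": "machine learning for medical imaging and diagnostics",
    "Gamma Games": "mobile games studio and virtual reality experiences",
    "Delta VR": "virtual reality headsets and immersive training software",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_idf(docs):
    """Smoothed inverse document frequency for every term in the corpus."""
    docs = list(docs)
    n = len(docs)
    df = Counter(term for doc in docs for term in set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}


def vectorize(text, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are ignored."""
    counts = Counter(tokenize(text))
    return {term: tf * idf[term] for term, tf in counts.items() if term in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


IDF = build_idf(COMPANIES.values())
VECTORS = {name: vectorize(desc, IDF) for name, desc in COMPANIES.items()}


def search(query, k=2):
    """Return the k companies whose descriptions best match a keyword query."""
    q = vectorize(query, IDF)
    ranked = sorted(VECTORS, key=lambda name: cosine(q, VECTORS[name]), reverse=True)
    return ranked[:k]


def most_similar(name, k=1):
    """Return the k companies most similar to a named company."""
    target = VECTORS[name]
    others = [other for other in VECTORS if other != name]
    return sorted(others, key=lambda o: cosine(target, VECTORS[o]), reverse=True)[:k]
```

With this toy data, `search("virtual reality")` surfaces the two companies whose descriptions mention virtual reality, and `most_similar("Alpha Robotics")` returns the other machine learning company. The second capability, thematic topics, would need a topic model fitted on the same descriptions and is not sketched here.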

Read the paper here.