Extraction of Semi-Structured Online Data

Thesis Type Bachelor
Thesis Status
Student Julien Poissonnier
Thesis Supervisor

SnoopyDB is a database system developed by the Databases and Information Systems group at the University of Innsbruck which aims to avoid schema proliferation by providing recommendations for properties and values. These recommendations are based on existing data stored in the database. To import data from various JSON APIs found on the web in order to improve the quality of the provided recommendations, an application is developed which allows to easily import data without needing to manually create a custom importer for each such web service.