University of Twente Student Theses

Login

inSQeLto : a Query Language for Probabilistic Databases

Groot Roessink, Jochem (2021) inSQeLto : a Query Language for Probabilistic Databases.

[img] PDF
277kB
Abstract:A Database Management System (DBMS) is a useful tool to store, manipulate and query data. However, it is always possible that the database contains incorrect data. In most DBMSs, there is no direct way to indicate which data might be incorrect. For this reason, probabilistic databases exist, which have the ability to express how correct data is. An example of such a database is DuBio, currently under development at the University of Twente. DuBio has functionality that can keep track of the probability of correctness for its data and this probability can be presented to users when they query some data. This will help in identifying incorrect data which can then be rectified. The current way of querying data in DuBio is only meant to be a temporary solution and some queries can be quite lengthy and complicated. The goal of this research is to design the query language inSQeLto that can be used for probabilistic databases, and DuBio specifically. While the resulting language looks the same as standard SQL, it works differently under the hood. Namely, the inSQeLto queries are mapped onto DuBio queries, and these can perform the probabilistic functionality. Because of this existing SQL queries can now be used to interact with DuBio in a probabilistic way. This language could help spread the use of probabilistic databases. This could lead to a decrease in the amount of incorrect data that is being used, which will have a positive impact on an increasingly digital world.
Item Type:Essay (Bachelor)
Faculty:EEMCS: Electrical Engineering, Mathematics and Computer Science
Subject:54 computer science
Programme:Computer Science BSc (56964)
Link to this item:https://purl.utwente.nl/essays/86906
Export this item as:BibTeX
EndNote
HTML Citation
Reference Manager

 

Repository Staff Only: item control page