University of Twente Student Theses

Developing a reproducible workflow for batch geoprocessing social media in a cloud environment

Trosino, Ricardo Morales (2019) Developing a reproducible workflow for batch geoprocessing social media in a cloud environment.

Abstract: The main objective of this research is to deliver workflow scenarios that can process and geoprocess batch social media data. The research focused on defining useful tasks and sub-tasks to explore and analyze batch social media, and on delivering a prototype able to reproduce the workflow. Two architectural scenarios were identified: one designed for newcomers working on a local machine, and another for more advanced users working in a cloud environment. The local machine scenario was developed to explore a stored data set using a sample of the data, while the more complex scenario explores the complete data set in the cloud with a big data framework such as Spark. A prototype was designed to test the workflow and to achieve reproducibility. To test the prototype, a data set was provided with the intention of searching for tick bite events in the Netherlands. The results showed that, following the workflow, the example data set contains some noisy words, and that processing in the cloud environment was relatively cheap and efficient.
Item Type: Essay (Master)
Faculty: ITC: Faculty of Geo-information Science and Earth Observation
Programme: Geoinformation Science and Earth Observation MSc (75014)
Link to this item: https://purl.utwente.nl/essays/85878