Integration of the WebMonitoring software into Internet Archive’s Wayback Machine (Bachelor thesis)

Τσαλαμπούνας, Φώτιος


This thesis presents the work the author did for the Internet Archive during Google Summer of Code 2018. Specifically, it discusses the development of a ReactJS front-end application called Wayback-diff and a Python back-end application called Wayback-discover-diff. Wayback-diff is an application that allows users to compare two snapshots from the Wayback Machine side by side. Wayback-discover-diff calculates the SimHash values of webpage snapshots, providing a measure of how much a webpage has changed over time. Information about the tools, frameworks, and libraries used for their development, as well as an algorithmic explanation of the applications’ code, can be found in chapters two and three. Chapter three also contains an analysis of Wayback-discover-diff’s runtime, the actions that were taken to improve it, and a statistical analysis of those improvements. Finally, chapter four describes how the applications were integrated into the Wayback Machine, and chapter five discusses future improvements.
Alternative title / Subtitle: The development of wayback-diff and wayback-discover-diff
Institution and School/Department of submitter: School of Technological Applications. Department of Informatics Engineering
Keywords: Open source licenses; Google Open Source; Internet Archive; Callback functions; React Router; Components
Description: Bachelor thesis - School of Technological Applications - Department of Informatics Engineering, 2019 (serial no. 11457)
URI: http://195.251.240.227/jspui/handle/123456789/14739
Appears in Collections: Bachelor Theses

Files in This Item:
File: Tsalampounis.pdf
Description: Bachelor thesis
Size: 1.76 MB
Format: Adobe PDF



