Data curation with ontology functional dependences

dc.contributor.advisorSzlichta, Jaroslaw
dc.contributor.authorKeller, Alexander
dc.date.accessioned2017-08-03T19:59:40Z
dc.date.accessioned2022-03-29T17:39:20Z
dc.date.available2017-08-03T19:59:40Z
dc.date.available2022-03-29T17:39:20Z
dc.date.issued2017-04-01
dc.degree.disciplineComputer Science
dc.degree.levelMaster of Science (MSc)
dc.description.abstractPoor data quality has become a pervasive issue due to the increasing complexity and size of modern datasets. Functional dependencies have been used in existing cleaning solutions to model syntactic equivalence. They are not able to model semantic equivelence, however. We advance the state of data quality constraints by defining, discovering, and cleaning Ontology Functional Dependencies. We define their theoretical foundations, including sound and complete axioms, and linear inference procedure. We develop algorithms for data verification, constraint discovery, data cleaning, ontology versus data inconsistency identification, and optimizations to each. Our experimental evaluation shows the scalability and accuracy of our algorithms. We show that ontology FDs are useful to capture domain attribute relationships, and can significantly reduce the number of false positive errors in data cleaning techniques that rely on traditional FDs.en
dc.description.sponsorshipUniversity of Ontario Institute of Technologyen
dc.identifier.urihttps://hdl.handle.net/10155/792
dc.language.isoenen
dc.subjectConstraintsen
dc.subjectDataen
dc.subjectQualityen
dc.subjectCleaningen
dc.subjectDiscoveryen
dc.titleData curation with ontology functional dependencesen
dc.typeThesisen
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Ontario Institute of Technology
thesis.degree.nameMaster of Science (MSc)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Keller_Alexander.pdf
Size:
404 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Plain Text
Description: