Unsupervised brand name extraction using domain adaptation

dc.contributor.advisorMakrehchi, Masoud
dc.contributor.advisorRahnamayan, Shahryar
dc.contributor.authorTowhidi, Afsaneh
dc.date.accessioned2019-10-17T16:25:07Z
dc.date.accessioned2022-03-29T16:49:39Z
dc.date.available2019-10-17T16:25:07Z
dc.date.available2022-03-29T16:49:39Z
dc.date.issued2019-08-01
dc.degree.disciplineElectrical and Computer Engineering
dc.degree.levelMaster of Applied Science (MASc)
dc.description.abstractBusiness intelligence and analytics is an area of research that analyzes the existing business data to extract the insights needed for a successful business planning. Textual data derived from tweets, forum posts, and blogs are from different business domains, and contain useful information for the organizations. This thesis proposes a method for extracting brand and product names from text; brand names as a subset of named entities can give a great deal of information about the whole document. In this thesis, a context window is defined to capture the context of a word in a sentence. In addition, a word embedding model is locally trained to have a domain specific model and finally, a domain adaptation technique is employed to transfer the knowledge from one domain with labeled data to a new domain. The results indicate a significant improvement in recall measure for extracting brand names from a new domain.en
dc.description.sponsorshipUniversity of Ontario Institute of Technologyen
dc.identifier.urihttps://hdl.handle.net/10155/1079
dc.language.isoenen
dc.subjectNatural language processingen
dc.subjectNamed entity recognitionen
dc.subjectWord embeddingen
dc.subjectDomain adaptationen
dc.titleUnsupervised brand name extraction using domain adaptationen
dc.typeThesisen
thesis.degree.disciplineElectrical and Computer Engineering
thesis.degree.grantorUniversity of Ontario Institute of Technology
thesis.degree.nameMaster of Applied Science (MASc)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Towhidi_Afsaneh.pdf
Size:
880.27 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.61 KB
Format:
Plain Text
Description: