Repository logo
 

Predicting Political Donations Using Data Driven Lifestyle Profiles Generated from Character N-Gram Analysis of Heterogeneous Online Sources

dc.contributor.authorConrad, Colin
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeMaster of Electronic Commerceen_US
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorDr. Evangelos Miliosen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.thesis-readerDr. Peter Bodoriken_US
dc.contributor.thesis-readerDr. C. Edward Leachen_US
dc.contributor.thesis-supervisorDr. Vlado Keseljen_US
dc.date.accessioned2015-08-20T17:45:08Z
dc.date.available2015-08-20T17:45:08Z
dc.date.defence2015-08-07
dc.date.issued2015
dc.description.abstractThis paper describes an approach for generating multi-dimensional Activities, Interests, and Opinions (AIO) insights from disparate web sources. The method involves identifying psychographic profiles using text analysis of social media data. The approach is tested on tweets from 438 Twitter profiles, 219 of which are integrated with filing records from the United States Federal Election Commission, 219 others were used for control. Profiles were matched using demographic criteria and analyzed using political parties and donation values as labels. Standard probabilistic, entropy and kernel based approaches are used to make predictions based on word n-grams, while the CNG technique is explored as an alternative. Using CNG two predictive models were created that were able to exceed benchmarks extracted from the literature. Using these models, we are able to demonstrate a method for generating qualitative psychographic profiles, which can in turn be used to label customers for marketing insight.en_US
dc.identifier.urihttp://hdl.handle.net/10222/60748
dc.language.isoenen_US
dc.subjectpsychographicen_US
dc.subjectmarket segmentationen_US
dc.subjecttwitteren_US
dc.subjectdata miningen_US
dc.subjecttext miningen_US
dc.subjectmachine learningen_US
dc.subjectpolitical scienceen_US
dc.titlePredicting Political Donations Using Data Driven Lifestyle Profiles Generated from Character N-Gram Analysis of Heterogeneous Online Sourcesen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Conrad-Colin-MEC-ECMM-August-2015.pdf
Size:
466.35 KB
Format:
Adobe Portable Document Format
Description:
Thesis text document

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: