Query-Driven Large-Scale Portfolio Aggregate Risk Analysis on MapReduce

Yao, Zhimin

Query-Driven Large-Scale Portfolio Aggregate Risk Analysis on MapReduce

dc.contributor.author	Yao, Zhimin
dc.contributor.copyright-release	Not Applicable	en_US
dc.contributor.degree	Master of Computer Science	en_US
dc.contributor.department	Faculty of Computer Science	en_US
dc.contributor.ethics-approval	Not Applicable	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.graduate-coordinator	Dr. Evangelos E. Milios	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.thesis-reader	Dr. Meng He	en_US
dc.contributor.thesis-reader	Dr. Qigang Gao	en_US
dc.contributor.thesis-supervisor	Dr. Andrew Rau-Chaplin, Dr. Norbert Zeh	en_US
dc.date.accessioned	2014-08-15T18:30:52Z
dc.date.available	2014-08-15T18:30:52Z
dc.date.defence	2014-08-07
dc.date.issued	2014-08-15
dc.description.abstract	Modern reinsurance companies use stochastic simulation techniques for portfolio risk analysis, often referred to as aggregate risk analysis, to support risk management. Their risk portfolios may consist of thousands of reinsurance contracts covering millions of individually insured locations. To quantify risk and to help ensure capital adequacy, each portfolio must be evaluated in large-scaled simulation trials, each capturing a different possible sequence of catastrophic events (e.g., earthquakes, hurricanes, etc.) over the course of a contractual year. In practice, due to the amount of data and computations involved, it is highly attractive to explore high performance parallel computing solutions to accelerate the analysis. In this thesis, we explore the design of a flexible framework, called QuPARA, which exploits parallelism to perform aggregate risk analysis via distributed computing by using the MapReduce programming model. The goal is to provide a flexible framework that can be used by analysts to answer a wide variety of unanticipated but natural ad hoc queries to help them better understand multiple dimensions of risks that can impact portfolio performance and thus company solvency. The QuPARA framework was implemented using Apache Hadoop, Apache Hive, and Pentaho. This prototype allows the user to take advantage of large parallel servers in order to answer ad hoc risk analysis queries efficiently even on large data sets. We also present data structure optimizations and tuning that greatly accelerate QuPARA's computation. The performance of the prototype system is competitive with highly tuned production systems that are only capable of answering a narrow set of portfolio queries, in contrast to the wide range of ad hoc queries QuPARA is able to resolve.	en_US
dc.identifier.uri	http://hdl.handle.net/10222/53798
dc.language.iso	en	en_US
dc.subject	ad hoc risk analytics	en_US
dc.subject	reinsurance risk analysis	en_US
dc.subject	distributed computing	en_US
dc.subject	MapReduce	en_US
dc.subject	Hadoop	en_US
dc.title	Query-Driven Large-Scale Portfolio Aggregate Risk Analysis on MapReduce	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Yao-Zhimin-MCSc-CSCI-August-2014.pdf
Size:: 5.9 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Faculty of Graduate Studies Online Theses