Talk by Ken Yiu on Query Optimization over Cloud Data Market

Info about event

Time

Wednesday 4 May 2016, 14:00 - 15:00

Location

Nygaard-395

Abstract: A data market is an emerging type of cloud service that enables data owners to sell their data sets in a public cloud. Buyers who are interested in a certain data set can access it in the market via a RESTful API. Access to data in a data market is not necessarily free. For example, it costs USD 12 per month to obtain 100 "transactions" from the WorldWide Historical Weather dataset in Windows Azure Data Marketplace, where a transaction is a unit of result size (e.g., a query result of 4400 records consumes 44 transactions, because Windows Azure Data Marketplace limits one transaction to 100 records). In this talk, we present PayLess, a system that helps data buyers optimize their queries so that they can obtain the query results while paying less to the data sellers. Experiments over synthetic data and real data sets in Windows Azure Marketplace show that PayLess can cost-effectively handle SQL query processing over data markets.
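
The transaction arithmetic in the abstract can be illustrated with a minimal sketch. The function and constant names below are hypothetical and are not part of PayLess or of any marketplace API. The sketch also assumes that a partially filled block of 100 records is billed as one full transaction, which the abstract does not state.

```python
# Figures taken from the abstract: one transaction covers up to 100 records,
# and a USD 12 monthly subscription includes 100 transactions.
RECORDS_PER_TRANSACTION = 100
TRANSACTIONS_PER_SUBSCRIPTION = 100


def transactions_consumed(result_size: int,
                          records_per_transaction: int = RECORDS_PER_TRANSACTION) -> int:
    """Return how many transactions a query result of `result_size` records consumes.

    Assumption: a partially filled block of records is rounded up to a
    whole transaction.
    """
    if result_size < 0:
        raise ValueError("result_size must be non-negative")
    # Integer ceiling division, avoiding floating-point rounding.
    return (result_size + records_per_transaction - 1) // records_per_transaction


def remaining_quota(result_size: int,
                    quota: int = TRANSACTIONS_PER_SUBSCRIPTION) -> int:
    """Return the transactions left in the monthly quota after one query."""
    return quota - transactions_consumed(result_size)


if __name__ == "__main__":
    # The example from the abstract: 4400 records consume 44 transactions.
    print(transactions_consumed(4400))
    print(remaining_quota(4400))
```

Under these assumptions, a single query returning 4400 records uses 44 of the 100 transactions in the monthly subscription, which is why reducing the number of records fetched from the market directly reduces what the buyer pays.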