
dc.contributor.author: Hernandez, Roger
dc.contributor.author: Becerra Fontal, Yolanda
dc.contributor.author: Torres Viñals, Jordi
dc.contributor.author: Ayguadé Parra, Eduard
dc.date.accessioned: 2015-06-03T11:47:56Z
dc.date.available: 2015-06-03T11:47:56Z
dc.date.issued: 2015-05-05
dc.identifier.citation: Hernandez, Roger [et al.]. Automatic query driven data modelling in Cassandra. A: "BSC Doctoral Symposium (2nd: 2015: Barcelona)". 2nd ed. Barcelona: Barcelona Supercomputing Center, 2015, p. 114.
dc.identifier.uri: http://hdl.handle.net/2099/16571
dc.description.abstract: Non-relational databases have recently been the preferred choice for dealing with Big Data challenges, but their performance is very sensitive to the chosen data organisation. We have seen differences of over 70 times in response time for the same query on different models. Users therefore need to be fully aware of the queries they intend to serve when designing their data model. The common practice, then, is to replicate data into different models designed to fit different query requirements. In this scenario, the user is responsible for implementing the code required to keep the different data replicas consistent. Manually replicating data at such high layers of the database wastes a lot of storage, because the underlying system already applies its own replication mechanisms, which were originally designed for availability and reliability. We propose and design a mechanism and a prototype that provide users with transparent management, where queries are matched with a well-performing model option. Additionally, we propose to do so by transforming the replication mechanism into a heterogeneous one, in order to avoid wasting disk space while keeping the availability and reliability features. The result is a system where, regardless of the query or model the user specifies, the response time will always be that of an affine query.
dc.format.extent: 1 p.
dc.language.iso: eng
dc.publisher: Barcelona Supercomputing Center
dc.relation.ispartof: BSC Doctoral Symposium (2nd: 2015: Barcelona)
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject: Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject: Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació::Emmagatzematge i recuperació de la informació
dc.subject.lcsh: High performance computing
dc.subject.lcsh: Supercomputers
dc.subject.lcsh: Database management
dc.title: Automatic query driven data modelling in Cassandra
dc.type: Conference report
dc.subject.lemac: Càlcul intensiu (Informàtica)
dc.subject.lemac: Supercomputadors
dc.subject.lemac: Bases de dades -- Gestió
dc.rights.access: Open Access
local.identifier.drac: 25532693
local.citation.author: Hernandez, Roger; Becerra Fontal, Yolanda; Torres Viñals, Jordi; Ayguadé Parra, Eduard
local.citation.pubplace: Barcelona
local.citation.publicationName: BSC Doctoral Symposium (2nd: 2015: Barcelona)
local.citation.startingPage: 114
local.citation.endingPage: 114
local.citation.edition: 2nd

