dc.contributor.author | Kolici, Vladi |
dc.contributor.author | Xhafa Xhafa, Fatos |
dc.contributor.author | Barolli, Leonard |
dc.contributor.author | Lala, Algenti |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.date.accessioned | 2017-06-05T13:48:00Z |
dc.date.available | 2017-06-05T13:48:00Z |
dc.date.issued | 2014 |
dc.identifier.citation | Kolici, V., Xhafa, F., Barolli, L., Lala, A. Scalability, memory issues and challenges in mining large data sets. In: International Conference on Intelligent Networking and Collaborative Systems. "2014 International Conference on Intelligent Networking and Collaborative Systems: IEEE INCoS 2014: 10–12 September 2014, University of Salerno, Salerno, Italy: proceedings". Salerno: Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 268-273. |
dc.identifier.isbn | 978-1-4799-6386-7 |
dc.identifier.uri | http://hdl.handle.net/2117/105131 |
dc.description | (c) 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. |
dc.description.abstract | Data mining is an active field of research and development that aims to automatically extract "knowledge" from data sets. Knowledge can be defined in different ways, such as discovering (structured, frequent, approximate, etc.) patterns in data, grouping/clustering/bi-clustering data according to one or more criteria, finding association rules, etc. Such knowledge is then fed back to decision support systems, enabling end-users (actors) to make more informed decisions, which in economic terms could lead to advantages over traditional decision support systems. It should be noted, however, that data mining algorithms and frameworks were proposed prior to the "Big Data" explosion. While data mining algorithms have treated efficiency and computational complexity as important requirements, they did not take into account features of Big Data such as very large size, the velocity with which data is generated, variety, etc. These features are now posing issues and challenges for data mining algorithms and frameworks. In this paper we analyse some of the issues in mining large data sets, such as scalability and in-memory needs. We also show some computational results that point to such issues. |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | UPC subject areas::Computer science::Information systems |
dc.subject.lcsh | Data mining |
dc.subject.other | Data Mining |
dc.subject.other | Distributed Data Mining |
dc.subject.other | Hadoop |
dc.subject.other | Large Data Sets |
dc.subject.other | Map Reduce |
dc.subject.other | Memory |
dc.subject.other | Scalability |
dc.title | Scalability, memory issues and challenges in mining large data sets |
dc.type | Conference report |
dc.subject.lemac | Data mining |
dc.identifier.doi | 10.1109/INCoS.2014.50 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7057101 |
dc.rights.access | Open Access |
local.identifier.drac | 17839533 |
dc.description.version | Postprint (author's final draft) |
local.citation.author | Kolici, V.; Xhafa, F.; Barolli, L.; Lala, A. |
local.citation.contributor | International Conference on Intelligent Networking and Collaborative Systems |
local.citation.pubplace | Salerno |
local.citation.publicationName | 2014 International Conference on Intelligent Networking and Collaborative Systems: IEEE INCoS 2014: 10–12 September 2014, University of Salerno, Salerno, Italy: proceedings |
local.citation.startingPage | 268 |
local.citation.endingPage | 273 |