Ponències/Comunicacions de congressos

Ponències/Comunicacions de congressos http://hdl.handle.net/2117/3095 2024-04-18T09:54:28Z Polynomial calculus for MaxSAT http://hdl.handle.net/2117/405497 Polynomial calculus for MaxSAT Bonacina, Ilario; Bonet Carbonell, M. Luisa; Levy Díaz, Jordi MaxSAT is the problem of finding an assignment satisfying the maximum number of clauses in a CNF formula. We consider a natural generalization of this problem to generic sets of polynomials and propose a weighted version of Polynomial Calculus to address this problem. Weighted Polynomial Calculus is a natural generalization of MaxSAT-Resolution and weighted Resolution that manipulates polynomials with coefficients in a finite field and either weights in N or Z. We show the soundness and completeness of these systems via an algorithmic procedure. Weighted Polynomial Calculus, with weights in N and coefficients in F2, is able to prove efficiently that Tseitin formulas on a connected graph are minimally unsatisfiable. Using weights in Z, it also proves efficiently that the Pigeonhole Principle is minimally unsatisfiable. 2024-03-28T10:32:40Z Bonacina, Ilario Bonet Carbonell, M. Luisa Levy Díaz, Jordi MaxSAT is the problem of finding an assignment satisfying the maximum number of clauses in a CNF formula. We consider a natural generalization of this problem to generic sets of polynomials and propose a weighted version of Polynomial Calculus to address this problem. Weighted Polynomial Calculus is a natural generalization of MaxSAT-Resolution and weighted Resolution that manipulates polynomials with coefficients in a finite field and either weights in N or Z. We show the soundness and completeness of these systems via an algorithmic procedure. Weighted Polynomial Calculus, with weights in N and coefficients in F2, is able to prove efficiently that Tseitin formulas on a connected graph are minimally unsatisfiable. Using weights in Z, it also proves efficiently that the Pigeonhole Principle is minimally unsatisfiable. GMX: Instruction set extensions for fast, scalable, and efficient genome sequence alignment http://hdl.handle.net/2117/405488 GMX: Instruction set extensions for fast, scalable, and efficient genome sequence alignment Doblas Font, Max; Lostes Cazorla, Oscar; Aguado Puig, Quim; Cebry, Nicholas; Fontova Muste, Pau; Batten, Christopher; Marco Sola, Santiago; Moretó Planas, Miquel Sequence alignment remains a fundamental problem in computer science with practical applications ranging from pattern matching to computational biology. The ever-increasing volumes of genomic data produced by modern DNA sequencers motivate improved software and hardware sequence alignment accelerators that scale with longer sequence lengths and high error rates without losing accuracy. Furthermore, the wide variety of use cases requiring sequence alignment demands flexible and efficient solutions that can match or even outperform expensive application-specific accelerators. To address these challenges, we propose GMX, a set of ISA extensions that enable efficient sequence alignment computations based on dynamic programming (DP). GMX extensions provide the basic building-block operations to perform fast tile-wise computations of the DP matrix, reducing the memory footprint and allowing easy integration into widely-used algorithms and tools. Furthermore, we provide an efficient hardware implementation that integrates GMX extensions in a RISC-V-based edge system-on-chip (SoC). Compared to widely-used software implementations, our hardware-software co-design leveraging GMX extensions obtains speed-ups from 25–265 ×, scaling to megabyte-long sequences. Compared to domain-specific accelerators (DSA), we demonstrate that GMX-accelerated implementations demand significantly less memory bandwidth, requiring less area per processing element (PE). As a result, a single GMX-enabled core achieves a throughput per area between 0.35-0.52 × that of state-of-the-art DSAs while being more flexible and reusing the core’s resources. Post-place-and-route results for a GMX-enhanced SoC in 22nm technology shows that GMX extensions only account for 1.7% of the overall area while consuming just 8.47mW. We conclude that GMX extensions represent versatile and scalable ISA additions to improve the performance of genome analysis tools and other use cases that require fast and efficient sequence alignment. 2024-03-28T07:34:55Z Doblas Font, Max Lostes Cazorla, Oscar Aguado Puig, Quim Cebry, Nicholas Fontova Muste, Pau Batten, Christopher Marco Sola, Santiago Moretó Planas, Miquel Sequence alignment remains a fundamental problem in computer science with practical applications ranging from pattern matching to computational biology. The ever-increasing volumes of genomic data produced by modern DNA sequencers motivate improved software and hardware sequence alignment accelerators that scale with longer sequence lengths and high error rates without losing accuracy. Furthermore, the wide variety of use cases requiring sequence alignment demands flexible and efficient solutions that can match or even outperform expensive application-specific accelerators. To address these challenges, we propose GMX, a set of ISA extensions that enable efficient sequence alignment computations based on dynamic programming (DP). GMX extensions provide the basic building-block operations to perform fast tile-wise computations of the DP matrix, reducing the memory footprint and allowing easy integration into widely-used algorithms and tools. Furthermore, we provide an efficient hardware implementation that integrates GMX extensions in a RISC-V-based edge system-on-chip (SoC). Compared to widely-used software implementations, our hardware-software co-design leveraging GMX extensions obtains speed-ups from 25–265 ×, scaling to megabyte-long sequences. Compared to domain-specific accelerators (DSA), we demonstrate that GMX-accelerated implementations demand significantly less memory bandwidth, requiring less area per processing element (PE). As a result, a single GMX-enabled core achieves a throughput per area between 0.35-0.52 × that of state-of-the-art DSAs while being more flexible and reusing the core’s resources. Post-place-and-route results for a GMX-enhanced SoC in 22nm technology shows that GMX extensions only account for 1.7% of the overall area while consuming just 8.47mW. We conclude that GMX extensions represent versatile and scalable ISA additions to improve the performance of genome analysis tools and other use cases that require fast and efficient sequence alignment. The K-Robinson Foulds measures for labeled trees http://hdl.handle.net/2117/405052 The K-Robinson Foulds measures for labeled trees Khayatian, Elahe; Valiente Feruglio, Gabriel Alejandro; Zhang, Louxin Investigating the mutational history of tumor cells is important for understanding the underlying mechanisms of cancer and its evolution. Now that the evolution of tumor cells is modeled using labeled trees, researchers are motivated to propose different measures for the comparison of mutation trees and other labeled trees. While the Robinson-Foulds distance is widely used for the comparison of phylogenetic trees, it has weaknesses when it is applied to labeled trees. Here, k-Robinson-Foulds dissimilarity measures are introduced for labeled tree comparison. 2024-03-21T07:49:22Z Khayatian, Elahe Valiente Feruglio, Gabriel Alejandro Zhang, Louxin Investigating the mutational history of tumor cells is important for understanding the underlying mechanisms of cancer and its evolution. Now that the evolution of tumor cells is modeled using labeled trees, researchers are motivated to propose different measures for the comparison of mutation trees and other labeled trees. While the Robinson-Foulds distance is widely used for the comparison of phylogenetic trees, it has weaknesses when it is applied to labeled trees. Here, k-Robinson-Foulds dissimilarity measures are introduced for labeled tree comparison. On the consistency of circuit lower bounds for non-deterministic time http://hdl.handle.net/2117/403574 On the consistency of circuit lower bounds for non-deterministic time Atserias, Albert; Buss, Sam; Müller, Moritz We prove the first unconditional consistency result for superpolynomial circuit lower bounds with a relatively strong theory of bounded arithmetic. Namely, we show that the theory ‍V20 is consistent with the conjecture that ‍NEXP ‍⊈ ‍P/poly, i.e., some problem that is solvable in non-deterministic exponential time does not have polynomial size circuits. We suggest this is the best currently available evidence for the truth of the conjecture. Additionally, we establish a magnification result on the hardness of proving circuit lower bounds. 2024-03-01T09:03:15Z Atserias, Albert Buss, Sam Müller, Moritz We prove the first unconditional consistency result for superpolynomial circuit lower bounds with a relatively strong theory of bounded arithmetic. Namely, we show that the theory ‍V20 is consistent with the conjecture that ‍NEXP ‍⊈ ‍P/poly, i.e., some problem that is solvable in non-deterministic exponential time does not have polynomial size circuits. We suggest this is the best currently available evidence for the truth of the conjecture. Additionally, we establish a magnification result on the hardness of proving circuit lower bounds. WFAsic: A high-performance ASIC accelerator for DNA sequence alignment on a RISC-V SoC http://hdl.handle.net/2117/402007 WFAsic: A high-performance ASIC accelerator for DNA sequence alignment on a RISC-V SoC Haghi, Abbas; Álvarez Martí, Lluc; Fornt Mas, Jordi; Haro Ruiz, Juan Miguel de; Figueras Bagué, Roger; Doblas Font, Max; Marco Sola, Santiago; Moretó Planas, Miquel The ever-increasing yields in genome sequence data production pose a computational challenge to current genome sequence analysis tools, jeopardizing the future of personalized medicine. Leveraging hardware accelerators (GPUs, FPGAs, and ASICs) to accelerate computationally-intensive algorithms like sequence alignment has become paramount. Recently, the wavefront alignment algorithm was introduced, significantly reducing the execution time to perform sequence alignment. This paper presents the first-ever ASIC accelerator of the WFA integrated into a RISC-V system-on-chip. Our designed chip greatly accelerates sequence alignment, delivering up to 1076 × better performance over the CPU implementation of the WFA running on the RISC-V core of the chip. 2024-02-15T12:23:48Z Haghi, Abbas Álvarez Martí, Lluc Fornt Mas, Jordi Haro Ruiz, Juan Miguel de Figueras Bagué, Roger Doblas Font, Max Marco Sola, Santiago Moretó Planas, Miquel The ever-increasing yields in genome sequence data production pose a computational challenge to current genome sequence analysis tools, jeopardizing the future of personalized medicine. Leveraging hardware accelerators (GPUs, FPGAs, and ASICs) to accelerate computationally-intensive algorithms like sequence alignment has become paramount. Recently, the wavefront alignment algorithm was introduced, significantly reducing the execution time to perform sequence alignment. This paper presents the first-ever ASIC accelerator of the WFA integrated into a RISC-V system-on-chip. Our designed chip greatly accelerates sequence alignment, delivering up to 1076 × better performance over the CPU implementation of the WFA running on the RISC-V core of the chip. An aggregation rule under uncertainty http://hdl.handle.net/2117/399722 An aggregation rule under uncertainty Freixas Bosch, Josep Many decision-making situations require the evaluation of several agents or judges. In a situation where agents evaluate candidates, the question arises of how best to aggregate evaluations so as to compare the candidates. The aim of this work is to propose a method of aggregating the evaluations of the agents, which has outstanding properties and becomes a potential evaluative tool in many contexts. The proposed rule is useful even when the agents who evaluate candidates are not the same. As as example, just to remark that it is an ideal tool to rank restaurants or movies on designed websites. 2024-01-17T17:33:19Z Freixas Bosch, Josep Many decision-making situations require the evaluation of several agents or judges. In a situation where agents evaluate candidates, the question arises of how best to aggregate evaluations so as to compare the candidates. The aim of this work is to propose a method of aggregating the evaluations of the agents, which has outstanding properties and becomes a potential evaluative tool in many contexts. The proposed rule is useful even when the agents who evaluate candidates are not the same. As as example, just to remark that it is an ideal tool to rank restaurants or movies on designed websites. Markov chains applied to Parrondo’s paradox: the coin tossing problem http://hdl.handle.net/2117/387141 Markov chains applied to Parrondo’s paradox: the coin tossing problem Molinero Albareda, Xavier; Mégnien, Camille Parrondo’s paradox was introduced by Juan Parrondo in 1996. In game theory, this paradox is described as: A combination of losing strategies becomes a winning strategy. At first glance, this paradox is quite surprising, but we can easily explain it by using simulations and mathematical arguments. Indeed, we first consider some examples with the Parrondo’s paradox and, using the software R, we simulate one of them, the coin tossing. Actually, we see that specific combinations of losing games become a winning game. Moreover, even a random combination of these two losing games leads to a winning game. Later, we introduce the major definitions and theorems over Markov chains to study our Parrondo’s paradox applied to the coin tossing problem. In particular, we represent our Parrondo’s game as a Markov chain and we find its stationary distribution. In that way, we exhibit that our combination of two losing games is truly a winning combination. We also deliberate possible applications of the paradox in some fields such as ecology, biology, finance or reliability theory. 2023-05-05T11:01:41Z Molinero Albareda, Xavier Mégnien, Camille Parrondo’s paradox was introduced by Juan Parrondo in 1996. In game theory, this paradox is described as: A combination of losing strategies becomes a winning strategy. At first glance, this paradox is quite surprising, but we can easily explain it by using simulations and mathematical arguments. Indeed, we first consider some examples with the Parrondo’s paradox and, using the software R, we simulate one of them, the coin tossing. Actually, we see that specific combinations of losing games become a winning game. Moreover, even a random combination of these two losing games leads to a winning game. Later, we introduce the major definitions and theorems over Markov chains to study our Parrondo’s paradox applied to the coin tossing problem. In particular, we represent our Parrondo’s game as a Markov chain and we find its stationary distribution. In that way, we exhibit that our combination of two losing games is truly a winning combination. We also deliberate possible applications of the paradox in some fields such as ecology, biology, finance or reliability theory. How can graph databases and reasoning be combined and integrated? http://hdl.handle.net/2117/386349 How can graph databases and reasoning be combined and integrated? Pasarella Sánchez, Ana Edelmira Nowadays the graph data model has been accepted as one of the most suitable data models to formalize relationships among entities of many domains. Deductive databases based on the Datalog language have been used to deduce new information from large amounts of data. Most of the attempts to combine logic and graph databases are based on translating knowledge in graph databases into Datalog and then use its inference engine. We aim to open the discussion about combining graph databases and a graph-oriented logic to define «native» deductive graph databases. This is, graph databases equipped with an inference mechanism based on graph based logic. To be concrete, we plan to use the recently introduced graph navigational logic. 2023-04-18T13:27:11Z Pasarella Sánchez, Ana Edelmira Nowadays the graph data model has been accepted as one of the most suitable data models to formalize relationships among entities of many domains. Deductive databases based on the Datalog language have been used to deduce new information from large amounts of data. Most of the attempts to combine logic and graph databases are based on translating knowledge in graph databases into Datalog and then use its inference engine. We aim to open the discussion about combining graph databases and a graph-oriented logic to define «native» deductive graph databases. This is, graph databases equipped with an inference mechanism based on graph based logic. To be concrete, we plan to use the recently introduced graph navigational logic. Decomposition of transition systems into sets of synchronizing Free-choice Petri Nets http://hdl.handle.net/2117/385796 Decomposition of transition systems into sets of synchronizing Free-choice Petri Nets Teren, Viktor; Cortadella, Jordi; Villa, Tiziano Petri nets and transition systems are two important formalisms used for modeling concurrent systems. One interesting problem in this domain is the creation of a Petri net with a reachability graph equivalent to a given transition system. This paper focuses on the creation of a set of synchronizing Free-choice Petri nets (FCPNs) from a transition system. FCPNs are more amenable for visualization and structural analysis while not being excessively simple, as in the case of state machines. The results show that with a small set of FCPNs, the complexity of the model can be reduced when compared to the synthesis of a monolithic Petri net. 2023-03-30T11:53:45Z Teren, Viktor Cortadella, Jordi Villa, Tiziano Petri nets and transition systems are two important formalisms used for modeling concurrent systems. One interesting problem in this domain is the creation of a Petri net with a reachability graph equivalent to a given transition system. This paper focuses on the creation of a set of synchronizing Free-choice Petri nets (FCPNs) from a transition system. FCPNs are more amenable for visualization and structural analysis while not being excessively simple, as in the case of state machines. The results show that with a small set of FCPNs, the complexity of the model can be reduced when compared to the synthesis of a monolithic Petri net. Improved reconstruction of random geometric graphs http://hdl.handle.net/2117/385380 Improved reconstruction of random geometric graphs Dani, Varsha; Díaz Cort, Josep; Hayes, Thomas P.; Moore, Cristopher Embedding graphs in a geographical or latent space, i.e. inferring locations for vertices in Euclidean space or on a smooth manifold or submanifold, is a common task in network analysis, statistical inference, and graph visualization. We consider the classic model of random geometric graphs where n points are scattered uniformly in a square of area n, and two points have an edge between them if and only if their Euclidean distance is less than r. The reconstruction problem then consists of T A inferring the vertex positions, up to the symmetries of the square, given only the adjacency matrix of the resulting graph. We give an algorithm that, if r = nα for α > 0, with high probability reconstructs the vertex positions with a maximum error of O(nβ) where β = 1/2 − (4/3)α, until α ≥3/8 where β =0 and the error becomes O(√ log n). This improves over earlier results, which E were unable to reconstruct with error less than r. Our method estimates Euclidean distances using a hybrid of graph distances and short-range estimates based on the number of common neighbors. We extend our results to the surface of the sphere in R3 and to hypercubes in any constant dimension. 2023-03-23T12:56:36Z Dani, Varsha Díaz Cort, Josep Hayes, Thomas P. Moore, Cristopher Embedding graphs in a geographical or latent space, i.e. inferring locations for vertices in Euclidean space or on a smooth manifold or submanifold, is a common task in network analysis, statistical inference, and graph visualization. We consider the classic model of random geometric graphs where n points are scattered uniformly in a square of area n, and two points have an edge between them if and only if their Euclidean distance is less than r. The reconstruction problem then consists of T A inferring the vertex positions, up to the symmetries of the square, given only the adjacency matrix of the resulting graph. We give an algorithm that, if r = nα for α > 0, with high probability reconstructs the vertex positions with a maximum error of O(nβ) where β = 1/2 − (4/3)α, until α ≥3/8 where β =0 and the error becomes O(√ log n). This improves over earlier results, which E were unable to reconstruct with error less than r. Our method estimates Euclidean distances using a hybrid of graph distances and short-range estimates based on the number of common neighbors. We extend our results to the surface of the sphere in R3 and to hypercubes in any constant dimension.