High-performance computing applied to next generation sequencing technologies.
Next generation sequencing (NGS) has shifted the bottleneck in acquiring sequence knowledge from sequencing itself to data interpretation, data integration and experimental design. A few years ago, sequencing was tedious work, and researchers had considerable difficulty managing all the data generated during genome sequencing and microarray analysis.
Today, the rapid technological growth in whole-genome sequencing has created a need for specialized software that can store, handle and extract information from this volume of data as it is produced, so that data does not accumulate unprocessed. This action proposes an exchange of knowledge and expertise on two of today's most important Next Generation Sequencing platforms.
The coordination of experimentalists and data generators on one side, and analysts specialized in producing the technology needed to process the data on the other, creates an ideal study context in which to profile the application domain of NGS and high-performance computing. The computational requirements of large-scale data generation and treatment, such as storage, data access, integration with other data sources, and memory and CPU time consumption during processing, are among the questions addressed within the frame of this bilateral collaboration action. This action will coordinate different research groups, each aiming at data treatment at a different level.
The experimental side will provide information about data production (e.g. storage formats and production levels); the end-user groups will define the analysis and exploitation framework in the application domain. Our technological group will propose guidelines for the algorithmic and parallel solutions needed to meet these demands.