M.Sc. Tezi Görüntüleme

Student: Shafiei Abdi Suleiman
Supervisor: Asst. Prof. Dr. İbrahim SAVRAN
Department: Bilgisayar Mühendisliği
Institution: Graduate School of Natural and Applied Sciences
University: Karadeniz Technical University Turkey
Title of the Thesis: ACCELERATING GENE IDENTIFICATION IN DNA SEQUENCES WITH CUDA AND OPENCL
Level: M.Sc.
Acceptance Date: 8/8/2019
Number of Pages: 37
Registration Number: i3664
Summary:

      Metagenome is the genomic data obtained from the environment. It may contain hundreds or even millions of different species. The current generation sequencing devices can produce billions of sequences at once. Applications such as MGC and Orphelia were initially used for the detection of the genes within those sequences that have been produced by the genetic sequencing devices are now insufficient. Such applications that are not equipped to process billions of sequences could take several days for the gene identification once the sequence file is divided into small divisions up to 20K sequence. In this work, the use of Metagenome Gene Caller (MGC) application for the development of metagenome gene detection has been used. The core functions have been prepared for the OpenCL and CUDA platforms through the appropriate utilization of the data transformations. Furthermore, these core functions were run through the GPU and the results are debated accordingly. The entire gene detection method has been reduced from several days to a few minutes through data structure simplification.

       Key Words: Computational Genomics; Next generation Technology; Bioinformatics; Metagenome Gene Caller-MGC; High Performance Computing; MetaGene; Orphelia; FragGeneScan; CUDA; OpenCL, Graphic Processor Unit.