loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Jader Garbelini 1 ; Danilo Sanches 2 ; André Kashiwabara 2 and Aurora Pozo 1

Affiliations: 1 Federal University of Paraná, Curitiba, Brazil ; 2 Federal University of Technology, Cornélio Procópio, Brazil

Keyword(s): Kmers, Motifs, Sequence Analysis, Optimization.

Abstract: Motivation: Finding conserved motifs in DNA sequences is a key problem in bioinformatics. The growing availability of large-scale genomic data poses significant challenges for computational biology, particularly in terms of efficiency in analysis, kmer identification, and noise presence. The detection of conserved motifs and patterns in DNA sequences is determinant for understanding gene functions and regulations. Therefore, it is essential to develop a novel approaches and methods that can handle these large volumes of information and provide accurate and fast results. Results: We present SMT, an innovative tool designed to efficiently store and count kmers, optimizing memory usage and computation time. The application of SMT has also proven effective in discovering motifs in CHIP-SEQ data, allowing the identification of conserved regions in sequences. Furthermore, SMT allows exact searches in constant time proportional to the size of k and retrieves the most abundant kmers through a frequency table. This approach facilitates large-scale data analysis and provides important insights into the conserved properties of biological sequences. The application of SMT in motif discovery demonstrates its potential to drive research in bioinformatics and genomics. Availability and implementation: Supplementary data and results are available to provide additional information and support the conclusions. SMT and source code can be found at the following address: https://github.com/jadermcg/smt. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.22.240.205

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Garbelini, J.; Sanches, D.; Kashiwabara, A. and Pozo, A. (2024). SMT: A High-Performance Approach for Counting Kmers. In Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS; ISBN 978-989-758-688-0; ISSN 2184-4305, SciTePress, pages 545-552. DOI: 10.5220/0012546500003657

@conference{bioinformatics24,
author={Jader Garbelini. and Danilo Sanches. and André Kashiwabara. and Aurora Pozo.},
title={SMT: A High-Performance Approach for Counting Kmers},
booktitle={Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS},
year={2024},
pages={545-552},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012546500003657},
isbn={978-989-758-688-0},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS
TI - SMT: A High-Performance Approach for Counting Kmers
SN - 978-989-758-688-0
IS - 2184-4305
AU - Garbelini, J.
AU - Sanches, D.
AU - Kashiwabara, A.
AU - Pozo, A.
PY - 2024
SP - 545
EP - 552
DO - 10.5220/0012546500003657
PB - SciTePress