PD-APCn: Pattern-directed aligned pattern clustering of bio-sequences

Summary of the technology

Background

Identifying functional segments or regions in bio-sequences is a major challenge in bioinformatics. Functional segments of a bio-sequence can reveal folded structure, physicochemical functionality and mutation hotspots. This knowledge improves the understanding of biological mechanisms, directs the design of new drugs and supports the discovery of cures for genetic diseases. Despite the explosive growth of sequence data, effective, accurate and scalable methods are still lacking. Existing methods such as MEME and GLAM2 cannot capture frameshift and rare mutations.

University of Waterloo

Description of the invention

University of Waterloo (UW) researchers have developed novel software that uses a systematic process to align pattern clusters of bio-sequence families and thereby identify functional regions. The software adaptively determines pattern width and mutation spots without exhaustive search and without explicit prior knowledge or clues. As well as discovering new patterns with strong statistical support, it spots rare mutational patterns with minor substitutions and frameshifts (insertions and deletions). This is of considerable importance for personalized medicine, gene therapy/marker and drug research.
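To make the idea of grouping a rare variant with a dominant pattern more concrete, here is a minimal Python sketch. It is not the PD-APCn algorithm, whose details this offer does not give. It is a toy stand-in that assumes fixed candidate patterns and a simple edit-distance threshold, and the names `edit_distance`, `cluster_around_seed` and `max_dist` are hypothetical.

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: each substitution, insertion or deletion costs 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cluster_around_seed(patterns, max_dist=1):
    """Toy clustering (an assumption, not the PD-APCn method):
    take the most frequent pattern as the seed and keep every pattern
    within max_dist edits of it, together with its occurrence count."""
    counts = Counter(patterns)
    seed, _ = counts.most_common(1)[0]
    cluster = {p: n for p, n in counts.items()
               if edit_distance(seed, p) <= max_dist}
    return seed, cluster


# Made-up example data: one dominant pattern, one substitution variant,
# one deletion (frameshift-like) variant and one unrelated pattern.
observed = ["GHKLE"] * 5 + ["GHRLE", "GHLE", "WWWWW"]
seed, cluster = cluster_around_seed(observed)
print(seed, cluster)
```

In this toy data the rare variants `GHRLE` (one substitution) and `GHLE` (one deletion) end up in the same cluster as the dominant pattern `GHKLE`, while the unrelated `WWWWW` is left out. The software described above determines widths and mutation spots adaptively, which this fixed-threshold sketch does not attempt.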

Advantages

  • Allows variable pattern length
  • Capable of identifying mutations and rare mutations (Fig. 2)
  • Fast (400X speed-up compared with MEME), accurate, and precise in locating patterns
  • Does not need parameter pruning (unlike MEME and GLAM2)
  • No explicit prior knowledge needed
  • Compatible with hardware acceleration/multitasking

From the aligned pattern clusters (APCs) it discovers, the software can disentangle the patterns within each APC to reveal subgroup characteristics in specific statistical or functional spaces, with or without class labels.

Potential applications

PD-APCn software can be used in drug discovery, personalized medicine, and gene therapy/marker research for:

  • Bio-sequence functional region identification
  • Residue-residue and protein-protein interaction prediction
  • Protein-DNA binding core discovery
  • Druggable site discovery

Related Keywords

  • Artificial Intelligence (AI)
  • Software Technologies

About University of Waterloo

The University of Waterloo, renowned for its innovative spirit and co-op education model, excels in cutting-edge research across diverse fields, including advanced manufacturing, artificial intelligence, sustainability, and health technologies.
