Phusion2 is a pipeline for de novo genome assembly using NGS data. It is based upon a strategy called read clustering. Starting with kmer frequency analysis, this allows for a reasonable selection of the kmer sizes. K-tuples from raw reads are merged and sorted into a table so that multiple occurring kmer words shared by different reads can be linked.  A relation matrix is used to record the shared kmer words among all the reads. Setting a minimum threshold of shared k-tuples, the whole set of reads can then be clustered into groups using kmer sharing information in the relational matrix. After obtaining small read clusters with a controllable size, a local assembler can be used to produce contigs.

Project Activity

See All Activity >

Follow Phusion2

Phusion2 Web Site

Other Useful Business Software
One platform for marketing tools and customer data. Icon
One platform for marketing tools and customer data.

Loved by Marketing teams, trusted by Data teams

Discover the all-in-one data platform which makes it easy to collect, enrich, segment, and activate your customer data in all your business platform.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Phusion2!

Additional Project Details

Registered

2014-08-21