Navigation
What is MapSplice?
How does MapSplice work?
What's new in MapSplice 2?
System requirements
Obtaining & Building MapSplice 2
Command Line
Sample Dataset
Output of MapSplice 2
What is MapSplice?
MapSplice is a software for mapping RNA-seq read to reference genome for splice junction discovery.
- It depends only on reference genome, and not on any further annotations
- It supports both paired-end reads and single-end reads, and utilizes the advantage of pair-end read for better mapping accuracy
It supports variable length reads
It aligns unspliced and spliced alignments simultaneous - It detects:
- novel canonical, semi-canonical and non-canonical splice junctions
- novel insertions and deletions
- novel gene fusion events
How does MapSplice work?
MapSplice first splits reads into segments, and maps them to reference genome by using Bowtie. Then for unmapped segements, MapSplice tries to fix it as gapped alignments, with each gap corresponding to a splice junction. And later a remapping step is used to identify spliced alignments that are in the presence of small exons. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy.
What's new in MapSplice 2?
- MapSplice 2 improved mapping sensitiviy.
- MapSplice 2 now supports multi-thread, dramatically improves the running time on multi-core system.
- MapSplice 2 now supports variable length reads.
- MapSplice 2 is optimized for repeats.
- All the command line parameters have been re-designed for easier use.
System requirements
- OS: Linux x86 64bit system
- Memory: 6GB
- Compiler: g++ 4.3.3 or higher
- Script: Python 2.4.3 or higher
Obtaining & Building MapSplice 2
- You can download the lastest version of MapSplice here. For better compatibility and user's convenience, Bowtie 0.12.7 and SAMtools 0.1.9 are included in the package.
- To build MapSplice, extract compressed file, go to the MapSplice directory, and run "make".
Command Line
Usage
python mapsplice.py [options]* -c <Reference_Sequence> -x <Bowtie_Index> -1 <Read_List1> -2 <Read_List2>
Main Arguments
| -c <string> |
The directory containing the sequence files of reference genome. All sequence files are required to:
|
| -x <string> |
The basename (including directory path) of Bowtie index to be searched. The basename is the name of any of the index files up to but not including the final .1.ebwt / .rev.1.ebwt / etc.
|
| -1 <string> | Comma-separated (no blank space) list of read sequence files in FASTA/FASTQ format. When running with pair-end read, this should contain #1 mates (filename usually includes _1). |
| -2 <string> | Comma-separated (no blank space) list of read sequence files in FASTA/FASTQ format. -2 is only used when running with pair-end read. This should contain #2 mates (filename usually includes _2). Files must be in the same order with those specified in -1. |
Optional arguments
- Input/Output and Performance options
- Alignment options
- Other Options
| -h/--help | Print the usage message |
| -v/--version | Print the version of MapSplice |
Sample Dataset
Coming Soon...
Output of MapSplice 2
- Alignment
By default, read alignments are reported in SAM format to alignments.sam. If --bam is specified, read alignments are reported in BAM format to alignments.bam. Please see the SAM / BAM format specification.
- Normal Splice Junction
Splice junctions are reported to "junctions.txt". Please see the detailed description of all the columns here.
- Insertion
Inserstions are reported to insertions.txt. Please see the detailed description of all the columns here.
- Deletion
Deletions are reported to deletions.txt. Please see the detailed description of all the columns here.
- Fusion alignment
If --fusion | --fusion-non-canonical is specified, fusion alignment are reported to fusion_alignment.txt. Please see the detailed description of all the columns here.
- Fusion Splice junction
If --fusion | --fusion-non-canonical is specified, fusion splice junctions are reported to fusion_junction.txt. Please see the detailed description of all the columns here.
