This is the pipeline process for 454 sequencing. I am not a biotech guy but after spending time, giving effort and getting help from my collague, I came up with a way to do that. The tools I used are NCBI SRA Toolkit and 454 sequencing tool for multiplexing.
process
Downloaded SRA files from …
Converted these SRA files into SFF format using sff-dump tool
- Rebuilt the scores of converted sff dataset with sfffile tool
- Split the file according to MID groupname
- Calculated the total MID matches for each group
- Extract sequence:
1
* Count total sequence no:
- Combine sff files into one main file
Other useful Commands
- Get the quality scores from the sff file:
- Retrieve the flow intensities:
- View file: