-
Notifications
You must be signed in to change notification settings - Fork 210
feat: added parser for bionano custom file formats #603
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
src/io/cmap.rs
Outdated
pub struct Record { | ||
id: u32, | ||
len: f64, | ||
num_labels: u32, | ||
label_channel: u8, | ||
labels: Vec<Label>, | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why does a cmap record not have a contig name?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are no contig names in CMAPs; they are identified solely by their integer ID.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see. Where are the contig names stored then?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nowhere. Bionano formats do not have any contig names and do not seem to support them at all.
If you convert a FASTA (with existing contig names) to a CMAP, you could store this information separately and integrate it later on if desired. Note that this will only work if you do not use a reference already in CMAP format, unlike the ones Bionano or FaNDOM provide, so this is not universally applicable.
For OM2DNA, I have always used CMAPs converted from FASTAs though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Then the .fai of the fasta is the right source.
No description provided.