Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include ambiguous bases but not Ns #43

Open
GonzaloYebra opened this issue Jun 8, 2021 · 1 comment
Open

Include ambiguous bases but not Ns #43

GonzaloYebra opened this issue Jun 8, 2021 · 1 comment

Comments

@GonzaloYebra
Copy link

Hi! I know there's been a few similar issues raised here but they didn't match quite exactly what I'm looking for...

My question is, would there be any way to run snp-dists including ambiguous bases in the calculation while disregarding Ns? Basically a hybrid between the default and the -a options.

In my case, I wouldn't mind what specific ambiguous base is found, I'd like to count them all as different to ATCG.

Any ideas?

Thanks a lot!

Gonzalo

@tseemann
Copy link
Owner

tseemann commented Jul 11, 2021

@GonzaloYebra
Normally a match is +1 and a mismatch is 0.
How do you want the ambiguous IUPAC codes measured?

A vs R = 0
C vs R = 0
A vs N = 0
R vs R = 1 <---- ?
N vs R = 0

Is that correct?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants