Dear Dr. Hiller,
Thank you so much for providing all the resources. I am sorry I still get some issues when applying the provided script.
I am currently preparing the percent identity files for the ForwardGenomics input. I run the command as:
GetGlobalAndLocalPercentID.perl alignment_for_an_element.BDB -treeFile full_species_tree -allowedAncestralNodes my_nodes -requireNoOutgroup -local -global
I have successfully got the global %ID result file, but failed in getting the local %ID result. The error message is like:
ERROR: do not find node XXX in the alignment.
after checking the input and output, I understand that the error was caused because some species are missing in the alignment, which is very normal in my dataset. Those missing species are assigned "NA" values in the global %ID result, but cause errors in local %ID calculation. I wonder if I should simply trim the input tree, so only species exist in the alignment are included (and probably solve the error)? But in this case each element will probably have different number of nodes in the local %ID result file. I don't know if this will cause problems in the formal ForwardGenomics analysis.
Alternatively, maybe I can go back to previous prank alignment step, and try to fill in all missing species in the alignment by inference (not sure if this makes sense and how)?
Thank you for any input.
Best wishes,
Hongxin
Dear Dr. Hiller,
Thank you so much for providing all the resources. I am sorry I still get some issues when applying the provided script.
I am currently preparing the percent identity files for the ForwardGenomics input. I run the command as:
GetGlobalAndLocalPercentID.perl alignment_for_an_element.BDB -treeFile full_species_tree -allowedAncestralNodes my_nodes -requireNoOutgroup -local -global
I have successfully got the global %ID result file, but failed in getting the local %ID result. The error message is like:
ERROR: do not find node XXX in the alignment.
after checking the input and output, I understand that the error was caused because some species are missing in the alignment, which is very normal in my dataset. Those missing species are assigned "NA" values in the global %ID result, but cause errors in local %ID calculation. I wonder if I should simply trim the input tree, so only species exist in the alignment are included (and probably solve the error)? But in this case each element will probably have different number of nodes in the local %ID result file. I don't know if this will cause problems in the formal ForwardGenomics analysis.
Alternatively, maybe I can go back to previous prank alignment step, and try to fill in all missing species in the alignment by inference (not sure if this makes sense and how)?
Thank you for any input.
Best wishes,
Hongxin