Miniproject to be done: a script to show a biodiversity of metagenomic sample on a taxonomic tree.
There are quite a few options to visualize phylogenetic trees in a programmatic way. See summary by Kat Holt. Out of three (four) ways mentioned E.T.E Python library looks as a best starting point. E.T.E original paper has been cited 31 times, many times by other software packages.
Webpage seems to be abandoned, but Google Group is active and it seems the package in under continous development (see this thread, as a example ).
While classic biodiversity analysis in metagenomic studies doesn’t involve making real phylogenetic trees (well, it often does, but given small length of sequences in the alignment, resulting tree is not that accurate in more general terms), I have some ongoing projects which would require large scale visualization and analysis of trees. Coupling E.T.E-based script with Bio.Phylo might be interesting path to explore.
BTW, to parse results from mothur I would need to rewrite Pierre’s script to Python.