Hello,
I am trying to filter a mmseqs database generated from mmseqs search using mmseqs filterdb command based on percent identity, e-value, aligned length and query coverage prior to piping this into another mmseqs analysis. I have used the following to filter for hit-read pairs with at least 80% identity:
mmseqs filterdb ./results_db ./filtered_step1 --filter-expression '$3 >= 0.80' --threads 24
However, I cannot "find" the columns for e-value, aligned length, and query coverage. I have tried using --filter-file to only select specific hit-read pairs based on manual fitler but cannot see to format this correctly.
Any insight? Is there a better way to do this or is this outside of the scope of mmseqs2's capabilities?
Thanks
Hello,
I am trying to filter a mmseqs database generated from
mmseqs searchusingmmseqs filterdbcommand based on percent identity, e-value, aligned length and query coverage prior to piping this into another mmseqs analysis. I have used the following to filter for hit-read pairs with at least 80% identity:mmseqs filterdb ./results_db ./filtered_step1 --filter-expression '$3 >= 0.80' --threads 24However, I cannot "find" the columns for e-value, aligned length, and query coverage. I have tried using
--filter-fileto only select specific hit-read pairs based on manual fitler but cannot see to format this correctly.Any insight? Is there a better way to do this or is this outside of the scope of mmseqs2's capabilities?
Thanks