Hello,
A combination of tools from Filter and Sort (Filter), Text Manipulation (Compute), and Join, Subtract and Group (Group) might be able to do this work, but I am fairly certain that the result will not retain all the other information contained in the output (which seems important). Joining back to the original file (by the key used to Group on) would recover multiple data rows per key - this is undesirable and nullifies the filtering steps. That said, some manipulations could be made to combine the columns of the final output into a single composite "key", the same done to the original input, and then a Join performed on that key. This will take several steps, but they could be placed into a Workflow for re-use, so that it behaves "like a tool".
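To make the composite-key idea concrete, here is a minimal sketch in plain Python rather than in Galaxy tools. It assumes the Group step keeps the maximum value per key (the actual operation in your case may differ), and the column layout, sample rows, and function names are all hypothetical.

```python
def group_max(rows, key_col, value_col):
    """Mimic a Group step: for each key, keep only the maximum value.

    Assumption: 'maximum' stands in for whatever operation the Group
    step performs in the real analysis.
    """
    best = {}
    for row in rows:
        key, value = row[key_col], float(row[value_col])
        if key not in best or value > best[key]:
            best[key] = value
    return best


def join_on_composite_key(rows, best, key_col, value_col):
    """Keep full original rows whose (key, value) pair matches the grouped output."""
    wanted = set(best.items())  # composite keys built from the Group output
    return [
        row for row in rows
        if (row[key_col], float(row[value_col])) in wanted
    ]


# Hypothetical input: key, value, plus extra columns that must be retained.
rows = [
    ["geneA", "0.5", "chr1", "note1"],
    ["geneA", "0.9", "chr1", "note2"],
    ["geneB", "0.3", "chr2", "note3"],
]

best = group_max(rows, key_col=0, value_col=1)
kept = join_on_composite_key(rows, best, key_col=0, value_col=1)
```

Joining on the key alone would return both geneA rows; joining on the (key, value) pair returns only the row that survived the grouping, with its extra columns intact. Rows that tie on the grouped value would all be kept, so a further step would be needed if exactly one row per key is required.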
Because this appears to be a unique function and one that others would likely also use, I have opened a tool enhancement request. The dev-team may pick this up, or another group may, or anyone with coding resources and time could contribute it to Galaxy. The requirements I outlined are flexible and build upon existing functions - other methods of implementing this are certainly possible and should be considered. Please feel free to add comments/requirements/ideas: https://github.com/galaxyproject/tools-devteam/issues/260
FileMaker was a very useful tool for me as well many years ago, in particular when working with non-technical scientific users on analysis projects. It was slow, but did the job. For the really large datasets most people are working with now, having the best of those FM functions all in one place (Galaxy!) is better for the most obvious of reasons: no data transfers out and then back in. FM was (is) based loosely on command-line utilities and SQL concepts (with a GUI front-end), and the "best of" should be relatively easy to translate into Galaxy tools using Python or similar.
Sorry we cannot help more, but thank you for the tool idea with a clear example usage!
Jen, Galaxy team