Hi guys,
I am trying to make some nice images out of my ChIP-seq data. I want to compare regions bound by my TF with known histone marks occupancy sites. To that I wanted to use deeptools tool PlotCorrelation. I have a following error coming up. Any ideas about how to deal with it?
Thank you a lt in advance!
Fatal error: Exit code 1 () Warning. 24397 NaN values were found. They will be removed along with the corresponding bins in other samples for the computation and plotting Traceback (most recent call last): File "/galaxy/main/deps/_conda/envs/mulled-v1-0fa290085c742a3ffee6a142d1fce47c178e1c289ff6f0c66023b506f3377842/bin/plotCorrelation", line 11, in <module> main(args) File "/galaxy/main/deps/_conda/envs/mulled-v1-0fa290085c742a3ffee6a142d1fce47c178e1c289ff6f0c66023b506f3377842/lib/python2.7/site-packages/deeptools/plotCorrelation.py", line 209, in main plot_numbers=args.plotNumbers) File "/galaxy/main/deps/_conda/envs/mulled-v1-0fa290085c742a3ffee6a142d1fce47c178e1c289ff6f0c66023b506f3377842/lib/python2.7/site-packages/deeptools/correlation.py", line 256, in plot_correlation y_var = sch.linkage(corr_matrix, method='complete') File "/galaxy/main/deps/_conda/envs/mulled-v1-0fa290085c742a3ffee6a142d1fce47c178e1c289ff6f0c66023b506f3377842/lib/python2.7/site-packages/scipy/cluster/hierarchy.py", line 676, in linkage raise ValueError("The condensed distance matrix must contain only finite values.") ValueError: The condensed distance matrix must contain only finite values.
Can you share a history containing this and the input matrix with me (my account on usegalaxy.org is ryan@ie-freiburg.mpg.de)? I can then more easily have a look at this.
BTW, I suspect that the input matrix had 24397 (or so) entries and that each of these contained an NaN in it. That's the most likely cause, but then that begs the question regarding how THAT happened.
Davon, thank you so so much! That is amazing. I have just shared the history with you. I was thinking that maybe this has happened because I don't have enough overlapp? I was also trying to perform plotHeatmapand even though the tools worked, the image I obtained doesn't really look ok. I have my ChIP-seq in two different tissue types. To increase the dataset to plot I wanted to use the regions boun in both tissue, but I get an info that some regionsoverlapp with each other and Ineed to remove overlapping regions. Could you maybe also sugest me how can I do that?
Thank you once again for trying to help me. Dorota
oh, and the history I sent from razu@o2.pl account