Pandas: np.where with multiple conditions on dataframes

swyx picture swyx · Dec 31, 2014 · Viewed 22.2k times · Source

hi folks i have look all over SO and google and cant find anything similar...

I have a dataframe x (essentially consisting of one row and 300 columns) and another dataframe y with same size but different data. I would like to modify x such that it is 0 if it has a different sign to y AND x itself is not 0, else leave it as it is. so this requires the use of np.where with multiple conditions. However the multiple condition examples i've seen all use scalars, and when i use the same syntax it does not seem to work (ends up setting -everything- to zero, no error). i'm worried about assign-by-reference issues hidden somewhere or other (y is x after shifting but as far as i can tell there is no upstream issue above this code) any ideas?

the code i am trying to debug is:

tradesmade[i:i+1] = np.where((sign(x) != sign(y)) & (sign(x) != 0), 0, x) 

which just returns a bunch of zeros. I have also tried

tradesmade[i:i+1][(sign(x) != sign(y)) * (sign(x) != 0)] = 0

but this does not seem to work either. I have been at this for hours and am at a total loss. please help!

Answer

gboffi picture gboffi · Dec 31, 2014

It is not clear to me what you exactly want to do when a y element is equal to zero... anyway the key point in this answer is "use np.logical_{and,not,or,xor} functions".

I think that the following, albeit formulated differently from your example, does what you want, but if I'm wrong you should be able to combine different tests to achieve what you want,

x = np.where(np.logical_or(x*y>0, y==0), x, 0)