Sort on 2 columns which are inter

Question

I have a dataframe :

I expect following dataframe :

here sort order is noting but end of nth row must be start+1 of n+1th row.If not found, search for other starts where start is one.

can anyone suggest what combination of sort and group by can I use to convert above dataframe in required format?

zabop · Answer 1 · 2019-02-25T15:07:14.593

0

You could transform the df to a list and then do:

l=[1,10,26,50,6,15,1,5,11,25]
result=[]
for x in range(int(len(l)/2)):
    result.append(sorted([l[2*x],l[2*x+1]])[1])
    result.append(sorted([l[2*x],l[2*x+1]])[0])

This will give you result:

[1, 10, 26, 50, 6, 15, 1, 5, 11, 25]

To transform the original df to list you can do:

startcollist=df['start'].values.tolist()

endcollist=df['end'].values.tolist()

l=[]
for index, each in enumerate(originaldf):
    l.append(each)
    l.append(endcollist[index])

You can then transform result back to a dataframe:

df=pd.DataFrame({'start':result[1::2], 'end':result[0::2]})

Giving the result:

The expression result[1::2] gives every odd element of result, result[0::2] gives every even element. For explanation, see here: https://stackoverflow.com/a/12433705/8565438

edited Feb 25 '19 at 15:07

answered Feb 25 '19 at 14:38

zabop

6,750
3
39
84

Thanks for quick reply! but will this ensure that start,end pairs in original dataframe remain same? – Nikhil Gaikwad Feb 25 '19 at 14:43
Edited the answer, is that satisfying now? – zabop Feb 25 '19 at 14:53
the sort order of the output is not correct.Also I tried your solution for array [26, 50, 1, 10, 6, 15, 1, 5, 11, 25],but didn't work – Nikhil Gaikwad Feb 25 '19 at 15:05
Changed order of lines in the first for loop of my answer, might want to try again. – zabop Feb 25 '19 at 15:08

Sort on 2 columns which are inter

1 Answers1