How can I get a previous value with some condition in a dataframe in pandas

Question

Hello, I'm trying to get the previous value in a specific column if that value contains "-".

This is my code:

count=-1
for i, row in df1.iterrows():
    count=count + 1
    if row["SUBCAPITULO"]== " "and count>0 and "-" in df1.loc[count-1:"SUBCAPITULO"]:
        row["SUBCAPITULO"]= df1.loc[count-1:"SUBCAPITULO"]

Can you update your post with a sample of `df1` please? The output of `df1.head()` should be sufficient. — Corralien, Aug 04 '21 at 12:48
@woblob. That is not directly possible here because there is a second condition. Using shift is probably more appropriate. — Corralien, Aug 04 '21 at 12:52
Take a while to read [How to make good reproducible pandas examples](https://stackoverflow.com/questions/20109391/how-to-make-good-reproducible-pandas-examples) — Corralien, Aug 04 '21 at 13:03

score 0 · Answer 1 · answered Aug 04 '21 at 13:01

Use shift:

Sample:

>>> df
  SUBCAPITULO
0       dash-
1
2      comma,
3
4        dot.

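# Name the target column so only SUBCAPITULO is assigned; na=False handles the NaN from shift().
df.loc[(df['SUBCAPITULO'] == ' ') &
       (df['SUBCAPITULO'].shift().str.contains('-', na=False)),
       'SUBCAPITULO'] = df['SUBCAPITULO'].shift()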

>>> df
  SUBCAPITULO
0       dash-
1       dash-
2      comma,
3
4        dot.
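
For a frame with more than one column, a minimal self-contained sketch of the same shift idea might look like the following. The `Other_data` column and the sample values are assumptions added for illustration. The assignment names the target column explicitly, which may avoid the length-mismatch `ValueError` reported in the comments below.

```python
import pandas as pd

# Hypothetical sample data; 'Other_data' is added only to make the frame multi-column.
df = pd.DataFrame({
    'Other_data': ['a', 'b', 'c', 'd', 'e'],
    'SUBCAPITULO': ['dash-', ' ', 'comma,', ' ', 'dot.'],
})

# Value from the row above (NaN for the first row).
prev = df['SUBCAPITULO'].shift()

# Blank rows whose previous row contains a hyphen; na=False treats the NaN as "no match".
mask = (df['SUBCAPITULO'] == ' ') & prev.str.contains('-', regex=False, na=False)

# Naming the column keeps both sides of the assignment one-dimensional.
df.loc[mask, 'SUBCAPITULO'] = prev[mask]

print(df['SUBCAPITULO'].tolist())
# ['dash-', 'dash-', 'comma,', ' ', 'dot.']
```

Note that this only looks one row back, so a run of several blank rows would have just its first row filled.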

answered Aug 04 '21 at 13:01

Corralien

I think OP wants to fill the `dash-` value over multiple lines. – not_speshal Aug 04 '21 at 13:04
@Corralien Hi, I tried your code but it is not working: ValueError: Must have equal len keys and value when setting with an iterable – Diego González Castellanos Aug 04 '21 at 13:12
Explain why, please? – Corralien Aug 04 '21 at 13:13
@Corralien Look, I want to replace the empty spaces if the previous row has "-" – Diego González Castellanos Aug 04 '21 at 13:14

score 0 · Answer 2 · answered Aug 04 '21 at 14:17

Desired output has not been posted and the request is unclear.

Taking these comments into account:

trying to get the previous row value in a specific column if this value contains a dash
I think OP wants to fill the dash- value over multiple lines

Maybe this helps...

import pandas as pd

df = pd.read_csv('test.csv')

print(df, '\n\n')

'''
Shows:

  Other_data       SUBCAPITULO
0       qwer               NaN
1       vfds               NaN
2       sdfg  1.01 – TORRE – 1
3       hfgt               NaN
4       jkiu          capitulo
5       bvcd  2.01 – TORRE – 1
6       grnc               NaN
7       sdfg          capitulo
8       poij               NaN
9       fghg  2.01 – TORRE – 1 

'''

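# Walk the rows bottom-up so a value can spread upward over several NaN rows.
for i in reversed(df.index):
    if i >= 1:
        # Note: the sample data uses an en dash ('–'), not a hyphen ('-').
        if '–' in str(df.loc[i, 'SUBCAPITULO']):
            if pd.isna(df.loc[i-1, 'SUBCAPITULO']):
                df.loc[i-1, 'SUBCAPITULO'] = df.loc[i, 'SUBCAPITULO']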

print(df)


'''
Shows:

  Other_data       SUBCAPITULO
0       qwer  1.01 – TORRE – 1
1       vfds  1.01 – TORRE – 1
2       sdfg  1.01 – TORRE – 1
3       hfgt               NaN
4       jkiu          capitulo
5       bvcd  2.01 – TORRE – 1
6       grnc               NaN
7       sdfg          capitulo
8       poij  2.01 – TORRE – 1
9       fghg  2.01 – TORRE – 1

'''

print('\n')
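
The same result can probably be reached without an explicit loop. The sketch below is one possible vectorized equivalent, not the answer's own method: it back-fills the column and keeps the filled value only where the next non-missing value contains an en dash. The data is built inline from the printed sample rather than read from `test.csv`, and the names `col`, `nxt`, and `mask` are illustrative.

```python
import numpy as np
import pandas as pd

# Sample values copied from the printed frame; built inline instead of reading test.csv.
df = pd.DataFrame({
    'SUBCAPITULO': [np.nan, np.nan, '1.01 – TORRE – 1', np.nan, 'capitulo',
                    '2.01 – TORRE – 1', np.nan, 'capitulo', np.nan, '2.01 – TORRE – 1'],
})

col = df['SUBCAPITULO']
nxt = col.bfill()  # next non-missing value at or below each row

# Fill only the missing rows whose next non-missing value contains an en dash.
mask = col.isna() & nxt.str.contains('–', regex=False, na=False)
df.loc[mask, 'SUBCAPITULO'] = nxt[mask]

print(df)
```

Rows 3 and 6 stay missing because the next value below them is `capitulo`, which matches the loop's output.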

score 0 · Answer 3 · answered Aug 04 '21 at 17:05

df1.reset_index(drop=True, inplace=True)
for i in range(1, len(df1)):
    if df1.loc[i, 'SUBCAPITULO'] == " " and "-" in df1.loc[i-1, 'SUBCAPITULO']:
        df1.loc[i, 'SUBCAPITULO'] = df1.loc[i-1, 'SUBCAPITULO']

df1.dropna(inplace=True)

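The problem was that I had to reset the index before the loop. `.loc` looks rows up by label, so `i-1` is only guaranteed to be the previous row when the index is the default 0, 1, 2, ...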
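
As an alternative that does not depend on the index labels, here is a hedged sketch of a vectorized version. It hides the blank cells, forward-fills the last real value, and writes it back only where that value contains a hyphen. The sample data and the non-default index are assumptions for illustration.

```python
import pandas as pd

# Hypothetical data with a non-default index, e.g. after rows were filtered out earlier.
df1 = pd.DataFrame(
    {'SUBCAPITULO': ['1.01 - TORRE', ' ', ' ', 'capitulo', ' ']},
    index=[10, 20, 30, 40, 50],
)

col = df1['SUBCAPITULO']

# Hide the blanks, then carry the last real value downward.
last_value = col.mask(col == ' ').ffill()

# Blank rows whose most recent real value contains a hyphen.
mask = (col == ' ') & last_value.str.contains('-', regex=False, na=False)
df1.loc[mask, 'SUBCAPITULO'] = last_value[mask]

print(df1['SUBCAPITULO'].tolist())
# ['1.01 - TORRE', '1.01 - TORRE', '1.01 - TORRE', 'capitulo', ' ']
```

Because the fill is positional, no `reset_index` call is needed here, and a run of consecutive blank rows is filled in one pass.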
