
I am running a remote command with:

import paramiko

ssh = paramiko.SSHClient()
ssh.connect(host)
stdin, stdout, stderr = ssh.exec_command(cmd)

Now I want to get the output. I have seen things like this:

# Wait for the command to finish
while not stdout.channel.exit_status_ready():
    if stdout.channel.recv_ready():
        stdoutLines = stdout.readlines()

But sometimes readlines() never runs, even when there should be data on stdout. That suggests to me that stdout.channel.recv_ready() is not necessarily True as soon as stdout.channel.exit_status_ready() becomes True.

Is something like this appropriate?

# Wait until the data is available
while not stdout.channel.recv_ready():
    pass

stdoutLines = stdout.readlines()

That is, do I really first have to check the exit status before waiting for recv_ready() to say the data is ready?

How would I know if there is supposed to be data on stdout before waiting in an infinite loop for stdout.channel.recv_ready() to become True (which it does not if there is not supposed to be any stdout output)?

David Doria
  • I tried to do the same thing: http://stackoverflow.com/questions/14643861/paramiko-channel-stucks-when-reading-large-ouput. Check out. – vipulb May 07 '14 at 14:01

2 Answers


That is, do I really first have to check the exit status before waiting for recv_ready() to say the data is ready?

No. It is perfectly fine to receive data (e.g. stdout/stderr) from the remote process even though it has not yet finished. Also, some sshd implementations do not even provide the exit status of the remote process, in which case you'll run into problems; see the paramiko doc: exit_status_ready.

The problem with waiting on exit_status_ready() for short-lived remote commands is that your local thread may receive the exit code faster than you check your loop condition. In that case you never enter the loop and readlines() is never called. Here's an example:

# spawns new thread to communicate with remote
# executes whoami which exits pretty fast
stdin, stdout, stderr = ssh.exec_command("whoami") 
time.sleep(5)  # main thread waits 5 seconds
# command already finished, exit code already received
#  and set by the exec_command thread.
# therefore the loop condition is not met 
#  as exit_status_ready() already returns True 
#  (remember, remote command already exited and was handled by a different thread)
while not stdout.channel.exit_status_ready():
    if stdout.channel.recv_ready():
        stdoutLines = stdout.readlines()
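One way to sidestep that race is to flip the loop condition: keep reading while the command has *not* exited or there is still data buffered, so output is drained even when the exit status arrived first. A minimal sketch of that logic; the `FakeChannel` class is a hypothetical stand-in for a live paramiko channel, used here only so the example is self-contained:

```python
class FakeChannel:
    """Hypothetical stub for paramiko's Channel: the remote command has
    already exited, but two chunks of output are still buffered."""
    def __init__(self, chunks):
        self.chunks = list(chunks)

    def exit_status_ready(self):
        return True  # command finished before we started polling

    def recv_ready(self):
        return bool(self.chunks)

    def recv(self, nbytes):
        return self.chunks.pop(0)


def drain(channel):
    # keep reading while the command runs OR data remains buffered
    out = []
    while not channel.exit_status_ready() or channel.recv_ready():
        if channel.recv_ready():
            out.append(channel.recv(4096))
    return b"".join(out)


print(drain(FakeChannel([b"hello ", b"world\n"])))  # b'hello world\n'
```

With a real channel this still busy-waits between chunks; the select()-based version further down avoids that.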

How would I know if there is supposed to be data on stdout before waiting in an infinite loop for stdout.channel.recv_ready() to become True (which it does not if there is not supposed to be any stdout output)?

channel.recv_ready() just indicates that there is unread data in the buffer.

def recv_ready(self):
    """
    Returns true if data is buffered and ready to be read from this
    channel.  A ``False`` result does not mean that the channel has closed;
    it means you may need to wait before more data arrives.
    """

This means that networking effects (delayed packets, retransmissions, ...) or a remote process that simply does not write to stdout/stderr on a regular basis can leave recv_ready() returning False at any given moment. Having recv_ready() as the loop condition may therefore make your code return prematurely: within a single command it is perfectly normal for it to sometimes yield True (the remote process wrote to stdout and your local channel thread received that output) and sometimes yield False (e.g. the remote process is sleeping and not writing to stdout).
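To see the failure concretely, consider a stub channel whose remote process pauses between two bursts of output; looping on recv_ready() alone stops at the first gap even though more output follows. The `GappyChannel` class below is hypothetical, modelling that behaviour:

```python
class GappyChannel:
    """Hypothetical stub: recv_ready() is False between two bursts of
    output, mimicking a remote process that sleeps between writes."""
    def __init__(self):
        # scheduled recv_ready() answers; the middle False is the gap
        self.ready = [True, False, True]
        self.data = [b"first ", b"second\n"]

    def recv_ready(self):
        return self.ready.pop(0) if self.ready else False

    def recv(self, nbytes):
        return self.data.pop(0)


chan = GappyChannel()
out = []
while chan.recv_ready():      # exits at the first gap
    out.append(chan.recv(4096))

print(b"".join(out))          # b'first ' -- b'second\n' is lost
```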

Besides that, people occasionally experience paramiko hangs that may be related to the stdout/stderr buffers filling up (potentially related to the well-known Popen problem, where a process hangs because you never read from its stdout/stderr and the internal buffers fill up).
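The same failure mode is easy to reproduce locally with subprocess, which is why `communicate()` exists: it keeps draining the pipes while waiting, exactly as the paramiko code below keeps draining the channel. A quick local sketch (not paramiko-specific):

```python
import subprocess
import sys

# a child that writes more than a typical pipe buffer (~64 KiB) to stdout;
# waiting on it without reading stdout could deadlock
proc = subprocess.Popen(
    [sys.executable, "-c", "import sys; sys.stdout.write('x' * 200000)"],
    stdout=subprocess.PIPE,
)

# communicate() reads stdout while waiting, so the child never stalls
# on a full pipe the way a bare proc.wait() could
out, _ = proc.communicate()
print(len(out))  # 200000
```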

The code below implements a chunked solution to read from stdout/stderr emptying the buffers while the channel is open.

import select

def myexec(ssh, cmd, timeout, want_exitcode=False):
    # one channel per command
    stdin, stdout, stderr = ssh.exec_command(cmd)
    # stdin/stdout/stderr share a single channel
    channel = stdout.channel

    # we do not need stdin
    stdin.close()
    # indicate that we're not going to write to the channel anymore
    channel.shutdown_write()

    # drain anything already buffered in order to prevent read-block hangs
    # (recv() returns bytes in Python 3; decode if you need text)
    stdout_chunks = []
    stdout_chunks.append(channel.recv(len(channel.in_buffer)))
    # chunked read to prevent stalls
    while not channel.closed or channel.recv_ready() or channel.recv_stderr_ready():
        # stop if the channel was closed prematurely and there is no data in the buffers
        got_chunk = False
        readq, _, _ = select.select([channel], [], [], timeout)
        for c in readq:
            if c.recv_ready():
                stdout_chunks.append(channel.recv(len(c.in_buffer)))
                got_chunk = True
            if c.recv_stderr_ready():
                # make sure to read (here: discard) stderr to prevent a stall
                channel.recv_stderr(len(c.in_stderr_buffer))
                got_chunk = True
        # 1) make sure that there are at least 2 cycles with no data in the
        #    input buffers in order to not exit too early (e.g. cat on a >200k file)
        # 2) if no data arrived in the last loop, check if we already
        #    received the exit code
        # 3) check if the input buffers are empty
        # 4) exit the loop
        if not got_chunk \
                and channel.exit_status_ready() \
                and not channel.recv_stderr_ready() \
                and not channel.recv_ready():
            # indicate that we're not going to read from this channel anymore
            channel.shutdown_read()
            # close the channel
            channel.close()
            break  # exit as the remote side is finished and our buffers are empty

    # close all the pseudofiles
    stdout.close()
    stderr.close()

    if want_exitcode:
        # the exit code is always ready at this point
        return (b''.join(stdout_chunks), channel.recv_exit_status())
    return b''.join(stdout_chunks)

The channel.closed check is just the ultimate exit condition in case the channel closes prematurely. Right after a chunk is read, the code checks whether the exit status was already received and no new data was buffered in the meantime. If new data arrived, or no exit status was received yet, the code keeps trying to read chunks. Once the remote process has exited and there is no new data in the buffers, we assume we have read everything and begin closing the channel. Note that if you want to receive the exit status, you should always wait until it has been received; otherwise paramiko might block forever.

This way it is guaranteed that the buffers do not fill up and make your process hang. myexec only returns once the remote command has exited and there is no data left in our local buffers. The code is also a bit more CPU-friendly by utilizing select() instead of polling in a busy loop, but it might be a bit slower for short-lived commands.

Just for reference, to safeguard against some infinite loops, one can set a channel timeout that fires when no data arrives for a period of time:

chan.settimeout(timeout)
chan.exec_command(command)
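With a timeout set, a blocked recv() raises socket.timeout once the channel has been silent for that long, so the read loop should be prepared to catch it. A small sketch of that handling; `TimeoutChannel` is a hypothetical stub standing in for a timed-out paramiko channel:

```python
import socket


class TimeoutChannel:
    """Hypothetical stub: recv() raises socket.timeout, as a real
    paramiko channel does once chan.settimeout(t) has expired."""
    def recv(self, nbytes):
        raise socket.timeout()


def recv_or_none(channel, nbytes=4096):
    # return None instead of propagating the stall up the stack
    try:
        return channel.recv(nbytes)
    except socket.timeout:
        return None


print(recv_or_none(TimeoutChannel()))  # None
```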
tintin
    Thanks for detailed explanation. It seems I have the same problem. For some reason I get a situation (on the 1st iteration) when exit code is ready, but stdout/stderr are not, so it does not even enter the cycle. Which is very strange -- how is it possible at all? Can you please explain the code a bit more? Why do you duplicate the checks in while & if? While-check is not enough? Also, why do you read before while-cycle? It seems that the same will be done automatically, because recv_ready() will be true on the first iteration, won't it? Also, channel.closed is undocumented, right? – Sergey Vasilyev Nov 13 '15 at 11:28
  • Great answer. Helped me a lot, thanks! Where did you find that Paramiko's Channel object has `in_buffer` data member? I couldn't find it anywhere in the doc. – so.very.tired Sep 09 '16 at 06:07
  • 1
    Afaik it is not documented and probably not intended to be used directly. `Channel.__repr__` also uses it to get the buffers current size (https://github.com/paramiko/paramiko/blob/master/paramiko/channel.py#L144). That said, we've had major issues with stalling ssh sessions using the built-in `exec_command` in our test automation systems (great amount of parallel ssh sessions) and got all of them resolved with this trick. – tintin Sep 14 '16 at 08:52
  • @tintin in your example, if user executes command that contains something like `'sudo -S -p "" ls -l`, then it will be stuck waiting for chunks in `for c in readq` loop due to `-p ""` is present in command being executed. Any idea why it happens, and how to handle it ? – Alex D Jan 31 '19 at 00:17
  • 2
    Why are you using `channel` and `stdout.channel` randomly? Is there any reason for that or is that just leftovers in the code? – alexandernst Jul 02 '19 at 16:19
  • @tintin just a note that error output *is* returned in the `stderr` channel and therefore shouldn't be discarded. – Philip Colmer Jan 28 '21 at 09:02

Add the line below after the ssh.exec_command(cmd). recv_exit_status() blocks until the remote command has finished, so no polling loop is needed (note that a loop like `while recv_exit_status() != 0: sleep(1)` would spin forever whenever the command exits with a non-zero code):

exit_code = stdout.channel.recv_exit_status()  # blocks until the command exits
Sarath Baby