
I am not getting the output I expect after gathering a 4x4 global matrix. I generate the global matrix:

global =  [0           1           2           3
           1           2           3           4
           2           3           4           5
           3           4           5           6 ]

Then I split it into 2x2 sub-matrices and gather them in a 2D topology created using mpi_cart_create. I gather and print the matrix, expecting to get the same matrix back in global_recv. But after gathering, the matrix comes out like this:

global_recv = [ 0           2           2           4
                1           3           3           5
                1           3           3           5
                2           4           4           6]

What changes do I have to make, and what am I missing? I tried mpi_gatherv first, but with no success; I would like to implement the same thing using mpi_gatherv as well. The following is the code:

PROGRAM MAIN
    implicit none
    include "mpif.h"
    integer, parameter:: nx = 4         ! global number of rows 
    integer, parameter:: ny = 4         ! global number of columns                           
    integer, parameter:: Root = 0
    integer global(nx,ny),global_recv(nx,ny)
    integer,allocatable::loc(:,:)       ! local matrix 
    integer rows,cols                   ! rows and columns in local matrix 
    integer,allocatable::counts(:),displs(:)
    integer myid,numprocs
    integer comm2d,ierr,req
    integer sx, ex, sy, ey
    integer dims(2),coord(2)
    logical periods(2)
    integer status(MPI_STATUS_SIZE)
    data periods/2*.false./
    integer i,j
! Initialize mpi
    CALL MPI_INIT( ierr )
    CALL MPI_COMM_RANK(MPI_COMM_WORLD,myid,ierr)
    CALL MPI_COMM_SIZE(MPI_COMM_WORLD,numprocs,ierr)
! Get a new communicator for a decomposition of the domain.  
! Let MPI find a "good" decomposition
    dims(1) = 0
    dims(2) = 0
    CALL MPI_DIMS_CREATE(numprocs,2,dims,ierr)
    CALL MPI_CART_CREATE(MPI_COMM_WORLD,2,dims,periods,.true.,   &
                         comm2d,ierr)
! Get my position in this communicator
    CALL MPI_COMM_RANK(comm2d,myid,ierr)
    if (myid==Root) then
        print *,'dimensions:',dims(1),dims(2)
    endif
! Compute the decomposition
    CALL fnd2ddecomp(comm2d,nx,ny,sx,ex,sy,ey)
    rows = ex-sx+1 
    cols = ey-sy+1  
! Initialize the global matrix
   do i=1,nx
      do j=1,ny
         global(i,j) = (i-1)+(j-1)
      enddo
   enddo
! print global matrix 
   if (myid.EQ.Root) then
      print *, 'Global matrix :'
      do i=1,nx
         print *, global(i,:)
      enddo
   endif
! get local matrix 
   allocate(loc(rows,cols))     
   loc = global(sx:ex,sy:ey)
   print *, myid,loc
! Build counts and displs for mpi_gatherv
   allocate(counts(numprocs),displs(numprocs))
   !dx = rows + (cols-1)*size(M,1);
   displs(1) = 0
   do j=1,numprocs
      counts(j) = rows*cols
      if ((j-1).ne.0) displs(j) = displs(j-1) + counts(j-1)
   enddo

! Receive the results using mpi_gather
!   CALL MPI_GATHERV(loc,cols,MPI_INT,     &   
!                   b,counts,displs,MPI_INT,root,   &
!                   MPI_COMM_WORLD,ierr)         

    CALL MPI_GATHER(loc,rows*cols,MPI_INTEGER, &
                     global_recv,rows*cols,MPI_INTEGER,   &
                     Root, comm2d, ierr)
!      print the results
    if (myid.EQ.Root) then
    print *, 'Global received matrix:'
      do i=1,nx
         print *, global_recv(i,:)
      enddo
    endif
!      Cleanup goes here.
      CALL MPI_COMM_FREE( comm2d, ierr )
      CALL MPI_FINALIZE(ierr)

      STOP
      END
!******************************************************* 
      subroutine fnd2ddecomp(comm2d,nx,ny,sx,ex,sy,ey)
      integer   comm2d
      integer   nx,ny,sx,ex,sy,ey
      integer   dims(2),coords(2),ierr
      logical   periods(2)
! Get (i,j) position of a processor from Cartesian topology.
      CALL MPI_Cart_get(comm2d,2,dims,periods,coords,ierr)
! Decomposition in the first (i.e. X) direction
      CALL MPE_DECOMP1D(nx,dims(1),coords(1),sx,ex)
! Decomposition in the second (i.e. Y) direction
      CALL MPE_DECOMP1D(ny,dims(2),coords(2),sy,ey)
      return
      end
!*********************************************************************
      SUBROUTINE MPE_DECOMP1D(n,numprocs,myid,s,e)
      integer n,numprocs,myid,s,e,nlocal,deficit
      nlocal  = n / numprocs
      s       = myid * nlocal + 1
      deficit = mod(n,numprocs)
      s       = s + min(myid,deficit)
! Give one more slice to processors
      if (myid .lt. deficit) then
          nlocal = nlocal + 1
      endif
      e = s + nlocal - 1
      if (e .gt. n .or. myid .eq. numprocs-1) e = n
      return
      end
  • Notice that the columns of the second matrix correspond to the local matrices of each process. Indeed, `MPI_Gather()` gathers the chunks from the different processes into a large 1D array. It is the same for `MPI_Gatherv()`, except that the chunks can have different lengths. It might be interesting to use a dedicated datatype, such as those produced by `MPI_Type_create_subarray()`. – francis Dec 26 '16 at 18:43
  • @francis I have referred to this link: [link](http://stackoverflow.com/questions/17508647/sending-2d-arrays-in-fortran-with-mpi-gather). Is there no escape from *MPI_Type_create_subarray()*? – Siva Srinivas Kolukula Dec 27 '16 at 04:24
  • The code written from the given link works for 2x2 but fails for 3x2. – Siva Srinivas Kolukula Dec 27 '16 at 04:26
  • You can do something without `MPI_Type_create_subarray()`, but it is likely that you will end up doing something similar. `MPI_Type_create_subarray()` has been designed to communicate subarrays: it seems more a help than a hurdle to me! If each process handles a chunk of a different size (e.g. 3x2 processes for a 4x4 array), it may be a good idea to call `MPI_Type_create_subarray()` for each rank and use `MPI_Send()`/`MPI_Irecv()` for each rank, then a call to `MPI_Waitall()` on the root process. I'll give it a try if I have time to do so! – francis Dec 27 '16 at 10:19
  • I tried earlier with mpi_isend and recv (http://stackoverflow.com/questions/41119403/updating-matrix-in-mpi-fortran). I faced a problem and shifted to gather and scatter. I will give it a try and get back to you. – Siva Srinivas Kolukula Dec 28 '16 at 04:07
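For reference, here is a minimal sketch of the approach suggested in the comments above: describe each 2x2 block with `MPI_Type_create_subarray()`, shrink its extent with `MPI_Type_create_resized()`, and let `MPI_Gatherv()` place the blocks at the right offsets on the root. It assumes exactly four ranks arranged as a 2x2 Cartesian grid over the 4x4 matrix (the same block-to-rank mapping that fnd2ddecomp produces in that case); the program name GATHER_BLOCKS and the parameters bx/by are illustrative and not part of the original code.

PROGRAM GATHER_BLOCKS
    use mpi
    implicit none
    integer, parameter :: nx = 4, ny = 4      ! global matrix size
    integer, parameter :: bx = 2, by = 2      ! block owned by each rank (assumed 2x2 grid)
    integer, parameter :: Root = 0
    integer :: global_recv(nx,ny), loc(bx,by)
    integer :: myid, numprocs, ierr, r, i, j, intsize
    integer :: comm2d, blocktype, resizedtype
    integer :: dims(2), coords(2)
    logical :: periods(2)
    integer(kind=MPI_ADDRESS_KIND) :: lb, extent
    integer, allocatable :: counts(:), displs(:)

    CALL MPI_INIT(ierr)
    CALL MPI_COMM_SIZE(MPI_COMM_WORLD, numprocs, ierr)   ! assumed to be 4
    dims    = (/ 2, 2 /)
    periods = .false.
    CALL MPI_CART_CREATE(MPI_COMM_WORLD, 2, dims, periods, .true., comm2d, ierr)
    CALL MPI_COMM_RANK(comm2d, myid, ierr)
    CALL MPI_CART_COORDS(comm2d, myid, 2, coords, ierr)

! Fill the local block with the values it would hold in the global matrix,
! i.e. global(i,j) = (i-1)+(j-1) restricted to this rank's rows and columns
    do j = 1, by
        do i = 1, bx
            loc(i,j) = (coords(1)*bx + i - 1) + (coords(2)*by + j - 1)
        enddo
    enddo

! Datatype describing one bx x by block inside the nx x ny array ...
    CALL MPI_TYPE_CREATE_SUBARRAY(2, (/ nx, ny /), (/ bx, by /), (/ 0, 0 /), &
                                  MPI_ORDER_FORTRAN, MPI_INTEGER, blocktype, ierr)
! ... resized so that its extent is one integer, which lets the displacements
! below be given in units of single matrix elements
    CALL MPI_TYPE_SIZE(MPI_INTEGER, intsize, ierr)
    lb     = 0
    extent = intsize
    CALL MPI_TYPE_CREATE_RESIZED(blocktype, lb, extent, resizedtype, ierr)
    CALL MPI_TYPE_COMMIT(resizedtype, ierr)

! One block per rank; its displacement is the 0-based column-major offset
! of the block's top-left corner in the global matrix
    allocate(counts(numprocs), displs(numprocs))
    counts = 1
    do r = 0, numprocs-1
        CALL MPI_CART_COORDS(comm2d, r, 2, coords, ierr)
        displs(r+1) = coords(1)*bx + coords(2)*by*nx
    enddo

! Send the local block as bx*by plain integers; the root receives each one
! as a single "resizedtype" placed at the right offset in global_recv
    CALL MPI_GATHERV(loc, bx*by, MPI_INTEGER, &
                     global_recv, counts, displs, resizedtype, &
                     Root, comm2d, ierr)

    if (myid == Root) then
        print *, 'Reassembled matrix:'
        do i = 1, nx
            print *, global_recv(i,:)
        enddo
    endif

    CALL MPI_TYPE_FREE(resizedtype, ierr)
    CALL MPI_TYPE_FREE(blocktype, ierr)
    CALL MPI_COMM_FREE(comm2d, ierr)
    CALL MPI_FINALIZE(ierr)
END PROGRAM GATHER_BLOCKS

For process grids that do not divide the matrix evenly (for example 3x2 ranks for a 4x4 matrix), the blocks have different shapes, so a per-rank subarray type combined with `MPI_Send()`/`MPI_Irecv()` and a final `MPI_Waitall()` on the root, as francis suggests, is a reasonable alternative to a single gather call.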
