Questions tagged [numpy-memmap]

An advanced numpy.memmap() utility for escaping RAM-size limits and reducing the final RAM footprint, at the reasonable cost of O/S-cached file I/O mediated via a small in-RAM proxy-view window into the whole array data.

Creates and handles a memory-map to an array stored in a binary file on disk.

Memory-mapped files arrange access to large arrays that do not fit in RAM through small proxy segments of an O/S-cached region of otherwise unmanageably large data files.

Leaving most of the data on disk, rather than reading the entire file into RAM, and working with it through a smart, moving, O/S-cached window view into the large on-disk file makes it possible to escape both O/S RAM limits and an adverse side effect of Python's memory management: its painful reluctance to release once-allocated memory blocks at any time before the Python program terminates.

numpy's memmaps are array-like objects.

This differs from Python's mmap module, which uses file-like objects.
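
A minimal sketch of the core API (the file name and shape here are illustrative):

    import numpy as np

    # Create a memory-mapped array backed by a binary file on disk.
    # mode='w+' creates (or truncates) the file; only the pages actually
    # touched are cached in RAM by the O/S.
    fp = np.memmap('data.bin', dtype='float32', mode='w+', shape=(10000, 162))

    fp[:100, :] = 1.0     # writes go through the mapping
    fp.flush()            # push dirty pages to disk
    del fp                # release the mapping

    # Re-open the same file read-only; nothing is loaded up front.
    ro = np.memmap('data.bin', dtype='float32', mode='r', shape=(10000, 162))
    print(ro[:5, :3])     # touching a slice pages in only that region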

101 questions
24 votes · 2 answers

Can memmap a pandas Series. What about a DataFrame?

It seems that I can memmap the underlying data for a pandas Series by creating an mmap'd ndarray and using it to initialize the Series. def assert_readonly(iloc): try: iloc[0] = 999 # Should be non-editable …
user48956
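
A sketch of the approach the question describes, assuming pandas honors copy=False for a 1-D ndarray (whether the buffer stays zero-copy is version-dependent, so verify before relying on it):

    import numpy as np
    import pandas as pd

    # Back an ndarray with a read-only memory map (file name illustrative).
    mm = np.memmap('series.bin', dtype='float64', mode='r', shape=(1000000,))

    # Hand it to a Series without copying; heuristic check that the
    # buffer is shared rather than duplicated.
    s = pd.Series(mm, copy=False)
    print(s.to_numpy().base is mm or s.to_numpy() is mm)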
7 votes · 2 answers

numpy memmap memory usage - want to iterate once

Let's say I have some big matrix saved on disk. Storing it all in memory is not really feasible, so I use memmap to access it: A = np.memmap(filename, dtype='float32', mode='r', shape=(3000000,162)) Now let's say I want to iterate over this matrix (not…
user2717954
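
For a single pass like this, reading the memmap in contiguous row blocks keeps the resident set small; a sketch (the block size is an arbitrary assumption):

    import numpy as np

    A = np.memmap('matrix.bin', dtype='float32', mode='r', shape=(3000000, 162))

    block = 10000                          # rows per chunk; tune to available RAM
    total = np.zeros(A.shape[1], dtype=np.float64)

    for start in range(0, A.shape[0], block):
        chunk = np.array(A[start:start + block])     # copy one block into RAM
        total += chunk.sum(axis=0, dtype=np.float64)
        # chunk goes out of scope, so the O/S may evict its pages

    print(total / A.shape[0])              # column means from one pass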
7 votes · 1 answer

numpy mean is larger than max for memmap

I have an array of timestamps, increasing for each row in the 2nd column of matrix X. I calculate the mean value of the timestamps and it's larger than the max value. I'm using a numpy memmap for storage. Why is this happening? >>>…
siamii
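
Assuming the timestamps are stored as float32, a common cause is that np.mean accumulates in the array's own dtype by default, so large values can lose precision; requesting a float64 accumulator is the usual fix:

    import numpy as np

    X = np.memmap('ts.bin', dtype='float32', mode='r', shape=(10000000, 2))

    print(X[:, 1].mean())                  # float32 accumulation may drift badly
    print(X[:, 1].mean(dtype=np.float64))  # accumulate in float64 instead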
6 votes · 1 answer

Do xarray or dask really support memory-mapping?

In my experimentation so far, I've tried: xr.open_dataset with chunks arg, and it loads the data into memory. Set up a NetCDF4DataStore, and call ds['field'].values and it loads the data into memory. Set up a ScipyDataStore with mmap='r', and…
5 votes · 0 answers

Efficient way of using numpy memmap when training neural network with pytorch

I'm training a neural network on a database of images. My images are of full HD (1920 x 1080) resolution, but for training, I use random crops of size 256x256. Since reading the full image and then cropping is not efficient, I'm using numpy memmap…
Nagabhushan S N
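
One workable pattern, sketched under assumptions (frames stored as a single uint8 array of shape (N, 1080, 1920, 3); names are illustrative): open the memmap lazily so each DataLoader worker gets its own handle, and copy out only the crop.

    import numpy as np
    import torch
    from torch.utils.data import Dataset, DataLoader

    class CropDataset(Dataset):
        def __init__(self, path, n, h=1080, w=1920, c=3, crop=256):
            self.path, self.shape, self.crop = path, (n, h, w, c), crop
            self.mm = None                 # opened lazily, once per worker

        def __len__(self):
            return self.shape[0]

        def __getitem__(self, idx):
            if self.mm is None:            # don't share one map across workers
                self.mm = np.memmap(self.path, dtype='uint8', mode='r',
                                    shape=self.shape)
            _, h, w, _ = self.shape
            y = np.random.randint(0, h - self.crop + 1)
            x = np.random.randint(0, w - self.crop + 1)
            patch = np.array(self.mm[idx, y:y + self.crop, x:x + self.crop])
            return torch.from_numpy(patch).permute(2, 0, 1).float() / 255.0

    loader = DataLoader(CropDataset('frames.bin', n=1000),
                        batch_size=16, num_workers=4)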
5 votes · 2 answers

How to read a large text file avoiding reading line-by-line :: Python

I have a large data file of shape (N, 4) which I am mapping line-by-line. My files are 10 GB; a simplistic implementation is given below. Though the following works, it takes a huge amount of time. I would like to implement this logic such that the text file…
nuki
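
One common answer is to pay the text-parsing cost once, stream the rows into a binary .npy, and memmap that on every later run; a sketch (names and dtype are assumptions):

    import numpy as np

    src, dst, cols = 'big.txt', 'big.npy', 4

    # Pass 1: count rows so the output can be preallocated on disk.
    with open(src) as f:
        n_rows = sum(1 for _ in f)

    # Pass 2: stream-parse into a memmapped .npy; RAM use stays flat.
    out = np.lib.format.open_memmap(dst, mode='w+', dtype='float64',
                                    shape=(n_rows, cols))
    with open(src) as f:
        for i, line in enumerate(f):
            out[i] = [float(v) for v in line.split()]
    out.flush()

    # Every later run skips text parsing entirely:
    A = np.load(dst, mmap_mode='r')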
5 votes · 0 answers

Caching a data frame in joblib

Joblib has functionality for sharing Numpy arrays across processes by automatically memmapping the array. However, this makes use of Numpy-specific facilities. Pandas does use Numpy under the hood, but unless your columns all have the same data type,…
shadowtalker
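
For context, joblib's auto-memmapping applies to plain numpy arrays passed into workers (the threshold below is deliberately tiny for illustration); a DataFrame would first have to be decomposed into such arrays:

    import numpy as np
    from joblib import Parallel, delayed

    big = np.random.rand(2000000)

    def probe(a):
        # Inside the worker the argument arrives as a read-only np.memmap.
        return type(a).__name__, float(a.mean())

    # Arrays larger than max_nbytes are dumped to a temp file and
    # memmapped into each worker instead of being pickled wholesale.
    out = Parallel(n_jobs=2, max_nbytes='1M', mmap_mode='r')(
        delayed(probe)(big) for _ in range(2))
    print(out)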
4 votes · 1 answer

Is it possible to save boolean numpy arrays on disk as 1 bit per element with memmap support?

Is it possible to save numpy arrays on disk in boolean format where it takes only 1 bit per element? This answer suggests to use packbits and unpackbits, however from the documentation, it seems that this may not support memory mapping. Is there a…
Nagabhushan S N
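
numpy has no 1-bit dtype, but the packed uint8 bytes produced by np.packbits can themselves be memmapped, and only the byte range covering a wanted slice needs unpacking; a sketch:

    import numpy as np

    bits = np.random.rand(1000000) > 0.5       # booleans to store

    np.packbits(bits).tofile('bits.bin')       # 8 booleans per byte on disk

    mm = np.memmap('bits.bin', dtype=np.uint8, mode='r')
    start, stop = 1000, 2000                   # element range wanted
    lo, hi = start // 8, (stop + 7) // 8       # enclosing byte range
    window = np.unpackbits(np.array(mm[lo:hi]))[start - lo * 8:stop - lo * 8]
    assert np.array_equal(window, bits[start:stop].astype(np.uint8))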
4 votes · 3 answers

numpy.memmap reports not enough memory while plenty is available

During a typical call to numpy.memmap() on a 64-bit Windows machine, Python raises the following error: OSError: [WinError 8] Not enough memory resources are available to process this command. A different Windows machine raises the same error with a…
auzn
4 votes · 0 answers

Numpy Memmap WinError8

My first StackOverflow message after 6 years of great experience using this site. Thank you all for all the great help you have offered to me and to others. This problem, however, baffles me completely and I would like to ask for assistance…
4 votes · 1 answer

Numpy Memmap Ctypes Access

I'm trying to use a very large numpy array using numpy memmap, accessing each element as a ctypes Structure. class My_Structure(Structure): _fields_ = [('field1', c_uint32, 3), ('field2', c_uint32, 2), ('field3',…
sheridp
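
One pattern, sketched under assumptions (the record layout is hypothetical and the file must already hold whole records): map the file as raw bytes and overlay the Structure on one record's bytes with from_buffer, which requires a writable mapping.

    import ctypes
    import numpy as np

    class MyStructure(ctypes.Structure):       # hypothetical bit-field layout
        _fields_ = [('field1', ctypes.c_uint32, 3),
                    ('field2', ctypes.c_uint32, 2),
                    ('field3', ctypes.c_uint32, 27)]

    rec = ctypes.sizeof(MyStructure)           # 4 bytes for this layout
    mm = np.memmap('records.bin', dtype=np.uint8, mode='r+')

    def record(i):
        # Overlay the Structure on record i in place; field assignments
        # write straight through to the mapped file.
        return MyStructure.from_buffer(mm, i * rec)

    r = record(0)
    r.field2 = 3
    mm.flush()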
4 votes · 1 answer

I can't remove a file created by memmap

I can't remove a file created by the numpy.memmap function. class MyClass: def __init__(self): self.fp = np.memmap(filename, dtype='float32', mode='w+', shape=flushed_chunk_shape) ... def __del__(self): del self.fp os.remove(filename) When I…
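
For reference, the mapping must be fully released before the file can be unlinked (Windows refuses to delete a file that is still mapped); a sketch of one explicit teardown order — note that _mmap is a private numpy attribute, so this is a pragmatic workaround rather than a documented API:

    import os
    import numpy as np

    filename = 'scratch.bin'                   # illustrative name
    fp = np.memmap(filename, dtype='float32', mode='w+', shape=(1000, 4))
    fp[:] = 1.0

    fp.flush()             # push dirty pages to disk
    fp._mmap.close()       # close the OS-level mapping (private attribute)
    del fp                 # drop the last reference before unlinking
    os.remove(filename)    # now succeeds, even on Windows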
4 votes · 2 answers

Packing a boolean array needs to go through int (numpy 1.8.2)

I'm looking for a more compact way to store booleans. numpy internally needs 8 bits to store one boolean, but np.packbits allows packing them, which is pretty cool. The problem is that to pack a 32e6-byte array of booleans into a 4e6-byte array we need…
user3313834
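
If packing forces a large temporary (as the question reports for numpy 1.8.2), packing slice by slice into a preallocated output bounds the intermediate; a sketch with sizes chosen to divide evenly:

    import numpy as np

    bools = np.random.rand(32000000) > 0.5              # 32e6 one-byte booleans
    packed = np.empty(bools.size // 8, dtype=np.uint8)  # 4e6 packed bytes

    step = 8000000                             # bools per slice; multiple of 8
    for i in range(0, bools.size, step):
        packed[i // 8:(i + step) // 8] = np.packbits(bools[i:i + step])

    assert np.array_equal(np.unpackbits(packed), bools.astype(np.uint8))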
3 votes · 0 answers

Does an ndarray have a buffer which is mmap?

How to tell if an ndarray has a buffer which is mmap? I want to tell apart x and y. import numpy as np import mmap with open("f.dat", "wb+") as f: f.seek(np.dtype(float).itemsize - 1, 0) f.write(b'\0') f.seek(0, 0) mm =…
slitvinov
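
A best-effort heuristic (not bulletproof, since the .base chain depends on how the array was built): check for np.memmap itself, then walk arr.base looking for an mmap.mmap object:

    import mmap
    import numpy as np

    def is_mmap_backed(arr):
        # Walk the base chain; frombuffer(mm) arrays keep the mmap as .base.
        if isinstance(arr, np.memmap):
            return True
        base = arr.base
        while base is not None:
            if isinstance(base, (mmap.mmap, np.memmap)):
                return True
            base = getattr(base, 'base', None)
        return False

    x = np.memmap('f.dat', dtype=float, mode='w+', shape=(1,))
    y = np.zeros(1)
    print(is_mmap_backed(x), is_mmap_backed(x[:]), is_mmap_backed(y))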
3 votes · 0 answers

Numpy memmap throttles with Pytorch Dataloader when available RAM is less than file size

I'm working on a dataset that is too big to fit into RAM. The solution I'm trying currently is to use numpy memmap to load one sample/row at a time using Dataloader. The solution looks something like this: class MMDataset(torch.utils.data.Dataset): …
Kevin