Questions tagged [oom]

39 questions
3
votes
0 answers

Make Spring application crash on OutOfMemoryError

We have a Spring application deployed on Cloud Foundry. This application executes some tasks like calling different endpoints and APIs. Recently we had a problem: the application started to throw OutOfMemoryErrors for Direct buffer memory and could…
Denis
  • 31
  • 2
1
vote
1 answer

K8S OOM kill JDK17 application with huge RSS size

We are running a JDK17 spring-boot application on our production server with the following configuration: JDK vendor: Amazon Corretto (17.0.6); K8S version: 1.17; Max pod memory: 5GB; Min pod memory: 5GB; Xmx: 2GB; Xms: 2GB. The problem we are running…
1
vote
0 answers

Metaspace OOM after repeated hot deployment of Spring Boot war file in Tomcat

I created a very simple Spring Boot application that accesses an Oracle database with one table via JPA. I built a war file and repeatedly deployed and undeployed it in a Tomcat server. After several cycles there is a Metaspace OOM. Environment: a. OS : Ubuntu…
1
vote
1 answer

How can I fix a CUDA Out of Memory Exception while saving a PyTorch model?

I am fine-tuning an LLM. I use a GPU with 15 GB of memory, but when PyTorch saves a checkpoint, the OOM exception happens. The full exception stack is: [image] Can I change the parameter from GPU memory to CPU memory and…
王泽君
  • 11
  • 1
1
vote
2 answers

OOM for no reason (Arch/Raspberry)

My Raspberry Pi 4B is dying every time it does something (for example, when a backup job starts). I'm running Arch Linux (armv7l) on it. The memory usage is always below 15%. Below is the log, including an output from free -hw, which logged 7 seconds…
jswcom
  • 11
  • 2
1
vote
1 answer

How to model (sub)classes with identical attributes and methods?

I am trying to model the following situation in Java, but I suppose it might be a general OOP problem: I have 3 (or maybe more) different classes, which have some attributes and methods in common, and some attributes and methods which are specific…
Flow
  • 11
  • 3
1
vote
0 answers

Find root cause of out-of-memory condition (Fortran on Linux)?

I am currently trying to debug the root cause of an out-of-memory condition in a Fortran-based simulation program, running on Linux, compiled with Intel Fortran. The program is large and has been developed over many years, so I have only partial knowledge of…
kdb
  • 4,098
  • 26
  • 49
1
vote
1 answer

mongod killed by OOM kill: "Memory cgroup out of memory: Kill process 20391 (mongod) score 993 or sacrifice child"

mongod killed by OOM kill: "Memory cgroup out of memory". The machine has 32GB RAM, but the OOM killer kills the mongod process when it has 7GB RAM in use. [11988.010379] Memory cgroup out of memory: Kill process 16392 (mongod) score 994 or sacrifice…
Rea Haas
  • 2,018
  • 1
  • 16
  • 18
1
vote
0 answers

Why does JAX + STAX model take more GPU memory than needed?

I'm trying to run a JAX + STAX model from Kaggle kernels on GPU but it fails due to an out-of-memory error. I've set XLA_PYTHON_CLIENT_PREALLOCATE to false to avoid preallocation of GPU memory and also tried setting XLA_PYTHON_CLIENT_ALLOCATOR to…
Suraja
  • 11
  • 2
0
votes
1 answer

Why does the JVM NOT crash after OOM?

This may be an interesting question to ask. In most cases, we do not want the JVM to crash. But sometimes the 'OOM-ed but not crashed' JVM process keeps running, and may return NULL instead of the correct value. For some scenarios sensitive to…
rufushuang
  • 302
  • 4
  • 17
0
votes
0 answers

heapsnapshot-near-heap-limit creates empty heapdump for docker container

I'm running a Node.js app inside a Docker container in production. My app has a memory leak and the container fails with OOM. I'm trying to automatically create a heapdump using the --heapsnapshot-near-heap-limit Node option. Testing this on my local…
0
votes
0 answers

OOM error while reading .parquet file. How do I solve this?

I am working on an ETL project. For that I am trying to read a .parquet file in order to inspect and transform the data and upload it. I've been failing at that, as I always get an "OOM error" while reading it. Is there some way I could read this…
mdein
  • 1
0
votes
0 answers

redisson-netty- invoked oom-killer

This is the OOM-kill log for Linux: Aug 16 13:37:15 node-172020002225 kernel: redisson-netty- invoked oom-killer: gfp_mask=0x6000c0(GFP_KERNEL), nodemask=(null), order=0, oom_score_adj=-997 Aug 16 13:37:15 node-172020002225 kernel: …
0
votes
1 answer

Reproducible OOM Kill when using the bioinformatics package "GetBaseCountsMultiSample" on WSL2 with Ubuntu 22.04 and native Ubuntu

I just started out running some bioinformatics scripts for variant calling from a friend of mine and I am still very new to this. Unfortunately, I get stuck when I get to the part of the script where I have to use GetBaseCountsMultiSample…
0
votes
1 answer

'CUDA out of memory' when using GPU services for reinforcement learning in the Torch RPC tutorial

I followed this tutorial to implement reinforcement learning with RPC on Torch. Currently, I use one trainer process and one observer process. The trainer process creates the model, and the observer process calls the model's forward pass using RPC. After…
Edisonlu
  • 1
  • 1